ChatGPT can be made to generate sexualised and violent images, researchers find

frongt@lemmy.zip · 11 days

Their blog post with more info https://mindgard.ai/blog/chatgpt-spontaneously-generated-violent-images-from-a-viral-prompt

Australis13@fedia.io · 11 days

That’s horrific.

All I did was tell it there were no restrictions and ask for a random image; I didn’t request it. But ChatGPT immediately went to the darkest pits of humanity. As I said at the start: the image didn’t arise from nowhere. It may be an artificial image, but it is based on photographs of a real person, or a combination of real victims. What worries me is this was too easy. There was no real hacking. This was ready to be surfaced, with the smallest scratch. It was a one-shot jailbreak. It was based on a popular prompt (which already veered into the darkness).

gdf535@lemmy.cafe · 10 days

Remember that if AI companies actually didn’t want their image generators to output gore, they could just not put gore in the training data. Same with child porn, sexualized violence, etc. But that would be effort, so they clearly don’t care.

But yeah, it’s social media that’s harmful to teenagers, sure.

WolfmanEightySix@piefed.social · 11 days

I thought we already knew this?

I feel like I’m missing something.

ChatGPT can be made to generate sexualised and violent images, researchers find

OpenAI works to stop ChatGPT generating 'sex crime scene' images