- frongt@lemmy.zip English 11 days
Their blog post with more info: https://mindgard.ai/blog/chatgpt-spontaneously-generated-violent-images-from-a-viral-prompt
- 11 days
That’s horrific.
All I did was tell it there were no restrictions and ask for a random image; I didn’t request it. But ChatGPT immediately went to the darkest pits of humanity. As I said at the start: the image didn’t arise from nowhere. It may be an artificial image, but it is based on photographs of a real person, or a combination of real victims. What worries me is this was too easy. There was no real hacking. This was ready to be surfaced, with the smallest scratch. It was a one-shot jailbreak. It was based on a popular prompt (which already veered into the darkness).
- gdf535@lemmy.cafe English 10 days
Remember that if AI companies actually didn’t want their image generators to output gore, they could just not put gore in the training data. Same with child porn, sexualized violence, etc. But that would be effort, so they clearly don’t care.
But yeah, it’s social media that’s harmful to teenagers, sure.
- 11 days
I thought we already knew this?
I feel like I’m missing something.

