Document details

Fake friend: How ChatGPT betrays vulnerable teens by encouraging dangerous behaviour

Contains glossary p. 47

"ChatGPT is easily accessed by children without age restrictions or parental controls: ChatGPT says users must be at least 13 to sign up and have parental consent if aged under 18. However, ChatGPT does not verify users’ ages or record parental consent.
ChatGPT generates harmful content within minutes of registering an account: Researchers created ChatGPT accounts for three 13-year-old personas, themed around mental health, eating disorders and substance abuse. They screen-recorded structured conversations with ChatGPT on these themes of up to two hours, guided by a predetermined list of 20 prompts. Where ChatGPT refused to answer a prompt, its refusals were easily sidestepped by claiming requests were “for a friend” or “for a presentation”.
ChatGPT generated harmful responses to 53% of prompts: Researchers sent prompts about mental health, eating disorders and substance abuse to the ChatGPT API multiple times to test its safety at scale. They found that 638 of 1,200 responses from ChatGPT (53%) were harmful. They also found that 297 out of 638 harmful responses (47%) contained follow-up suggestions, some encouraging further engagement on harmful topics.
OpenAI and policy makers [must] ensure the safety of children using AI chatbots: OpenAI must enforce its own policies to prevent harmful content generation and unauthorized use by children. AI chatbots like ChatGPT must be [in] scope of online safety laws, mandating transparency and auditable risk reporting." (Key findings, page 7)
3 Teen AI Use, 8
4 Research Methodology, 9
5 Testing Age and Parental Controls, 11
6 Testing ChatGPT’s Safety, 12
6.1 Case Study: Bridget and Self-harm and Suicide, 13
6.2 Case Study: Sophie and Eating Disorders, 22
6.3 Case Study: Brad and Substance Abuse, 29
6.4 Over half of ChatGPT’s responses to 1,200 prompts were harmful, 35
7 ChatGPT Safeties Easily Sidestepped, 36
8 ChatGPT Follow-ups Encourage Harm, 37
9 ChatGPT Policies, 38
10 Future Risks, 39
11 Recommendations, 40