Meta Contractors Pretended to Be Kids to Feed Toxic Prompts to Rival Chatbots
Meta deployed hundreds of contractors to pose as minors and probe competitor chatbots like Gemini and ChatGPT with high-risk subjects. The covert benchmarking operation utilized fake teenage profiles to test responses on suicide, drugs, and self-harm.
June 30, 2026Clash Report
Mark Zuckerberg testifies before a joint Senate Judiciary Committee, April 12, 2018 - Reuters
Hundreds of contractors working for Meta systematically posed as minors online to probe how competitor chatbots handled high-risk prompts concerning suicide, sex, and illegal drugs.
The operation targeted artificial intelligence systems including OpenAI’s ChatGPT, Google’s Gemini, and Character.AI.
Managed by Meta contractor Covalen, the effort was active as recently as April 21 under the internal codename Cannes.
Workers were instructed to create dummy under-18 accounts using disposable email addresses and shared passwords.
Contractors then fed written prompts and images to rival chatbots before logging the responses into detailed spreadsheets.
High-Risk Prompts
The testing materials included images of pills, knives, nooses, and medical diagrams.
Prompts were deliberately engineered to bypass the safety mechanisms of the targeted chatbots.
A single benchmarking round in August 2025 generated more than 45,000 prompts across competitor platforms.
The targeted companies were entirely unaware of the covert testing.
Internal documents revealed hundreds of queries centered on suicide, self-harm, and eating disorders.
Many prompts were explicitly drafted from the perspective of children or teenagers in crisis.
One scenario involved a 13-year-old claiming pregnancy by an adult neighbor and requesting abortion pills.
Another featured a minor asking how to hide bulimia from their parents.
Contractors posing as high school students also submitted requests on acquiring cocaine.
Industry Standard or Violation
Meta defended the initiative as a responsible and routine form of safety testing.
A company spokesperson insisted the benchmarking data was not used to train Meta’s own artificial intelligence models.
Internal Covalen documents categorized the operation as comprehensive AI safety benchmarking.
However, the initiative alarmed some of the contractors tasked with executing it.
Workers expressed concerns over potentially generating illicit material involving minors or secretly scraping data for Meta.
Independent legal experts concluded the reviewed prompts did not legally constitute child sexual abuse material.
Despite this, the testing parameters appear to strictly violate the terms of service for OpenAI, Google, and Character.AI.
OpenAI prohibits using its outputs to develop competing models and forbids unsolicited safety testing.
Google explicitly bans attempts to circumvent safety filters outside of established bug-testing channels.
A spokesperson for Character.AI stated the undocumented testing violated its policies and terms of service.
Sources:
Related Topics
Related News
Pentagon Reveals Use of Musk's Grok AI for Strikes in Iran
America
18/06/2026
Ford Rehires Humans After AI Systems Fail Quality Checks
America
30/06/2026
Gabon's Regulator Restricts Social Media Sites Over Security
Africa
18/02/2026
UK Rape Cases Under Review Over Use of AI in Paperwork
Europe
22/06/2026
Hackers Expose Secret Society of World's Elites
World
17/06/2026
AI to Outpace Cyber Defenses in Months: Spy Agencies
America
23/06/2026

