OpenAI Reports Rise in Malicious AI Use on Websites and Social Media
OpenAI's new report highlights the rise of AI-driven malicious campaigns on websites and social media, proposing advanced detection strategies.

On February 26, 2026, OpenAI released a new threat intelligence report highlighting the increasing use of AI models by malicious actors on websites and social platforms. Titled "Disrupting Malicious Uses of AI," the report details the tactics of threat groups and proposes advanced detection strategies and defensive measures for platforms and regulators.
Key Findings from OpenAI's Report
The report identifies a significant increase in AI-driven malicious campaigns. Adversaries are using publicly available AI models, often fine-tuned or prompted via APIs, to build scalable phishing sites, deepfake hubs, and automated disinformation networks, typically hosted on legitimate-looking web domains.
- Attackers are combining large language models (LLMs) with social media APIs to create hyper-personalized scams that mimic trusted brands, bypassing traditional content filters.
- The report documents over 200 instances in the past six months in which AI was used to generate malicious websites hosting credential-harvesting forms or malware droppers.
- AI-orchestrated botnets are increasingly active on social platforms, deploying synthetic media to influence elections or incite unrest.
A core theme is the detection challenge: traditional signature-based tools are ineffective against AI-generated content, which mutates dynamically. OpenAI proposes hybrid defenses, including watermarking for its models and behavioral analytics to flag anomalous API usage patterns.
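To make the behavioral-analytics idea concrete, here is a minimal sketch (not OpenAI's actual system) that flags API keys whose latest hourly request volume is a statistical outlier against that key's own history. The function name, data shape, and z-score threshold are all illustrative assumptions.

```python
from statistics import mean, stdev

# Hypothetical behavioral-analytics sketch: flag API keys whose latest
# hourly request count deviates sharply from their historical baseline.
# Field names and the 3-sigma threshold are illustrative assumptions.

def flag_anomalous_keys(hourly_counts: dict[str, list[int]],
                        z_threshold: float = 3.0) -> list[str]:
    """Return API keys whose latest hourly volume is a statistical outlier.

    hourly_counts maps api_key -> per-hour request counts, oldest first;
    the final entry is the hour under inspection.
    """
    flagged = []
    for key, counts in hourly_counts.items():
        if len(counts) < 25:   # need a day of baseline plus the current hour
            continue
        baseline, latest = counts[:-1], counts[-1]
        mu, sigma = mean(baseline), stdev(baseline)
        if sigma == 0:
            # Perfectly flat history: use a minimal spread of one request
            # per hour so the z-score stays defined.
            sigma = 1.0
        if (latest - mu) / sigma > z_threshold:
            flagged.append(key)
    return flagged

if __name__ == "__main__":
    steady = [90, 110] * 12               # 24 hours of fluctuating baseline
    usage = {
        "key_normal": steady + [105],     # within normal variation
        "key_burst": steady + [5000],     # sudden spike in the latest hour
    }
    print(flag_anomalous_keys(usage))     # -> ['key_burst']
```

A production system would track richer signals (prompt content categories, geographic spread, token volume), but the core pattern, scoring each account against its own baseline rather than a global signature, is what distinguishes this approach from traditional filters.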
OpenAI's Track Record in AI Safety Enforcement
This report builds on OpenAI's ongoing efforts to curb malicious AI deployments. Since introducing its Preparedness Framework in 2023, OpenAI has faced setbacks, including a temporary GDPR ban in Italy in 2024, but it has also made measurable progress, reporting an 85% increase in blocked abuse attempts by mid-2025 through partnerships with Microsoft Azure. The report contrasts this record with other major labs:
- OpenAI: Blocked 4.8 million malicious prompts in 2025 using watermarking and API behavioral flags.
- Anthropic: Focuses on "constitutional AI" but lacks OpenAI's scale, reporting 2.1 million disruptions.
- Google DeepMind: Uses model cards and federated learning but has faced criticism for slow API abuse responses.
- xAI: Prioritizes fewer restrictions, resulting in higher misuse rates in unregulated regions.
Strategic Context Amid Global AI Scrutiny
The report's release comes amid mounting regulatory pressure and geopolitical tension. A recent global AI summit in Paris called for "secure, trustworthy, and robust AI," addressing threats ranging from job disruption to energy demands.
- U.S. intelligence has warned of AI-fueled election interference peaking in the 2026 midterms, prompting OpenAI's proactive disclosure.
- Industry leaders are signaling compliance to preempt broader regulatory crackdowns, as seen with DeepSeek facing bans and fines for lax data practices.
Implications for Detection and Defense
The report advocates for cross-industry sharing of threat intelligence. Websites should integrate AI-specific scanners, and social media giants like Meta and X could adopt OpenAI's detection APIs.
- Regulators are encouraged to mandate AI provenance labeling, as in the EU AI Act's high-risk tiers.
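The EU AI Act does not prescribe a specific label format, and the report is not quoted on implementation details, so the following is only a rough sketch of what machine-readable provenance labeling could look like: a signed metadata record bound to a content hash, built with Python's standard hmac library. The schema, key handling, and function names are assumptions; a production system would more likely adopt an open standard such as C2PA.

```python
import hashlib
import hmac
import json
from datetime import datetime, timezone

# Illustrative provenance label: a signed metadata record bound to the
# content's hash. Schema and key management are assumptions for this
# sketch, not the EU AI Act's (format-neutral) requirements.

SIGNING_KEY = b"replace-with-a-managed-secret"   # hypothetical provider key

def label_content(content: bytes, model_id: str) -> dict:
    """Produce a signed provenance record for AI-generated content."""
    record = {
        "content_sha256": hashlib.sha256(content).hexdigest(),
        "generator": model_id,
        "ai_generated": True,
        "issued_at": datetime.now(timezone.utc).isoformat(),
    }
    payload = json.dumps(record, sort_keys=True).encode()
    record["signature"] = hmac.new(SIGNING_KEY, payload,
                                   hashlib.sha256).hexdigest()
    return record

def verify_label(content: bytes, record: dict) -> bool:
    """Check that the label is untampered and matches the content."""
    claimed = dict(record)
    signature = claimed.pop("signature", "")
    payload = json.dumps(claimed, sort_keys=True).encode()
    expected = hmac.new(SIGNING_KEY, payload, hashlib.sha256).hexdigest()
    return (hmac.compare_digest(signature, expected)
            and claimed.get("content_sha256")
                == hashlib.sha256(content).hexdigest())

if __name__ == "__main__":
    text = b"Example model output."
    label = label_content(text, "example-model-v1")
    print(verify_label(text, label))            # True: label matches content
    print(verify_label(b"tampered", label))     # False: content was altered
```

Under such a scheme, a platform or regulator could verify a label by recomputing the hash and signature before trusting the "ai_generated" claim, which is the kind of cross-industry verification the report advocates.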
This release underscores the dual-use dilemma of AI: while innovation accelerates threats, vigilant stewardship is essential. OpenAI positions itself as a leader, but sustained impact requires collaboration beyond proprietary silos.