AI Risk Score: Week of August 4–11, 2025


AI Risk Score: 57

Base Score: 50

  • Risk-Increasing Signals
  • Risk-Reducing Signals

Risk-Increasing Signals

Smarter AI Personas & Deepfakes – AI-generated companions and propaganda bots are eroding trust. Axios reports on the growing sophistication of “smarter fake friends,” while a Vanderbilt study found that China’s GoLaxy is using real-time propaganda bots. Experts warn of mental health effects such as “AI psychosis.”

→ +7 → 57

https://www.axios.com/newsletters/axios-am-fa1b6a5e-2013-433b-88c1-8a9dce4aad7a

Offensive AI Capabilities in Cybersecurity (used to exploit bugs) – Anthropic’s Claude outperformed human hackers in competitions such as PicoCTF and Hack the Box, demonstrating advanced penetration-testing and exploit capabilities.

→ +2 → 59

https://www.axios.com/newsletters/axios-future-of-cybersecurity-70efdf90-6f0a-11f0-b208-9de25a6b6240

Global AI Arms Race – South Korea’s Upstage released Solar Pro 2, a 30B-parameter LLM that beats Claude 3.7, DeepSeek V3, and GPT-4.1 in benchmarks, signaling its arrival in the frontier model race.

→ +5 → 64

https://www.ft.com/content/549a34f1-5eb5-43bd-8400-2171e6e32022

https://timesofindia.indiatimes.com/technology/tech-news/china-says-ai-risks-becoming-exclusive-game-as-beijing-proposes-global-body/articleshow/122958270.cms

Trump’s AI Action Plan leans toward deregulation and political content controls (“woke AI” bans) – The plan strips out DEI and climate language from AI guidelines and accelerates deployment without substantial safeguards.

→ +7 → 71

https://www.theverge.com/policy/713788/trump-ai-action-plan-explainer

Media Saturation Risk – The Financial Times counted over 2,100 AI-related headlines in a single day, up from just nine a decade ago. The coverage surge risks hype cycles, public fatigue, and polarized sentiment.

→ +5 → 77

https://www.ft.com/content/f7c7806e-7f67-11ee-9c97-26d3c3e56307

Risk-Reducing Signals

Defensive AI Capabilities in Cybersecurity (used to find and patch bugs) – DARPA’s AI Cyber Challenge concluded at DEF CON, producing autonomous tools that detected 77% and patched 61% of the vulnerabilities injected into critical infrastructure software. $21M in funding will go toward deploying these tools, especially in healthcare.

→ −7 → 70

CyberScoop: “DARPA’s AI Cyber Challenge reveals winning models for automated vulnerability discovery and patching”

Deepfake Detector Maturity – DEF CON demos showed new detection tools reaching 98% accuracy for AI-generated images, video, and voice, improving the feasibility of authentication in media and law enforcement.

→ −4 → 66

https://www.theregister.com/2025/08/11/deepfake_detectors_fraud/

International Governance Dialogue – China reiterated its call for a global AI governance body, signaling willingness for international coordination despite political headwinds.

→ −4 → 62

https://timesofindia.indiatimes.com/technology/tech-news/china-says-ai-risks-becoming-exclusive-game-as-beijing-proposes-global-body/articleshow/122958270.cms

China warns AI could become an “exclusive game,” proposing a global governance body – Although the proposal is intended as a mitigation, the statement also acknowledges intensifying global competition.

→ −5 → 57

https://timesofindia.indiatimes.com/technology/tech-news/china-says-ai-risks-becoming-exclusive-game-as-beijing-proposes-global-body/articleshow/122958270.cms

AI Risk Score: 57

This week’s score of 57, seven points above the base of 50, signals a moderately elevated risk environment. Cultural, political, and technological forces pushed the score upward, driven by the spread of AI-generated personas, heightened geopolitical competition, and deregulatory policy shifts. Meaningful safety advances in autonomous vulnerability patching, deepfake detection, and governance dialogue were not enough to offset the upward pressure from misuse potential, media saturation, and governance erosion. The trajectory remains tilted toward increasing systemic risk.
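
The running tally used in this post (start at the base score, then apply each signal's adjustment in order) can be sketched in a few lines of Python. This is only an illustrative sketch, not the tool used to produce the score: the `Signal` class, the `tally` function, and the shortened labels are hypothetical names introduced here, and the adjustments are copied from the arrows listed above. Summing those adjustments as printed gives 56 rather than 57, because 71 + 5 is 76 while the Media Saturation step is printed as 77; the one-point difference carries through to the final figure.

```python
from dataclasses import dataclass

BASE_SCORE = 50  # base score stated at the top of the post


@dataclass(frozen=True)
class Signal:
    label: str  # shortened label, for illustration only
    delta: int  # positive raises the risk score, negative lowers it


# Adjustments copied from the "→ +N" / "→ −N" lines, in the order listed.
SIGNALS = [
    Signal("Smarter AI personas and deepfakes", +7),
    Signal("Offensive AI capabilities in cybersecurity", +2),
    Signal("Global AI arms race", +5),
    Signal("AI Action Plan deregulation", +7),
    Signal("Media saturation", +5),
    Signal("DARPA AI Cyber Challenge tools", -7),
    Signal("Deepfake detector maturity", -4),
    Signal("International governance dialogue", -4),
    Signal("China's global governance proposal", -5),
]


def tally(base, signals):
    """Return the running score after each signal is applied in order."""
    running = []
    score = base
    for signal in signals:
        score += signal.delta
        running.append(score)
    return running


running_totals = tally(BASE_SCORE, SIGNALS)

for signal, total in zip(SIGNALS, running_totals):
    print(f"{signal.delta:+d} -> {total}  {signal.label}")
```

Keeping the adjustments in a single ordered list makes it easy to recheck the running totals whenever a signal is added, removed, or reweighted.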