AI safety & Anthropic

News on AI safety and security around Claude: alignment, red teaming, jailbreaks, vulnerabilities, disclosures and Anthropic's safety policies.

Security·Hacker News

Anthropic's Safety Superpower

2d ago · 212
Security·Hacker News

Anthropic suspends new AI tools over US Government security concerns

3d ago · 7
Security·Sud Ouest

United States: Anthropic forced to suspend its most powerful AI, Washington cites national security

4d ago
Security·Boursorama

Anthropic releases its most powerful AI to the public, restricted for security reasons

Jun 9
Security·BFM

Anthropic releases its most powerful AI to the public, but in a restricted version for sensitive areas such as cybersecurity and biological risks

Jun 9
Security·Pèse sur start

Anthropic releases its most powerful AI to the public... with limits for cybersecurity and biological risks

Jun 9
Security·Hacker News

Anthropic: Measuring LLMs' impact on N-day exploits

Jun 8 · 6
Security·Hacker News

Show HN: GitHub Copilot port of Anthropic's AI vulnerability discovery harness

Last week, Anthropic released https://github.com/anthropics/defending-code-reference-harne... , a reference harness for autonomous vulnerability discovery that uses Claude Code agents to find, verify, and patch memory-safety bugs. I wanted to use it but I only have access to Git

Jun 8 · 2
Security·Hacker News

What people don't get about safety at Anthropic

Jun 5 · 2
Security·Hacker News

ZEC drops 30% after Anthropic AI finds Zcash counterfeit vulnerability

Jun 5 · 20
Security·Hacker News

Anthropic's open-source framework for AI-powered vulnerability discovery

Jun 4 · 539
Security·Hacker News

Harvard Law: Anthropic is about to sell a safety mission Wall Street can veto

Jun 3 · 3
