AI safety & Anthropic
News on AI safety and security around Claude: alignment, red teaming, jailbreaks, vulnerabilities, disclosures and Anthropic's safety policies.
Security·Hacker News
Anthropic's Safety Superpower
2d ago▲ 212
Security·Hacker News
Anthropic suspends new AI tools over US Government security concerns
3d ago▲ 7
Security·Sud Ouest
United States: Anthropic forced to suspend its most powerful AI, Washington cites national security
4d ago
Security·Boursorama
Anthropic releases its most powerful AI to the public, restricted for security reasons
Jun 9
Security·BFM
Anthropic releases its most powerful AI to the public, but in a restricted version for sensitive domains such as cybersecurity and biological risks
Jun 9
Security·Pèse sur start
Anthropic releases its most powerful AI to the public... with limits for cybersecurity and biological risks
Jun 9
Security·Hacker News
Anthropic: Measuring LLMs' impact on N-day exploits
Jun 8▲ 6
Security·Hacker News
Show HN: GitHub Copilot port of Anthropic's AI vulnerability discovery harness
Last week, Anthropic released https://github.com/anthropics/defending-code-reference-harne... , a reference harness for autonomous vulnerability discovery that uses Claude Code agents to find, verify, and patch memory-safety bugs. I wanted to use it but I only have access to Git
Jun 8▲ 2
Security·Hacker News
What people don't get about safety at Anthropic
Jun 5▲ 2
Security·Hacker News
ZEC drops 30% after Anthropic AI finds Zcash counterfeit vulnerability
Jun 5▲ 20
Security·Hacker News
Anthropic's open-source framework for AI-powered vulnerability discovery
Jun 4▲ 539
Security·Hacker News
Harvard Law: Anthropic is about to sell a safety mission Wall Street can veto
Jun 3▲ 3