AI giant Anthropic signs safety pact with Australia - The Canberra Times

GNews AI AustraliaApril 1, 20261 min read0 views

<a href="https://news.google.com/rss/articles/CBMioAFBVV95cUxPc0RicE52aW8xdVAtTS0zenVhX2pJSHphVkM3MGkyanNtZVpOdF8yZDVzYWNwT29qYlN3QTJVSlJjMmNjeGFsWm12Uk5FS0FtTjVrSjhSeXVSRkx4SmEwQXlBQS1fVmEtYzFvUVlOalpJcUdDU01pSV9CRW40UUNfSl8xczZ5dVNrdW0ySkIwQUpKck9ZVHduanpuWDZnelM3?oc=5" target="_blank">AI giant Anthropic signs safety pact with Australia</a> The Canberra Times

Could not retrieve the full article text.

Read on GNews AI Australia →

Original source

GNews AI Australia

https://news.google.com/rss/articles/CBMioAFBVV95cUxPc0RicE52aW8xdVAtTS0zenVhX2pJSHphVkM3MGkyanNtZVpOdF8yZDVzYWNwT29qYlN3QTJVSlJjMmNjeGFsWm12Uk5FS0FtTjVrSjhSeXVSRkx4SmEwQXlBQS1fVmEtYzFvUVlOalpJcUdDU01pSV9CRW40UUNfSl8xczZ5dVNrdW0ySkIwQUpKck9ZVHduanpuWDZnelM3?oc=5

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

safety

ModelsLive

Predicting When RL Training Breaks Chain-of-Thought Monitorability

Crossposted from the DeepMind Safety Research Medium Blog . Read our full paper about this topic by Max Kaufmann, David Lindner, Roland S. Zimmermann, and Rohin Shah. Overseeing AI agents by reading their intermediate reasoning “scratchpad” is a promising tool for AI safety. This approach, known as Chain-of-Thought (CoT) monitoring, allows us to check what a model is thinking before it acts, often helping us catch concerning behaviors like reward hacking and scheming . However, CoT monitoring can fail if a model’s chain-of-thought is not a good representation of the reasoning process we want to monitor. For example, training LLMs with reinforcement learning (RL) to avoid outputting problematic reasoning can result in a model learning to hide such reasoning without actually removing problem

LessWrong AI

8mabout 2 hours ago

ModelsLive

Predicting When RL Training Breaks Chain-of-Thought Monitorability

AI Alignment Forum

8mabout 2 hours ago

Laws & RegulationLive

Smart food safety: implementing AI for risk, compliance and control - New Food magazine

<a href="https://news.google.com/rss/articles/CBMivwFBVV95cUxNLTl5bzY1YTcyNThZLVowbG9Bb3haS1lOM3lpbmRxZktqR25kcEg3WjJDMDd1UFRDdnpNUHJPekt1amxuRXp3ZHBleFVPbS1HSEhVQl9YODJtOENYSVltbW52YW8xSFBlRHdENFRXekhqMTd0RFNsSFBITWhKcDdCcVp1TWxMSFVjYWhjU25VNlJVT09sWWVwbzRPZGZWT3pQNjU1RTcwMWlJd1RYQUZPWGNXeWRadGtISThCbVBnUQ?oc=5" target="_blank">Smart food safety: implementing AI for risk, compliance and control</a> New Food magazine

Google News: AI Safety

1m33 minutes ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 179 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Countries

CountriesLive

LIMBO: Who We Are, What We Do, and an Exciting High-Impact Funding Opportunity

We are excited to publicly introduce the Laboratory for Importance-sampled Measure and Bayesian Observation (LIMBO), a small research group working at the intersection of cosmological theory, probability, and existential risk. We believe that the mechanisms by which observers continue to exist in the universe are important, neglected, and tractable to study and influence. Since our founding in October 2024, we have developed a mathematical framework for doing anthropic reasoning about rare-event estimation, and we have obtained significant empirical evidence which validates this framework. This empirical evidence was not cherry-picked: at LIMBO, we believe in putting our money where our mouth is, and we have a strong track record of success in financial and prediction markets downstream of

LessWrong AI

18mabout 1 hour ago

CountriesLive

Kuwait International Airport Remains Closed on April 1, 2026 After Fresh Iranian Drone Strike

Kuwait International Airport stayed shuttered to commercial passenger flights Wednesday as authorities battled a major fire at fuel storage tanks following the latest in a series of Iranian-linked drone attacks that have crippled the Gulf nation's key aviation hub.

International Business Times

6m29 minutes ago

CountriesLive

Strait of Hormuz Remains Choked in Iran Conflict as Limited Transits Resume Under Iranian Control

The strategically vital Strait of Hormuz stayed largely closed to normal commercial traffic Tuesday as the month-long Iran conflict continued, with only a handful of vessels transiting daily under tight Iranian oversight while the U.S. and allies debated military options to restore freedom of navigation and global oil flows.

International Business Times

6m24 minutes ago

CountriesLive

Advocacy groups urge YouTube to protect kids from 'AI slop' videos - Union-Bulletin

<a href="https://news.google.com/rss/articles/CBMi8AFBVV95cUxNQl9fYWcyaGJRSjhXRTduT3Yzc2JlUkRMWmZlbzViWTZaZ2lNNVI4QVBSNUdsa1lCWmFwZHdFTzA1U21ac0tQbDdyOWlzRFVPMFBubkxQVnNfdjJ1TEIwQUM3b3BRV0dOTzctU0Q3Tkk1R21SMXF1UXpxVTVDY0tyMGM3NEtva0otNF9vMmRXMXNKUk5WRFNRbXBkcndjd21jSUtmUE5Dd1RPRHQxR1J6aHFoYjQtVWZiY2pyTVJacW5scHEzY2hDd1ltUHEwZkRzNWtPUkxHSTE1UmUwcHA2OVFMeXBMbzVXQlA5TS10XzY?oc=5" target="_blank">Advocacy groups urge YouTube to protect kids from 'AI slop' videos</a> Union-Bulletin

Google News: Generative AI

1mabout 2 hours ago