The "Petri" tool deploys AI agents to evaluate frontier models. AI's ability to discern harm is still highly imperfect. Early tests showed Claude Sonnet 4.5 and GPT-5 to be safest. Anthropic has ...
Trustworthy AI isn’t just about predicting the right outcome; it’s about knowing how confident we should actually be.
Amazon Web Services is rolling out a slate of new homegrown AI models and a service for enterprise customers to build their own custom versions. The cloud provider launched Nova 2, a fleet of four new ...