Something is wrong with Sonnet 4.5

We're seeing an elevated number of failed tests in our coding benchmark for Sonnet 4.5. Sonnet 4 looks normal.

Sonnet 4.5 Test Results

Our automated tests show a significant spike in failures for Sonnet 4.5, while Sonnet 4 continues to perform within its expected range.

To stay up to date with monitoring changes, join our subreddit: https://www.reddit.com/r/isitnerfed