Something is wrong with Sonnet 4.5
We're seeing an elevated number of failed tests in our coding benchmark for Sonnet 4.5. Sonnet 4 looks normal.

Our automated testing shows a significant spike in failures for Sonnet 4.5, while Sonnet 4 continues to perform within expected parameters.
To follow the latest monitoring changes, join our subreddit: https://www.reddit.com/r/isitnerfed