𝕏X (Twitter)

Alex Prompter (@alex_prompter)

🚨 Google just proved that longer chain-of-thought makes AI models LESS accurate, not more. raw token count has a negative correlation (r = -0.59) with getting the right answer. the more a model talks, the more likely it's wrong. but they found something better. a way to measure real reasoning effort by looking inside the model's layers. here's what they discovered:

Cargando tweet...