𝕏X (Twitter)
Alex Prompter (@alex_prompter)
🚨 Google just proved that longer chain-of-thought makes AI models LESS accurate, not more. raw token count has a negative correlation (r = -0.59) with getting the right answer. the more a model talks, the more likely it's wrong. but they found something better. a way to measure real reasoning effort by looking inside the model's layers. here's what they discovered:
Cargando tweet...