News Score: Score the News, Sort the News, Rewrite the Headlines

Is there a Half-Life for the Success Rates of AI Agents? — Toby Ord

Writing Building on the recent empirical work of Kwa et al. (2025), I show that within their suite of research-engineering tasks the performance of AI agents on longer-duration tasks can be explained by an extremely simple mathematical model — a constant rate of failing during each minute a human would take to do the task. This implies an exponentially declining success rate with the length of the task and that each agent could be characterised by its own half-life. This empirical regularity all...

Read more at tobyord.com

© News Score  score the news, sort the news, rewrite the headlines