TaxCalcBench: Evaluating Frontier Models on the Tax Calculation Task
View PDF
HTML (experimental)
Abstract:Can AI file your taxes? Not yet. Calculating US personal income taxes is a task that requires building an understanding of vast amounts of English text and using that knowledge to carefully compute results. We propose TaxCalcBench, a benchmark for determining models' abilities to calculate personal income tax returns given all of the necessary information. Our experiment shows that state-of-the-art models succeed in calculating less than a third of federal i...
Read more at arxiv.org