News Score: Score the News, Sort the News, Rewrite the Headlines

Neural audio codecs: how to get audio into LLMs

Václav VolhejnThank you for the valuable feedback on the drafts: Chung-Ming Chien, Moritz Boehle, Richard Hladík, Eugene Kharitonov, Patrick Perez, and Tom Sláma. I’d also like to thank the rest of the Kyutai team for the the research discussions without which this article could not exist. Click to playThe plan: sandwich a language model in an audio encoder/decoder pair (=neural audio codec), allowing it to predict audio continuations. As of October 2025, speech LLMs suck. Many LLMs have voice i...

Read more at kyutai.org

© News Score  score the news, sort the news, rewrite the headlines