ZAYA1-8B Matches DeepSeek-R1 on Math with Less Than 1B Active Parameters - Firethering
Zyphra just dropped a model that’s doing something most people will scroll past without understanding why it’s interesting.
ZAYA1-8B matches DeepSeek-R1 on math benchmarks. Stays competitive with Claude Sonnet 4.5 on reasoning. Closes in on Gemini 2.5 Pro on coding. These are frontier model comparisons, the kind of numbers that usually come with billions of parameters and serious hardware requirements.
This one runs on less than 1 billion active parameters. And it was trained e...
Read more at firethering.com