ANN v3: 200ms p99 query latency over 100 billion vectors
January 21, 2026•Nathan VanBenschoten (Chief Architect)The pursuit of scale is not vanity. When you take existing systems and optimize
them from first principles to achieve a step change in scalability, you can
create something entirely new.
Nothing has demonstrated that more clearly than the explosion in deep learning
over the past decade. The ML community took decades-old ideas and combined them
with advancements in hardware, new algorithms, and hyper-specialization to forge
something remarkab...
Read more at turbopuffer.com