GitHub - monostate/weave-logprobs-reasoning-loop: A notebook that compares a reasoning model with a non-reasoning model that runs a loop driven by uncertainty found via logprobs
Logprobs Reasoning Loop with Weights & Biases Weave, an observability tool
Uncertainty-Aware Generation with OpenAI's Responses API
This project demonstrates a novel approach to improving AI model reasoning by leveraging token-level uncertainty metrics (logprobs) to create self-correcting generation loops. We compare this uncertainty-aware approach against traditional reasoning models to test whether explicit uncertainty handling can match or exceed the performance of dedicated reasoning archite...
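As a rough illustration of the idea described above, the following is a minimal sketch of a logprob-driven self-correction loop, not the notebook's actual code. It assumes a hypothetical `generate` callable that returns the generated tokens together with their log-probabilities; how those are obtained from the model API is left out. The function names, the perplexity threshold, the per-token probability cutoff, and the wording of the retry prompt are all illustrative assumptions.

```python
import math
from typing import Callable, List, Tuple

# A generation is modeled as (tokens, per-token log-probabilities).
Generation = Tuple[List[str], List[float]]


def perplexity(token_logprobs: List[float]) -> float:
    """Return exp of the negative mean log-probability (1.0 = fully confident)."""
    if not token_logprobs:
        raise ValueError("token_logprobs must not be empty")
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


def low_confidence_tokens(
    tokens: List[str], token_logprobs: List[float], min_prob: float = 0.5
) -> List[str]:
    """Return the tokens whose probability falls below min_prob (assumed cutoff)."""
    return [t for t, lp in zip(tokens, token_logprobs) if math.exp(lp) < min_prob]


def refine_until_confident(
    generate: Callable[[str], Generation],  # hypothetical model wrapper
    prompt: str,
    max_perplexity: float = 1.5,  # assumed threshold, tune per task
    max_rounds: int = 3,
) -> Tuple[str, int]:
    """Regenerate while the draft looks uncertain; return (text, rounds used)."""
    current_prompt = prompt
    tokens: List[str] = []
    rounds_used = 0
    for rounds_used in range(1, max_rounds + 1):
        tokens, logprobs = generate(current_prompt)
        if perplexity(logprobs) <= max_perplexity:
            break  # confident enough, stop refining
        # Feed the uncertain spots back so the next draft can reconsider them.
        uncertain = low_confidence_tokens(tokens, logprobs)
        current_prompt = (
            f"{prompt}\n\nYour previous draft was uncertain about: "
            f"{', '.join(uncertain)}. Reconsider and answer again."
        )
    return "".join(tokens), rounds_used
```

Perplexity is used here only because it is a simple aggregate of token logprobs; other metrics, such as per-token entropy over the top alternatives, could serve as the stopping signal instead.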