Study finds classical algorithms CMA-ES and TPE beat LLMs at hyperparameter optimization; hybrid Centaur system combining both methods achieves best results using 0.8B model

Can LLMs Beat Classical Hyperparameter Optimization Algorithms? A Study on autoresearch

View PDF HTML (experimental) Abstract:The autoresearch repository enables an LLM agent to optimize hyperparameters by editing training code directly. We use it as a testbed to compare classical HPO algorithms against LLM-based methods on tuning the hyperparameters of a small language model under a fixed compute budget. When defining a fixed search space over autoresearch, classical methods such as CMA-ES and TPE consistently outperform LLM-based agents, where avoiding out-of-memory failures matt...