Discover LLMSYS-HPOBench, a benchmark suite for hyperparameter optimization in real-world large language models, with extensive datasets and metrics.
Discover HTPO, a novel RL algorithm enhancing exploration-exploitation balance in LLMs via hierarchical token-level control for superior reasoning performa...
Discover how PolyLM uses large language models to predict polymer properties by analyzing synthesis and processing descriptions in scientific literature.