Discover Bayesian-LoRA, a probabilistic low-rank adaptation method that improves calibration and uncertainty in large language models with minimal paramete...
Discover how Calibration-Aware Policy Optimization (CAPO) improves reasoning accuracy and confidence in Large Language Models by addressing overconfidence...