Discover Orthrus, a dual-view diffusion framework that boosts token generation speed by 7.8x with minimal memory overhead and high-quality text output.
Discover how entropy-guided self-distillation improves large language model reasoning by respecting self-uncertainty for efficient training and accuracy.