SPICE improves large language model training by selecting conflict-aware data subsets, boosting performance while reducing training costs significantly.
Discover a novel weak supervision method to distill hallucination signals into transformer models for efficient, internal hallucination detection without e...
Discover Optimsyn, a framework that enhances synthetic data generation using influence-guided rubric optimization and reinforcement learning for better mod...