Discover Text2DistBench, a new benchmark evaluating large language models' ability to understand distributional reading comprehension beyond factual recall...
Discover a novel weak supervision method to distill hallucination signals into transformer models for efficient, internal hallucination detection without e...