Learn how contrastive decoding improves Large Language Models' scoring accuracy by reducing score range bias, boosting reliability in LLM evaluations by 11...
Discover DOVE, a novel framework for distributional open-ended evaluation of LLMs' cultural value alignment using a value codebook and optimal transport.