Discover a cognitive framework for detailed LLM evaluation across domains, enabling targeted training and accurate ability predictions beyond single scores...
Discover a novel tensor completion method for LLM evaluation using low-rank structures and semiparametric efficiency to improve accuracy and reliability.