From Scalars to Tensors: Enhancing Epistemic AI Evaluation

Date:

From Scalars to Tensors: Declared Losses Recover Epistemic Distinctions That Neutrosophic Scalars Cannot Express

Summary: arXiv:2604.09602v1 Announce Type: new

Abstract: Leyva-V\’azquez and Smarandache (2025) demonstrated that neutrosophic T/I/F evaluation, where Truth, Indeterminacy, and Falsity are independent dimensions not constrained to sum to 1.0, reveals “hyper-truth” (T+I+F > 1.0) in 35% of complex epistemic cases evaluated by LLMs. We extend their work in two directions.

Key Findings

  • We replicate and extend their experiment across five model families from five vendors: Anthropic, Meta, DeepSeek, Alibaba, and Mistral.
  • Our findings reveal hyper-truth in 84% of unconstrained evaluations, confirming that this phenomenon is cross-vendor under our prompt protocol.

Identifying Limitations

More significantly, we identify a limitation of scalar T/I/F that their framework cannot address. Models adopting an “Absorption” position (T=0, I=1, F=0) produce identical scalar outputs for fundamentally different epistemic situations, such as paradox, ignorance, and contingency. This collapse undermines the very distinctions that neutrosophic logic was designed to preserve.

Extending the Evaluation

Our research demonstrates that extending the evaluation to include declared losses—structured descriptions of what the model cannot evaluate and why—substantially recovers these distinctions. For instance, models that yield identical scalars for paradox and ignorance produce nearly disjoint loss vocabularies, with a Jaccard similarity of less than 0.10 on loss description keywords.

Domain-Specific Insights

Additionally, we observe that these models can provide domain-specific, severity-rated loss declarations. Such declarations significantly differentiate the nature of their uncertainties, suggesting that scalar T/I/F, while necessary, is an insufficient representation of the epistemic state.

The Case for Tensor-Structured Output

We conclude that tensor-structured output—which combines scalars and losses—provides a more faithful model of LLM epistemic capabilities. This approach not only addresses the limitations of the scalar framework but also enhances our understanding of how language models perceive and articulate uncertainty.

Future Directions

As AI systems continue to evolve, understanding the nuances of their epistemic evaluations will become increasingly crucial. Our findings pave the way for future research aimed at refining LLM evaluation frameworks, ensuring that the complexities of human-like reasoning are better represented and understood.

In summary, the integration of declared losses into the evaluation of LLMs not only enhances the fidelity of their epistemic representations but also opens up new avenues for research and application in artificial intelligence.


Related AI Insights

Lazarus Omolua
Lazarus Omoluahttps://richlyai.com/blog
My mission is to make sure that people in Africa are not left behind in the global AI revolution. RichlyAI exists to give everyone — students, founders, creators, and businesses — the tools to compete globally.

Subscribe

Popular

More like this
Related

How Business Ops Teams Boost Productivity with Codex

Discover how business operations teams use Codex to streamline documentation, enhance collaboration, and improve decision-making with AI-powered automation...

OpenAI Partners with Malta to Offer ChatGPT Plus Nationwide

OpenAI and Malta team up to provide free ChatGPT Plus access and AI training to all citizens, promoting digital literacy and responsible AI use.

Critical Linux Kernel Flaw Risks SSH Host Key Theft

A critical Linux kernel flaw risks stolen SSH host keys. Learn how to protect your systems and stay secure until patches are widely available.

Top External Hard Drives 2026: Expert Reviews & Buying Guide

Discover the best external hard drives of 2026 with expert reviews. Find top picks for speed, durability, and security to suit all storage needs.