Source-Sensitive Reasoning in Turkish: Humans vs LLMs

Benchmarking Source-Sensitive Reasoning in Turkish: Humans and LLMs under Evidential Trust Manipulation

This article presents groundbreaking research published in arXiv:2604.24665v1, examining the influence of source trustworthiness on Turkish evidential morphology and the capability of large language models (LLMs) to track this sensitivity. The study is significant in the fields of linguistics and artificial intelligence, particularly in understanding how trust affects language processing.

Research Overview

The primary objective of this research is to investigate how the trustworthiness of information sources impacts the use of Turkish evidential morphology, specifically the past-domain contrast between the suffixes -DI and -mIs. The study employs controlled cloze contexts where the information source is clearly external, and its perceived reliability is manipulated between High-Trust and Low-Trust scenarios.

Methodology

The investigation starts with a human production experiment involving native Turkish speakers, where participants are asked to produce sentences based on varying trust contexts. The findings reveal a significant trust effect: contexts deemed as High-Trust lead to a higher frequency of the suffix -DI, while Low-Trust contexts favor the suffix -mIs. This pattern remains consistent even across various sensitivity analyses.

LLM Evaluation

Following the human experiment, the research evaluates ten different LLMs through three distinct prompting paradigms:

Open gap-fill
Explicit past-tense gap-fill
Forced-choice A/B selection

The analysis of LLM behavior reveals a complex landscape characterized by a model- and prompt-dependency. Some models demonstrate minor or localized shifts in alignment with the trust manipulation, but the overall effects are unstable. In many cases, these effects are reversed or overshadowed by issues related to output compliance and a strong preference for base-rate suffixes.

Key Findings

The research outcomes provide compelling evidence for a trust- and commitment-based understanding of Turkish evidentiality. Moreover, a significant gap is identified between human and LLM performance in source-sensitive evidential reasoning. The findings underscore the limitations of current LLMs in replicating nuanced human cognitive processes, particularly in contexts where trust and source reliability are critical.

Implications of the Study

This study has several implications for both linguistic theory and artificial intelligence. The results advocate for a deeper exploration of how trust influences language use and could inform future LLM design. Understanding the cognitive mechanisms behind evidential reasoning in humans can guide the development of more sophisticated models that better replicate human-like reasoning.

In conclusion, the research presents a significant step forward in bridging the gap between human linguistic capabilities and artificial intelligence. As LLMs continue to evolve, insights gained from studies like this one will be crucial in refining their understanding of context, trust, and the subtleties of human language.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

Source-Sensitive Reasoning in Turkish: Humans vs LLMs

Benchmarking Source-Sensitive Reasoning in Turkish: Humans and LLMs under Evidential Trust Manipulation

Research Overview

Methodology

LLM Evaluation

Key Findings

Implications of the Study

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related