Enhancing Math Learning with LLMs: Anxiety, Confidence & Performance

Math Education Digital Shadows: Bridging Gaps in Learning with LLMs

A groundbreaking study titled “Math Education Digital Shadows for facilitating learning with LLMs: Math performance, anxiety and confidence in simulated students and AIs,” recently published on arXiv, aims to elevate the role of Large Language Models (LLMs) in math education. The research introduces a novel dataset known as MEDS (Math Education Digital Shadows), designed to provide insights into how various LLMs perform in mathematics and their inherent biases across different prompts.

Understanding MEDS

The MEDS dataset encompasses an extensive collection of 28,000 personas derived from 14 distinct LLMs, including well-known families such as Mistral, Qwen, DeepSeek, Granite, Phi, and Grok. Each persona represents either a human student or an AI assistant, allowing for a comprehensive analysis of mathematical reasoning in both contexts.

Components of the Dataset

MEDS is unique in its multifaceted approach to assessing mathematical understanding. It features four primary types of tasks:

Open Math Interview: Allows for an unrestricted exploration of mathematical thinking.
Psychometric Tests: Three tests assessing math perceptions, accompanied by detailed explanations.
Cognitive Networks: These capture attitudes towards math, providing insight into emotional and psychological factors.
High-School Math Test Questions: Eighteen questions designed to evaluate proficiency, along with reasoning and confidence scores.

Innovative Approach

Unlike traditional benchmarks that focus solely on score outcomes, MEDS integrates several critical factors, including:

Self-Efficacy: Understanding how confident a student feels in their math abilities.
Math Anxiety: Examining the emotional responses associated with math tasks.
Cognitive Network Science: Exploring the relationships between various cognitive elements in math learning.

Key Findings

The validation process for the MEDS dataset demonstrated that the sampled LLMs maintain schema integrity, presenting consistent personas that reflect both human and AI characteristics. Notably, the study found family-specific peculiarities, such as:

Human-like negative math attitudes, indicating a tendency towards math anxiety.
Logical fallacies, showcasing common errors in reasoning.
Instances of math overconfidence, where models displayed unwarranted assurance in their answers.

Implications for Future Research

The introduction of MEDS holds significant promise for various fields. Learning analytics experts can leverage the dataset to improve educational strategies, while cognitive scientists can further investigate the psychological aspects of math learning. Additionally, developers of AI tutors can utilize the insights gained from MEDS to create safer, more effective tools for teaching mathematics, ultimately enhancing the educational experience for students.

This research not only fills a crucial gap in understanding LLMs in math education but also paves the way for future studies to explore the intricate dynamics of math learning, anxiety, and confidence, ensuring that both human and AI tutors can better support students in their educational journeys.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

Enhancing Math Learning with LLMs: Anxiety, Confidence & Performance

Math Education Digital Shadows: Bridging Gaps in Learning with LLMs

Understanding MEDS

Components of the Dataset

Innovative Approach

Key Findings

Implications for Future Research

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related