Explore OmniBehavior, a benchmark using real-world data to evaluate LLMs on long-term, cross-scenario human behavior simulation and address model biases.
Explore the role of large language models in outpatient referrals, benchmarking performance, challenges, and future directions for healthcare integration.