Discover Slash, a training-free method to boost structural attention in LLMs, improving graph reasoning without costly fine-tuning or complex adapters.
Discover EXPO, a novel reinforcement learning method improving AI exploration via adaptive KL regulation and Gaussian curriculum sampling for better math r...