DepCap accelerates diffusion language model inference with adaptive block-wise decoding, achieving speedups of up to 5.63x while maintaining output quality.
Discover Profile-Then-Reason, a framework that improves the efficiency and reliability of tool-augmented language agents by minimizing model calls and reducing lat...