Min-k Sampling: Robust Text Generation Without Temperature Bias

Min-$k$ Sampling: Decoupling Truncation from Temperature Scaling via Relative Logit Dynamics

Summary: arXiv:2604.11012v1 Announce Type: new

Abstract: The quality of text generated by large language models depends critically on the decoding sampling strategy. While mainstream methods such as Top-$k$, Top-$p$, and Min-$p$ achieve a balance between diversity and accuracy through probability-space truncation, they share an inherent limitation: extreme sensitivity to the temperature parameter.

Recent logit-space approaches like Top-$n\sigma$ achieve temperature invariance but rely on global statistics that are susceptible to long-tail noise, failing to capture fine-grained confidence structures among top candidates. To address these challenges, we propose Min-$k$ Sampling, a novel dynamic truncation strategy that analyzes the local shape of the sorted logit distribution to identify “semantic cliffs”: sharp transitions from high-confidence core tokens to uncertain long-tail tokens.

Key Features of Min-$k$ Sampling

Min-$k$ Sampling introduces several innovative features aimed at improving the text generation process:

Dynamic Truncation: By computing a position-weighted relative decay rate, Min-$k$ dynamically determines truncation boundaries at each generation step, allowing for more adaptive sampling.
Temperature Invariance: We formally prove that Min-$k$ achieves strict temperature invariance, meaning its performance is consistent across various temperature settings, unlike traditional methods.
Low Sensitivity to Hyperparameters: Empirical results demonstrate that Min-$k$ maintains robust performance even under extreme temperature settings where probability-based methods may collapse.

Impact on Text Quality

Extensive experiments conducted on multiple reasoning benchmarks and creative writing tasks reveal that Min-$k$ consistently improves text quality. This is particularly noteworthy in scenarios where other methods struggle due to their reliance on temperature parameters. Human evaluations further corroborate the effectiveness of Min-$k$, showcasing its ability to generate coherent and contextually appropriate text.

Conclusion and Availability

In conclusion, Min-$k$ Sampling presents a significant advancement in the realm of language model decoding strategies. By decoupling the truncation process from temperature scaling, it offers a more reliable and effective approach to text generation. We are committed to transparency and collaboration in the research community, and thus, we have made our code, models, and analysis tools publicly available for further exploration and enhancement by fellow researchers and practitioners.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

Min-k Sampling: Robust Text Generation Without Temperature Bias

Min-$k$ Sampling: Decoupling Truncation from Temperature Scaling via Relative Logit Dynamics

Key Features of Min-$k$ Sampling

Impact on Text Quality

Conclusion and Availability

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related