SPARE: Efficient Self-Distillation for Model Unlearning

SPARE: Self-distillation for PARameter-Efficient Removal

Summary: arXiv:2602.07058v2

Announce Type: replace-cross

Abstract

Machine Unlearning aims to remove the influence of specific data or concepts from trained models while preserving overall performance—a capability increasingly required by data protection regulations and responsible AI practices. Despite recent progress, unlearning in text-to-image diffusion models remains challenging due to high computational costs and the difficulty of balancing effective forgetting with retention of unrelated concepts.

Introducing SPARE

We introduce Self-distillation for PARameter Efficient Removal (SPARE), a two-stage unlearning method for image generation that combines parameter localization with self-distillation. This innovative approach addresses the growing need for efficient unlearning mechanisms while ensuring the integrity of non-targeted concepts within models.

How SPARE Works

SPARE operates through the following two key stages:

Parameter Localization: SPARE first identifies the parameters that are most responsible for generating the unwanted concepts. This is achieved using gradient-based saliency techniques. The updates to these parameters are then constrained through sparse low-rank adapters, ensuring that only lightweight, localized modifications are made.
Self-Distillation Objective: In the second stage, SPARE utilizes a self-distillation objective that effectively overwrites the unwanted concept with a user-defined surrogate concept, while maintaining the model’s behavior concerning other unrelated concepts.

Innovative Timestep Sampling

In addition to the two-stage process, SPARE introduces a timestep sampling scheme specifically designed for diffusion models. This scheme targets only the crucial timesteps associated with a given concept, leading to more efficient unlearning processes. By focusing on essential timesteps, SPARE minimizes unnecessary computational overhead, enhancing performance.

Performance and Results

SPARE has demonstrated its efficacy by surpassing the current state-of-the-art on the UnlearnCanvas benchmark. Through extensive ablation studies conducted on various datasets, we have shown that SPARE provides fine-grained control over the forgetting-retention trade-off. These results are particularly promising as they indicate that SPARE achieves strong concept erasure while maintaining high retainability across diverse domains.

Conclusion

In conclusion, SPARE offers a robust solution for selective unlearning in diffusion-based image generation models. By effectively balancing the need to forget unwanted concepts while retaining the integrity of other data, SPARE addresses a significant challenge in the field of responsible AI. As data protection regulations continue to evolve, methods like SPARE will play a crucial role in ensuring compliance and ethical AI practices.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

SPARE: Efficient Self-Distillation for Model Unlearning

SPARE: Self-distillation for PARameter-Efficient Removal

Abstract

Introducing SPARE

How SPARE Works

Innovative Timestep Sampling

Performance and Results

Conclusion

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related