Discover PoiCGAN, a novel targeted poisoning attack using feature-label perturbation in federated learning, achieving high success with minimal accuracy lo...
Discover proven methods like fine-tuning and input sanitization to prevent many-shot jailbreaking attacks on large language models, enhancing AI safety.