Liu, C. (CSE) – Enabling LLM Unlearning at Inference Time by Decomposing Detection and Intervention
Machine unlearning addresses the “right to be forgotten” under GDPR and enables privacy, copyright, and safety compliance in large language models. Training-based unlearning can remove targeted behavior on benchmarks, but it scales poorly, can degrade utility, and can fail under adversarial prompting that recovers supposedly forgotten content. This prospectus proposes inference-time behavioral unlearning: rather than […]