Making Adversarially-Trained Language Models Forget with Model Retraining: A Case Study on Hate Speech Detection

Published in In the proceedings of Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25 - 29, 2022, 2022

Recommended citation: Marwan Omar, David Mohaisen, "Making Adversarially-Trained Language Models Forget with Model Retraining: A Case Study on Hate Speech Detection." In the proceedings of Companion of The Web Conference 2022, Virtual Event / Lyon, France, 2022. https://doi.org/10.1145/3487553.3524667

Share on

Twitter Facebook LinkedIn