publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. Tracing and Reversing Rank-One Model Edits
    Paul Youssef, Zhixue Zhao , Christin Seifert , and 1 more author
    2025
  2. Position: Editing Large Language Models Poses Serious Safety Risks
    Paul Youssef, Zhixue Zhao , Daniel Braun , and 2 more authors
    In Forty-second International Conference on Machine Learning Position Paper Track , 2025
  3. How to Make LLMs Forget: On Reversing In-Context Knowledge Edits
    Paul Youssef, Zhixue Zhao , Jörg Schlötterer , and 1 more author
    In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) , Apr 2025
  4. Has this Fact been Edited? Detecting Knowledge Edits in Language Models
    Paul Youssef, Zhixue Zhao , Christin Seifert , and 1 more author
    In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) , Apr 2025

2024

  1. The Queen of England is not England’s Queen: On the Lack of Factual Coherency in PLMs
    Paul Youssef, Jörg Schlötterer , and Christin Seifert
    In Findings of the Association for Computational Linguistics: EACL 2024 , Mar 2024

2023

  1. Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models
    Paul Youssef, Osman Koraş , Meijie Li , and 2 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2023 , Dec 2023
  2. Privacy-Preserving Knowledge Transfer through Partial Parameter Sharing
    Paul Youssef, Jörg Schlötterer , and Christin Seifert
    In Proceedings of the 5th Clinical Natural Language Processing Workshop , Jul 2023