
Nedjma Djouhra Ousidhoum نجمة جوهرة أوسيدهم

Lecturer (Assistant Professor)

OusidhoumN (at) cardiff (dot) ac (dot) uk
Office 5.50 Abacws building, Cardiff.

I am a Lecturer (Assistant Professor) at Cardiff University and a Visiting Academic at the University of Cambridge. Prior to this, I was a Postdoctoral Research Associate at the University of Cambridge, advised by Andreas Vlachos. I did my PhD at the Hong Kong University of Science and Technology (HKUST) under the supervision of Yangqiu Song and Dit-Yan Yeung.

I am interested in Computational Social Science and NLP in general, with a focus on automated fact-checking, toxic content detection and related tasks, and low-resource languages. For more information, see my curriculum vitae and my Google Scholar profile.

Education

  • 2014-2021: PhD in Computer Science, the Hong Kong University of Science and Technology (HKUST), Hong Kong.
  • 2010-2012: Master in Software Engineering, University of Science and Technology Houari Boumedienne (USTHB), Algiers, Algeria.
  • 2007-2010: Bachelor in Computer Science, University of Science and Technology Houari Boumedienne (USTHB), Algiers, Algeria.

Academic Services

  • Area Chair (Less-Resourced/Endangered/Less-studied Languages track) at LREC-COLING 2024.
  • Area Chair (Ethics in NLP) at EACL 2024.
  • Senior Area Chair (NLP Applications track) at EMNLP 2023 (Outstanding Senior Area Chair Award).
  • Diversity and Inclusion Chair at ACL 2023. You can check the statistics related to the visa situation and some other thoughts here.
  • Diversity, Equity and Inclusion committee member (financial accessibility chair) at NAACL 2022.
  • Reviewer for ACL Rolling Review (since November 2021).
  • Reviewer for EMNLP 2021, ACL 2021, NAACL 2021, NEATCLasS Workshop at ICWSM 2022, ACL 2022, NAACL 2022, EMNLP 2022, ACL 2023, C3NLP Workshop at ACL 2023, NEATCLasS Workshop at ICWSM 2023, SemEval 2024.
  • Secondary reviewer for COLING 2020, AAAI 2019.
  • Co-organiser of the North African in NLP affinity group social at EMNLP 2020, COLING 2020, EACL 2021, NAACL 2021, ACL 2021, NAACL 2022.

Current Teaching

I am currently (Spring 2024) teaching CMT3202 (Emerging Technologies) at Cardiff University. If you are taking the class, send me an email if you have any questions.
(Please add the class code to the subject of your email.)

Past Teaching Experience

  • Fall 2022: Supervisor of Object-Oriented Programming, University of Cambridge.
  • Summer 2022: Supervisor of 3 Summer projects, Pembroke College, University of Cambridge.
  • Spring 2018: COMP4911 IT Entrepreneurship, the Hong Kong University of Science and Technology.
  • Spring 2017: COMP4221 Natural Language Processing, the Hong Kong University of Science and Technology.
  • Spring 2016: COMP4431 Multimedia Computing, the Hong Kong University of Science and Technology.
  • Fall 2015: COMP1029Q C Programming Bridging Course, the Hong Kong University of Science and Technology.

Publications

Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Meriem Beloucif, Christine De Kock, Oumaima Hourrane, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Krishnapriya Vishnubhotla, Seid Muhie Yimam, Saif M. Mohammad: SemEval-2024 Task 1: Semantic Textual Relatedness for African and Asian Languages, in Proceedings of SemEval: The 18th International Workshop on Semantic Evaluation co-located with NAACL 2024 (to appear). [Paper] [Data and Code]

Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Abinew Ali Ayele, Pavan Baswani, Meriem Beloucif, Chris Biemann, Sofia Bourhim, Christine De Kock, Genet Shanko Dekebo, Oumaima Hourrane, Gopichand Kanumolu, Lokesh Madasu, Samuel Rutunda, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Hailegnaw Getaneh Tilaye, Krishnapriya Vishnubhotla, Genta Winata, Seid Muhie Yimam, Saif M. Mohammad: SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages (Preprint). [Paper] [Data and Code]

Michael Schlichtkrull, Nedjma Ousidhoum, Andreas Vlachos: The Intended Uses of Automated Fact-Checking Artefacts: Why, How and Who (Findings of EMNLP 2023). [Paper] [Data]

Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Abinew Ali Ayele, Nedjma Ousidhoum, David Ifeoluwa Adelani, Seid Muhie Yimam, Ibrahim Sa'id Ahmad, Meriem Beloucif, Saif M. Mohammad, Sebastian Ruder, Oumaima Hourrane, Pavel Brazdil, Felermino Dário Mário António Ali, Davis David, Salomey Osei, Bello Shehu Bello, Falalu Ibrahim, Tajuddeen Gwadabe, Samuel Rutunda, Tadesse Belay, Wendimu Baye Messelle, Hailu Beshada Balcha, Sisay Adugna Chala, Hagos Tesfahun Gebremichael, Bernard Opoku, Steven Arthur: AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages (EMNLP 2023). [Paper] [Data]

Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Seid Muhie Yimam, David Ifeoluwa Adelani, Ibrahim Sa'id Ahmad, Nedjma Ousidhoum, Abinew Ayele, Saif M Mohammad, Meriem Beloucif: SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval), in Proceedings of SemEval: The 17th International Workshop on Semantic Evaluation co-located with ACL 2023. [Paper] [Data and Code]

Nedjma Ousidhoum*, Zhangdie Yuan*, Andreas Vlachos: Varifocal Question Generation for Fact-checking in Proceedings of EMNLP 2022. [Paper] [Code]. (* Equal contribution)

Nedjma Ousidhoum: On the Importance and Challenges of the Experimental Design of Multilingual Toxic Content Detection. PhD thesis 2021.

Nedjma Ousidhoum, Xinran Zhao, Tianqing Fang, Yangqiu Song, Dit-Yan Yeung: Probing Toxic Content in Large Pre-Trained Language Models in Proceedings of ACL-IJCNLP 2021. [Paper] [Code] [Presentation]

Nedjma Ousidhoum, Yangqiu Song, Dit-Yan Yeung: Comparative Evaluation of Label-Agnostic Selection in Multilingual Hate Speech Datasets in Proceedings of EMNLP 2020. [Paper] [Code] [Presentation]

Nedjma Ousidhoum, Zizheng Lin, Hongming Zhang, Yangqiu Song, Dit-Yan Yeung: Multilingual and Multi-Aspect Hate Speech Analysis in Proceedings of EMNLP 2019. [Paper] [Code]

Nedjma Ousidhoum, Nacéra Bensaou: Towards The Refinement of the Arabic Soundex in Proceedings of NLDB 2013. [Paper]

Nedjma Ousidhoum, Asma Bensalah, Nacéra Bensaou: New Classical Arabic Soundex Algorithm in Proceedings of CIT 2012, the Second Conference on Advances in Communication and Information Technology. [Paper]

Talks

On Benchmarking and Building resources: The inevitable and the preventable pitfalls. Queen Mary University of London (Seminar). March 2024.

What Is Needed Vs What is Built in NLP: Toxic Language Detection and Automated Fact-checking Models As Use Cases. Uppsala NLP Seminar. May 2023.

What Is Needed Vs What is Built in NLP: Toxic Language Detection and Automated Fact-checking Models As Use Cases. CohereAI Community Talks. Recording. May 2023.

Expectations vs. Reality: Doing Multilingual Toxic Content Detection in NLP. Aston Institute of Forensic Linguistics. May 2023.

What Is Needed Vs What is Built in NLP: Toxic Language Detection and Automated Fact-checking Models As Use Cases. Cardiff NLP Seminar. January 2023.

Being a Researcher in Arabic NLP. Panel discussion at WiNLP Workshop (co-located with EMNLP 2022). December 2022.

Arabic Toxic Content Detection in NLP. "Arabic AI and Toxic Online Content Detection" Panel discussion at IWABigDAI. May 2022.

Expectations vs. Reality: Lessons Learned from Working on Toxic Content Detection in NLP. Language Technology Group Seminar (Universität Hamburg). February 2022.

Expectations vs. Reality: Lessons Learned from Working on Toxic Content Detection in NLP. Cambridge NLIP Seminar (University of Cambridge). January 2022.

Expectations vs. Reality: Lessons Learned from Working on Toxic Content Detection in NLP. MilaNLP Group Seminar (Bocconi University, Milan). September 2021.

Challenges in Toxic Content Detection. Language and Multimodal AI Lab (LAMA) Group Seminar (Imperial College London). August 2021.

Normalizing the Experimental Design of Multilingual Hate Speech Detection. Group Seminar of the Digital Technologies Research Center (National Research Council, Canada). November 2020.
