LREC 2022 Program - Day 1 Oral & Poster Sessions

Tuesday, 21 June 2022
                         Day 1
09:30 - 11:00    Opening Ceremony
[Video]
Address by the LREC Chair, Nicoletta Calzolari
Address by the ELRA President, António Branco
Address by the ELRA Secretary General, Khalid Choukri
Address by the LREC Committee Chairs, Frédéric Béchet and Philippe Blache
11:00 - 11:20    ELRA: Next 25 Years
[Video]
11:20 - 11:40    Coffee Break
11:40 - 13:00    Session O1: Machine Translation and Evaluation - Auditorium
Chair: Macken, Lieve
Co-Chair: Basta, Christine
11:40 - 12:00    Domain Adaptation in Neural Machine Translation using a Qualia-Enriched FrameNet
[Paper] [Slides] [Video]
Alexandre Diniz da Costa1, Mateus Coutinho Marim2, Ely Matos1, Tiago Timponi Torrent2
1UFJF - Federal University of Juiz de Fora, 2Federal University of Juiz de Fora
12:00 - 12:20    HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Professional Post-Editing Towards More Effective MT Evaluation
[Paper] [Slides] [Video]
Serge Gladkoff1 and Lifeng Han2
1Logrus Global, 2Dublin City University
12:20 - 12:40    Priming Ancient Korean Neural Machine Translation
[Paper] [Slides] [Video]
chanjun park1, Seolhwa Lee2, Jaehyung Seo1, Hyeonseok Moon3, Sugyeong Eo1, Heuiseok Lim1
1korea university, 2University of Copenhagen, 3glee889@korea.ac.kr
12:40 - 13:00    GECO-MT: The Ghent Eye-tracking Corpus of Machine Translation
[Paper] [Slides] [Video]
Toon Colman, Margot Fonteyne, Joke Daems, Nicolas Dirix, Lieve Macken
Ghent University
11:40 - 13:00    Session O2: Semantics and Lexicon - Salle 120
Chair: Krek, Simon
Co-Chair: Cruz González, Rafael
11:40 - 12:00    Introducing Frege to Fillmore: A FrameNet Dataset that Captures both Sense and Reference
[Paper] [Slides] [Video]
Levi Remijnse1, Piek Vossen2, Antske Fokkens3, Sam Titarsolej3
1Vrije Universiteit, 2VU University Amsterdam, 3VU Amsterdam
12:00 - 12:20    Compiling a Suitable Level of Sense Granularity in a Lexicon for AI Purposes: The Open Source COR Lexicon
[Paper] [Slides] [Video]
Bolette Pedersen1, Nathalie Sørensen1, Sanni Nimb2, Ida Flørke3, Sussi Olsen4, Thomas Troelsgård5
1University of Copenhagen, 2Society for Danish Language and Literature (DSL), 3The Danish Society for Language and Literature, 4UCPH, Centre for Language Technology, 5Society for Danish Language and Literature
12:20 - 12:40    Sense and Sentiment
[Paper] [Slides] [Video]
Francis Bond1 and Merrick Choo2
1Palacký University, 2NTU
12:40 - 13:00    Enriching Linguistic Representation in the Cantonese Wordnet and Building the New Cantonese Wordnet Corpus
[Paper] [Slides] [Video]
Ut Seong Sio1 and Luís Morgado da Costa2
1Nanyang Technological University, 2Palacký University
11:40 - 13:00    Session O3: Corpus and Annotation (1) - La Major
Chair: Fišer, Darja
Co-Chair: Chersoni, Emmanuele
11:40 - 12:00    ZAEBUC: An Annotated Arabic-English Bilingual Writer Corpus
[Paper] [Slides] [Video]
Nizar Habash1 and David Palfreyman2
1New York University Abu Dhabi, 2Zayed University
12:00 - 12:20    Turkish Universal Conceptual Cognitive Annotation
[Paper] [Slides] [Video]
Necva Bölücü1 and Burcu Can2
1Hacettepe University, 2Wolverhampton UK
12:20 - 12:40    Introducing the CURLICAT Corpora: Seven-language Domain Specific Annotated Corpora from Curated Sources
[Paper] [Video]
Tamás Váradi1, Bence Nyéki1, Svetla Koeva2, Marko Tadić3, Vanja Štefanec4, Maciej Ogrodniczuk5, Bartłomiej Nitoń5, Piotr Pęzik6, Verginica Barbu Mititelu7, Elena Irimia7, Maria Mitrofan7, Dan Tufiș7, Radovan Garabík8, Simon Krek9, Andraž Repar9
1Hungarian Research Centre for Linguistics, Budapest, 2Institute for Bulgarian Language, Bulgarian Academy of Sciences, Sofia, 3University of Zagreb, Faculty of Humanities and Social Sciences, 4University of Zagreb, Faculty of Humanities and Social Sciences, Zagreb, 5Institute of Computer Science, Polish Academy of Sciences, Warsaw, 6University of Łódź, Łódź, 7RACAI, Bucharest, 8Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences, Bratislava, 9IJS, Ljubljana
12:40 - 13:00    RU-ADEPT: Russian Anonymized Dataset with Eight Personality Traits
[Paper] [Slides] [Video]
C. Anton Rytting1, Valerie Novak1, James Hull1, Victor Frank1, Paul Rodrigues2, Jarrett Lee1, Laurel Miller-Sims3
1University of Maryland College Park, 2Accenture, 3University of Maryland
11:40 - 13:00    Session O4: Dialogue (1) - Salle 92
Chair: Navarretta, Costanza
Co-Chair: Higashinaka, Ryuichiro
11:40 - 12:00    CoQAR: Question Rewriting on CoQA
[Paper] [Video]
Quentin Brabant1, Gwénolé Lecorvé2, Lina M. Rojas Barahona3
1Orange Innovation, 2Orange, 3Orange Labs
12:00 - 12:20    User Interest Modelling in Argumentative Dialogue Systems
[Paper] [Video]
Annalena Aicher1, Nadine Gerstenlauer1, Wolfgang Minker1, Stefan Ultes2
1Ulm University, 2Mercedes-Benz AG
12:20 - 12:40    Every time I fire a conversational designer, the performance of the dialogue system goes down
[Paper] [Video]
Giancarlo Xompero1, Michele Mastromattei2, Samir Salman3, Cristina Giannone4, Andrea Favalli5, Raniero Romagnoli5, Fabio Massimo Zanzotto2
1Almawave SpA, 2University of Rome Tor Vergata, 3University of Rome "Tor Vergata", 4Almawave srl, 5Almawave
12:40 - 13:00    An Empirical Study on the Overlapping Problem of Open-Domain Dialogue Datasets
[Paper] [Slides] [Video]
Yuqiao Wen, Guoqing Luo, Lili Mou
University of Alberta
11:40 - 13:00    Session: P1 - Language Resource Infrastructures and Policy issues - Poster Area 1
Chair: Labropoulou, Penny
   Language Technologies for the Creation of Multilingual Terminologies. Lessons Learned from the SSHOC Project
[Paper] [Video]
Federica Gamba1, Francesca Frontini2, Daan Broeder3, Monica Monachini4
1Istituto di Linguistica Computazionale “A. Zampolli” (ILC-CNR), 2Istituto di Linguistica Computazionale "A. Zampolli" - ILC Consiglio Nazionale delle Ricerche - CNR, 3CLARIN ERIC, 4Institute of Computational Linguistics "A. Zampolli" - CNR
   How to be FAIR when you CARE: The DGS Corpus as a Case Study of Open Science Resources for Minority Languages
[Paper] [Video]
Marc Schulder and Thomas Hanke
University of Hamburg
   Italian NLP for Everyone: Resources and Models from EVALITA to the European Language Grid
[Paper] [Poster] [Video]
Valerio Basile1, Cristina Bosco2, Michael Fell1, Viviana Patti3, Rossella Varvara4
1University of Turin, 2Dipartimento di Informatica - Università di Torino, 3University of Turin, Dipartimento di Informatica, 4University of Fribourg
   Cross-Lingual Link Discovery for Under-Resourced Languages
[Paper] [Poster] [Video]
Michael Rosner1, Sina Ahmadi2, Elena-Simona Apostol3, Julia Bosque-Gil4, Christian Chiarcos5, Milan Dojchinovski6, Katerina Gkirtzou7, Jorge Gracia4, Dagmar Gromann8, Chaya Liebeskind9, Giedrė Valūnaitė Oleškevičienė10, Gilles Sérasset11, Ciprian-Octavian Truică12
1University of Malta, 2NUI, Galway, 3University Politehnica of Bucharest, Romania, 4University of Zaragoza, 5Goethe-Universität Frankfurt am Main, 6CTU in Prague / InfAI, Germany, 7ILSP/Athena Research Center, 8University of Vienna, 9Jerusalem College of Technology , Lev Academic Center, 10Mykolas Romeris University, 11Université Grenoble Alpes, 12Uppsala University
11:40 - 13:00    Session: P2 - Social Media Processing - Poster Area 1
Chair: Parde, Natalie
   Angry or Sad ? Emotion Annotation for Extremist Content Characterisation
[Paper] [Video]
Valentina Dragos1, Delphine Battistelli2, Aline Etienne2, Yolène Constable1
1ONERA, 2MODYCO
   Identification of Multiword Expressions in Tweets for Hate Speech Detection
[Paper] [Video]
Nicolas Zampieri1, Carlos Ramisch2, Irina Illina3, Dominique Fohr1
1LORIA-INRIA, 2Aix Marseille University, CNRS, LIS, 3LORIA/INRIA
   Causal Investigation of Public Opinion during the COVID-19 Pandemic via Social Media Text
[Paper] [Poster] [Video]
Michael Jantscher and Roman Kern
Graz University of Technology
   Misspelling Semantics in Thai
[Paper] [Poster] [Video]
Pakawat Nakwijit and Matthew Purver
Queen Mary University of London
   Automatic Detection of Stigmatizing Uses of Psychiatric Terms on Twitter
[Paper] [Video]
Véronique MORICEAU1, Farah Benamara2, Abdelmoumene Boumadane3
1IRIT, Université Toulouse 3, 2University of toulouse, 3université Paris Saclay
   CoVERT: A Corpus of Fact-checked Biomedical COVID-19 Tweets
[Paper] [Poster] [Video]
Isabelle Mohr1, Amelie Wührl2, Roman Klinger2
1Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, 2University of Stuttgart
   XLM-T: Multilingual Language Models in Twitter for Sentiment Analysis and Beyond
[Paper] [Poster] [Video]
Francesco Barbieri1, Luis Espinosa Anke2, Jose Camacho-Collados2
1Snap Inc., 2Cardiff University
   ‘Am I the Bad One’? Predicting the Moral Judgement of the Crowd Using Pre–trained Language Models
[Paper] [Poster] [Video]
Areej Alhassan1, Jinkai Zhang2, Viktor Schlegel2
1King Saud University, 2University of Manchester
11:40 - 13:00    Session: P3 - Natural Language Generation (including Summarization) (1) - Poster Area 1
Chair: Reed, Chris
   Generating Questions from Wikidata Triples
[Paper] [Poster] [Video]
Kelvin Han1, Thiago Castro Ferreira2, Claire Gardent3
1Loria/CNRS, 2Federal University of Minas Gerais, 3CNRS/LORIA
   Evaluating Transformer Language Models on Arithmetic Operations Using Number Decomposition
[Paper] [Poster] [Video]
Matteo Muffo, Aldo Cocco, Enrico Bertino
Indigo.ai
   Evaluating the Effects of Embedding with Speaker Identity Information in Dialogue Summarization
[Paper] [Video]
Yuji Naraki, Tetsuya Sakai, Yoshihiko Hayashi
Waseda University
   Perceived Text Quality and Readability in Extractive and Abstractive Summaries
[Paper] [Poster] [Video]
Julius Monsen and Evelina Rennes
Linköping University
   Learning to Prioritize: Precision-Driven Sentence Filtering for Long Text Summarization
[Paper] [Poster] [Video]
Alex Mei1, Anisha Kabir1, Rukmini Bapat1, John Judge1, Tony Sun1, William Yang Wang2
1University of California, Santa Barbara, 2Unversity of California, Santa Barbara
   Automating Horizon Scanning in Future Studies
[Paper] [Poster] [Video]
Tatsuya Ishigaki1, Suzuko Nishino2, Sohei Washino3, Hiroki Igarashi3, Yukari Nagai2, Yuichi Washida4, Akihiko Murai3
1National Institute of Advanced Industrial Science and Technology (AIST), 2Japan Advanced Institute of Science and Technology, 3National Institute of Advanced Industrial Science and Technology, 4Hitotsubashi University
11:40 - 13:00    Session: P4 - Statistical Methods and Machine Learning (1) - Poster Area 1
Chair: Mesgar, Mohsen
   ViHealthBERT: Pre-trained Language Models for Vietnamese in Health Text Mining
[Paper] [Video]
Nguyen Minh1, Vu Tran1, Vu Hoang1, Huy Ta1, Trung Bui2, Steven Truong1
1Vinbrain, 2Vinbrain; Adobe Research
   Privacy-Preserving Graph Convolutional Networks for Text Classification
[Paper] [Poster] [Video]
Timour Igamberdiev1 and Ivan Habernal2
1Technical University of Darmstadt, 2Technische Universität Darmstadt
   ArMATH: a Dataset for Solving Arabic Math Word Problems
[Paper] [Poster] [Video]
Reem Alghamdi1, Zhenwen Liang2, Xiangliang Zhang2
1King Abdullah University of Science and Technology (KAUST), 2University of Notre Dame
   KIMERA: Injecting Domain Knowledge into Vacant Transformer Heads
[Paper] [Poster] [Video]
Benjamin Winter, Alexei Rosero, Alexander Löser, Felix Gers, Amy Siu
Berliner Hochschule für Technik
   Distilling the Knowledge of Romanian BERTs Using Multiple Teachers
[Paper] [Poster] [Video]
Andrei-Marius Avram1, Darius Catrina2, Dumitru-Clementin Cercel3, Mihai Dascalu3, Traian Rebedea3, Vasile Pais1, Dan Tufis1
1Research Institute for Artificial Intelligence, Romanian Academy, 2Duke University, 3University Politehnica of Bucharest
   Personalized Filled-pause Generation with Group-wise Prediction Models
[Paper] [Poster] [Video]
Yuta Matsunaga, Takaaki Saeki, Shinnosuke Takamichi, Hiroshi Saruwatari
Graduate School of Information Science and Technology, The University of Tokyo
   Transformer versus LSTM Language Models trained on Uncertain ASR Hypotheses in Limited Data Scenarios
[Paper] [Poster] [Video]
Imran Sheikh1, Emmanuel Vincent2, Irina Illina3
1Vivoka, 2Inria, 3LORIA/INRIA
   Out of Thin Air: Is Zero-Shot Cross-Lingual Keyword Detection Better Than Unsupervised?
[Paper] [Poster] [Video]
Boshko Koloski1, Senja Pollak2, Blaž Škrlj2, Matej Martinc1
1Jozef Stefan Institute, 2Jožef Stefan Institute
   Evaluating Pretraining Strategies for Clinical BERT Models
[Paper] [Poster] [Video]
Anastasios Lamproudis1, Aron Henriksson2, Hercules Dalianis1
1DSV/Stockholm University, 2Department of Computer and Systems Sciences (DSV), Stockholm University
11:40 - 13:00    Session: P5 - Information Extraction (1) - Poster Area 1
Chair: Ferret, Olivier
   KazNERD: Kazakh Named Entity Recognition Dataset
[Paper] [Poster] [Video]
Rustem Yeshpanov1, Yerbolat Khassanov2, Huseyin Atakan Varol2
1Institute of Smart Systems and Artificial Intelligence, Nazarbayev University, 2Nazarbayev University
   Mitigating Dataset Artifacts in Natural Language Inference Through Automatic Contextual Data Augmentation and Learning Optimization
[Paper] [Poster] [Video]
Michail Mersinias and Panagiotis Valvis
University of Texas at Austin
   Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning
[Paper] [Poster] [Video]
Mike Zhang1, Kristian Nørgaard Jensen1, Barbara Plank2
1IT University of Copenhagen, 2LMU Munich
   Semantic Role Labelling for Dutch Law Texts
[Paper] [Poster] [Video]
Roos Bakker1, Romy van Drie1, Maaike de Boer1, Robert van Doesburg2, Tom van Engers2
1TNO, 2TNO, Leibniz Center for Law, UvA
   English Language Spelling Correction as an Information Retrieval Task Using Wikipedia Search Statistics
[Paper] [Poster] [Video]
Kyle Goslin and Markus Hofmann
TU Dublin
   CrudeOilNews: An Annotated Crude Oil News Corpus for Event Extraction
[Paper] [Poster] [Video]
Meisin Lee, Lay-Ki Soon, Eu Gene Siew, Ly Fie Sugianto
Monash University
   Claim Extraction and Law Matching for COVID-19-related Legislation
[Paper] [Poster] [Video]
Niklas Dehio1, Malte Ostendorff2, Georg Rehm3
1Technical University Berlin, 2German Research Center for Artificial Intelligence, 3DFKI
   Constructing A Dataset of Support and Attack Relations in Legal Arguments in Court Judgements using Linguistic Rules
[Paper] [Poster] [Video]
Basit Ali1, Sachin Pawar2, Girish Palshikar3, Rituraj Singh1
1TCS Research, 2Tata Consultancy Services Ltd., 3Tata Consultancy Services Limited
   KIND: an Italian Multi-Domain Dataset for Named Entity Recognition
[Paper] [Poster] [Video]
Teresa Paccosi1 and Alessio Palmero Aprosio2
1Università degli Studi di Trento, 2Fondazione Bruno Kessler
   Russian Jeopardy! Data Set for Question-Answering Systems
[Paper] [Poster] [Video]
Elena Mikhalkova and Alexander Khlyupin
Tyumen State University
   Know Better – A Clickbait Resolving Challenge
[Paper] [Poster] [Video]
Benjamin Hättasch1 and Carsten Binnig2
1DM Lab, Technische Universität Darmstadt, 2TU Darmstadt
   Valet: Rule-Based Information Extraction for Rapid Deployment
[Paper] [Video]
Dayne Freitag, John Cadigan, Robert Sasseen, Paul Kalmar
SRI International
   Negation Detection in Dutch Spoken Human-Computer Conversations
[Paper] [Video]
Tom Sweers1, Iris Hendrickx2, Helmer Strik3
1Centre for Language and Speech Technology (CLST), Centre for Language Studies (CLS), Radboud University, Nijmegen, 2Centre for Language Studies, Radboud University Nijmegen, 3Centre for Language and Speech Technology (CLST), Centre for Language Studies (CLS), Radboud University Nijmegen
13:00 - 14:30    Lunch Break
14:30 - 15:10    Keynote Speaker: Julia Parish-Morris - Auditorium
[Video]
Chair: Cieri, Chris
15:15 - 16:35    Session O5: Language Resource Policies and Management - Auditorium
Chair: Di Persio, Denise
Co-Chair: Frontini, Francesca
15:15 - 15:35    Reflections on 30 Years of Language Resource Development and Sharing
[Paper] [Slides] [Video]
Christopher Cieri1, Mark Liberman2, Sunghye Cho1, Stephanie Strassel1, James Fiumara1, Jonathan Wright2
1Linguistic Data Consortium, University of Pennsylvania, 2University of Pennsylvania
15:35 - 15:55    Language Resources to Support Language Diversity – the ELRA Achievements
[Paper] [Slides] [Video]
Valérie Mapelli1, Victoria Arranz1, Khalid Choukri2, Hélène Mazo1
1ELDA, 2ELRA/ELDA
15:55 - 16:15    Ethical Issues in Language Resources and Language Technology – Tentative Categorisation
[Paper] [Slides] [Video]
Pawel Kamocki1 and Andreas Witt2
1Leibniz Institute for German Language, 2Leibniz Institute for the German Language
16:15 - 16:35    Do we Name the Languages we Study? The #BenderRule in LREC and ACL articles
[Paper] [Slides] [Video]
Fanny Ducel1, Karën Fort2, Gaël Lejeune3, Yves Lepage4
1Sorbonne Université, 2Sorbonne Université and LORIA, 3STIH, Paris-Sorbonne, 4Waseda University
15:15 - 16:35    Session O6: Emotion and Sentiment - La Major
Chair: Agerri, Rodrigo
Co-Chair: Labat, Sofie
15:15 - 15:35    Aspect-Based Emotion Analysis and Multimodal Coreference: A Case Study of Customer Comments on Adidas Instagram Posts
[Paper] [Video]
Luna De Bruyne1, Akbar Karimi2, Orphee De Clercq3, Andrea Prati2, Veronique Hoste3
1LT3, Language and Translation Technology Team, Ghent University, 2IMP Lab, University of Parma, 3LT3, Ghent University
15:35 - 15:55    Multi-source Multi-domain Sentiment Analysis with BERT-based Models
[Paper] [Slides] [Video]
Gabriel Roccabruna, Steve Azzolin, Giuseppe Riccardi
University of Trento
15:55 - 16:15    NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis
[Paper] [Slides] [Video]
Shamsuddeen Muhammad1, David Adelani2, Anuoluwapo Aremu3, Idris Abdulmumin4
1Faculty of computer science, University of Porto, 2Saarland University, 3Masakhane, 4Ahmadu Bello University, Zaria
16:15 - 16:35    A (Psycho-)Linguistically Motivated Scheme for Annotating and Exploring Emotions in a Genre-Diverse Corpus
[Paper] [Slides] [Video]
Aline Etienne1, Delphine Battistelli1, Gwénolé Lecorvé2
1MoDyCo, 2Orange
15:15 - 16:35    Session O7: Knowledge Discovery and Evaluation - Salle 120
Chair: Rigau, German
Co-Chair: Vezzani, Federica
15:15 - 15:35    Integrating a Phrase Structure Corpus Grammar and a Lexical-Semantic Network: the HOLINET Knowledge Graph
[Paper] [Slides] [Video]
Jean-Philippe Prost
Aix-Marseille Université
15:35 - 15:55    On the Impact of Temporal Representations on Metaphor Detection
[Paper] [Slides] [Video]
Giorgio Ottolina1, Matteo Palmonari1, Manuel Vimercati1, Mehwish Alam2
1University of Milano-Bicocca at Milan, 2FIZ Karlsruhe - Leibniz Institute for Information Infrastructure, AIFB Institute, KIT
15:55 - 16:15    Analysis and Prediction of NLP Models via Task Embeddings
[Paper] [Slides] [Video]
Damien Sileo and Marie-Francine Moens
KU Leuven
16:15 - 16:35    Cross-lingual and Cross-domain Transfer Learning for Automatic Term Extraction from Low Resource Data
[Paper] [Slides] [Video]
Amir Hazem1, Merieme Bouhandi2, Florian Boudin3, Beatrice Daille4
1LS2N UMR CNRS 6004, 2LS2N, 3Université de Nantes, 4Université de Nantes - LS2N
15:15 - 16:35    Session O8: Applications involving LRs and Evaluation (1) - Salle 92
Chair: Rehm, Georg
Co-Chair: ImaniGooghari, Ayyoob
15:15 - 15:35    Few-Shot Learning for Argument Aspects of the Nuclear Energy Debate
[Paper] [Slides] [Video]
Lena Jurkschat1, Gregor Wiedemann2, Maximilian Heinrich3, Mattes Ruckdeschel4, Sunna Torge1
1Technische Universität Dresden, 2Leibniz Institute for Media Research | Hans-Bredow-Institute, 3Leipzig University, Germany, 4Leibniz-Institute for Media Research|Hans-Bredow-Institute, Germany
15:55 - 16:15    MuLVE, A Multi-Language Vocabulary Evaluation Data Set
[Paper] [Slides] [Video]
Anik Jacobsen1, Salar Mohtaj2, Sebastian Möller3
1TU Berlin, 2Technische Universität Berlin, 3Quality and Usability Lab, TU Berlin
16:15 - 16:35    PLOD: An Abbreviation Detection Dataset for Scientific Documents
[Paper] [Slides] [Video]
Leonardo Zilio1, Hadeel Saadany2, Prashant Sharma1, Diptesh Kanojia1, Constantin Orăsan1
1University of Surrey, 2University of Wolverhampton
15:15 - 16:35    Session: P6 - Corpora and Annotation (1) - Poster Area 2
Chair: Biemann, Chris
   Potential Idiomatic Expression (PIE)-English: Corpus for Classes of Idioms
[Paper] [Poster] [Video]
Tosin Adewumi1, Roshanak Vadoodi1, Aparajita Tripathy1, Konstantina Nikolaido1, Foteini Liwicki1, Marcus Liwicki2
1Luleå University of Technology, 2Luleå University
   LeSpell - A Multi-Lingual Benchmark Corpus of Spelling Errors to Develop Spellchecking Methods for Learner Language
[Paper] [Poster] [Video]
Marie Bexte1, Ronja Laarmann-Quante1, Andrea Horbach1, Torsten Zesch2
1FernUniversität in Hagen, 2Computational Linguistics, FernUniversität in Hagen
   Subjective Text Complexity Assessment for German
[Paper] [Poster] [Video]
Laura Seiffe1, Fares Kallel1, Sebastian Möller2, Babak Naderi3, Roland Roller4
1Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), 2Quality and Usability Lab, TU Berlin, 3Technische Universität Berlin, 4DFKI LT Lab
   Querying Interaction Structure: Approaches to Overlap in Spoken Language Corpora
[Paper] [Poster] [Video]
Elena Frick1, Thomas Schmidt2, Henrike Helmer1
1Leibniz-Institute for German Language, 2IDS Mannheim
   DiaBiz – an Annotated Corpus of Polish Call Center Dialogs
[Paper] [Poster] [Video]
Piotr Pęzik, Gosia Krawentek, Sylwia Karasińska, Paweł Wilk, Paulina Rybińska, Anna Cichosz, Angelika Peljak-Łapińska, Mikołaj Deckert, Michał Adamczyk
University of Lodz
   LaVA – Latvian Language Learner corpus
[Paper] [Poster] [Video]
Roberts Darģis1, Ilze Auziņa2, Inga Kaija3, Kristīne Levāne-Petrova2, Kristīne Pokratniece1
1Institute of Mathematics and Computer Science, University of Latvia, 2Institte of Mathematics and Computer Science, University of Latvia, 3Rīga Stradiņš University
   The EuroPat Corpus: A Parallel Corpus of European Patent Data
[Paper] [Poster] [Video]
Kenneth Heafield1, Elaine Farrow2, Jelmer van der Linde2, Gema Ramírez-Sánchez3, Dion Wiggins4
1University of Edinburgh, 2School of Informatics, University of Edinburgh, 3Prompsit Language Engineering, SL (PLE), 4Omniscien Technologies
   "Beste Grüße, Maria Meyer" — Pseudonymization of Privacy-Sensitive Information in Emails
[Paper] [Poster] [Video]
Elisabeth Eder1, Michael Wiegand2, Ulrike Krieg-Holz1, Udo Hahn3
1University of Klagenfurt, 2Alpen-Adria-Universitaet Klagenfurt, 3Friedrich-Schiller-Universität Jena
   Criteria for the Annotation of Implicit Stereotypes
[Paper] [Video]
Wolfgang Schmeisser-Nieto1, Montserrat Nofre1, Mariona Taulé2
1Universitat de Barcelona, 2University of Barcelona
   Common Phone: A Multilingual Dataset for Robust Acoustic Modelling
[Paper] [Poster] [Video]
Philipp Klumpp1, Tomas Arias2, Paula Andrea Pérez-Toro3, Elmar Noeth4, Juan Orozco-Arroyave2
1Friedrich-Alexander-Universität Erlangen-Nürnberg, 2Universidad de Antioquia, 3University of Erlangen-Nuremberg, 4Friedrich-Alexander-University Erlangen-Nuremberg
   Curras + Baladi: Towards a Levantine Corpus
[Paper] [Poster] [Video]
Karim Al-Haff1, Mustafa Jarrar2, Tymaa Hammouda2, Fadi Zaraket3
1University of Strasbourg, 2Birzeit University, 3American University of Beirut
   Annotation Study of Japanese Judgments on Tort for Legal Judgment Prediction with Rationales
[Paper] [Poster] [Video]
Hiroaki Yamada1, Takenobu Tokunaga1, Ryutaro Ohara2, Keisuke Takeshita3, Mihoko Sumida3
1Tokyo Institute of Technology, 2Nakamura, Tsunoda & Matsumoto, 3Hitotsubashi University
   Placing M-Phasis on the Plurality of Hate: A Feature-Based Corpus of Hate Online
[Paper] [Poster] [Video]
Dana Ruiter1, Liane Reiners2, Ashwin Geet D'Sa3, Thomas Kleinbauer1, Dominique Fohr3, Irina Illina4, Dietrich Klakow1, Christian Schemer2, Angeliki Monnier5
1Saarland University, 2Johannes Gutenberg University Mainz (JGU), 3LORIA-INRIA, 4LORIA/INRIA, 5Université de Lorraine
   ParCorFull2.0: a Parallel Corpus Annotated with Full Coreference
[Paper] [Poster] [Video]
Ekaterina Lapshinova-Koltunski1, Pedro Ferreira2, Elina Lartaud3, Christian Hardmeier4
1Universität des Saarlandes, 2University of Aveiro, 3Uppsala University, 4IT University of Copenhagen/Uppsala University
   A Multi-Party Dialogue Ressource in French
[Paper] [Poster] [Video]
Maria Boritchev1 and Maxime Amblard2
1Université de Lorraine, CNRS, Inria, LORIA, F-54000 Nancy, EPC, 2Université de Lorraine
   Bicleaner AI: Bicleaner Goes Neural
[Paper] [Video]
Jaume Zaragoza-Bernabeu1, Gema Ramírez-Sánchez1, Marta Bañón2, Sergio Ortiz Rojas1
1Prompsit Language Engineering, 2Prompsit SL
   Semi-automatically Annotated Learner Corpus for Russian
[Paper] [Poster] [Video]
Anisia Katinskaia1, Maria Lebedeva2, Jue Hou1, Roman Yangarber1
1University of Helsinki, 2Language and Coginition Laboratory, Pushkin State Russian Language Institute
   UniMorph 4.0: Universal Morphology
[Paper] [Video]
Khuyagbaatar Batsuren1, Omer Goldman2, Salam Khalifa3, Nizar Habash4, Witold Kieraś5, Gábor Bella6, Brian Leonard7, Garrett Nicolai8, Kyle Gorman9, Yustinus Ate10, Maria Ryskina11, Sabrina Mielke7, Elena Budianskaya12, Charbel El-Khaissi13, Tiago Pimentel14, Michael Gasser15, William Lane16, Mohit Raj17, Matt Coler18, Jaime Samame19, Delio Camaiteri20, Esaú Rojas20, Didier Francis20, Arturo Oncevay21, Juan Bautista20, Gema Villegas19, Lucas Hennigen14, Adam Ek22, David Guriel23, Peter Dirix24, Jean-Philippe Bernardy22, Andrey Scherbakov25, Aziyana Bayyr-ool26, Antonios Anastasopoulos27, Roberto Zariquiey19, Karina Sheifer28, Sofya Ganieva29, Hilaria Cruz30, Ritván Karahóǧa31, Stella Markantonatou31, George Pavlidis31, Matvey Plugaryov29, Elena Klyachko32, Ali Salehi33, Candy Angulo19, Jatayu Baxi34, Andrew Krizhanovsky35, Natalia Krizhanovskaya35, Elizabeth Salesky7, Clara Vania36, Sardana Ivanova37, Jennifer White14, Rowan Maudslay14, Josef Valvoda14, Ran Zmigrod14, Paula Czarnowska14, Irene Nikkarinen14, Aelita Salchak38, brijesh bhatt34, Christopher Straughn39, Zoey Liu40, Jonathan Washington41, Yuval Pinter42, Duygu Ataman43, Marcin Wolinski5, Totok Suhardijanto44, Anna Yablonskaya45, Niklas Stoehr46, Hossep Dolatian3, Zahroh Nuriah44, Shyam Ratan17, Francis Tyers47, Edoardo Ponti48, Grant Aiton13, Aryaman Arora49, Richard Hatcher33, Ritesh Kumar17, Jeremiah Young50, Daria Rodionova45, Anastasia Yemelina45, Taras Andrushko45, Igor Marchenko45, Polina Mashkovtseva45, Alexandra Serova45, Emily Prud'hommeaux40, Maria Nepomniashchaya45, fausto giunchiglia51, Eleanor Chodroff52, Mans Hulden53, Miikka Silfverberg8, Arya D. McCarthy7, David Yarowsky7, Ryan Cotterell46, Reut Tsarfaty23, Ekaterina Vylomova54
1National University of Mongolia, 2Bar Ilan University, 3Stony Brook University, 4New York University Abu Dhabi, 5Institute of Computer Science, Polish Academy of Sciences, 6University of Trento, 7Johns Hopkins University, 8University of British Columbia, 9The Graduate Center, City University of New York, 10STKIP Weetebula, 11Carnegie Mellon University, 12Institute of Linguistics, Russian Academy of Sciences, 13Australian National University, 14University of Cambridge, 15Indiana University, 16Charles Darwin University, 17Dr. Bhimrao Ambedkar University, 18University of Groningen, 19Pontificia Universidad Católica del Perú, 20Universidad Católica Sedes Sapientiae, Filial Atalaya, 21University of Edinburgh, 22University of Gothenburg, 23Bar-Ilan University, 24Katholieke Universiteit Leuven, 25The University of Melbourne, 26Institute of Philology of the Siberian Branch of the Russian Academy of Sciences, 27George Mason University, 28Higher School of Economics; Institute of Linguistics, Russian Academy of Sciences; Institute for System Programming, Russian Academy of Sciences, 29Moscow State University; Institute of Linguistics, Russian Academy of Sciences, 30University of Louisville, 31ILSP/Athena RC, 32Higher School of Economics; Institute of Linguistics, Russian Academy of Sciences, 33University at Buffalo, 34Dharmsinh Desai University, 35Karelian Research Centre of the Russian Academy of Sciences, 36Amazon, 37University of Helsinki, 38Tuvan State University, 39Northeastern Illinois University, 40Boston College, 41Swarthmore College, 42Ben-Gurion University of the Negev, 43University of Zürich, 44Universitas Indonesia, 45Higher School of Economics, 46ETH Zürich, 47Indiana University; Higher School of Economics, 48Mila/McGill University Montreal, 49Georgetown University, 50University of Oregon, 51Univesity of Trento, 52University of York, 53University of Colorado Boulder, 54University of Melbourne
   Textinator: an Internationalized Tool for Annotation and Human Evaluation in Natural Language Processing and Generation
[Paper] [Poster] [Video]
Dmytro Kalpakchi1 and Johan Boye2
1KTH Royal Institute of Technology, 2KTH
   CyberAgressionAdo-v1: a Dataset of Annotated Online Aggressions in French Collected through a Role-playing Game
[Paper] [Poster] [Video]
Anaïs Ollagnier1, Elena Cabrio1, Serena Villata1, Catherine Blaya2
1Université Côte d’Azur, Inria, CNRS, I3S, 2Université Côte d’Azur, CNRS, Unité de Recherche Migrations et Société (Urmis)
   Finnish Hate-Speech Detection on Social Media Using CNN and FinBERT
[Paper] [Poster] [Video]
Md Saroar Jahan, Mourad Oussalah, Nabil Arhab
University of Oulu
15:15 - 16:35    Session: P7 - Multilinguality and Machine Translation (1) - Poster Area 2
Chair: Çöltekin, Çağrı
   Empirical Analysis of Noising Scheme based Synthetic Data Generation for Automatic Post-editing
[Paper] [Poster] [Video]
Hyeonseok Moon1, chanjun park2, Seolhwa Lee3, Jaehyung Seo2, Jungseob Lee2, Sugyeong Eo2, Heuiseok Lim2
1glee889@korea.ac.kr, 2korea university, 3University of Copenhagen
   Domain Mismatch Doesn't Always Prevent Cross-lingual Transfer Learning
[Paper] [Poster] [Video]
Daniel Edmiston1, Phillip Keung2, Noah A. Smith2
1Amazon, 2University of Washington
   Cross-Lingual Knowledge Transfer for Clinical Phenotyping
[Paper] [Poster] [Video]
Jens-Michalis Papaioannou1, Paul Grundmann1, Betty van Aken1, Athanasios Samaras2, Ilias Kyparissidis3, George Giannakoulas2, Felix Gers4, Alexander Loeser5
1Berliner Hochschule für Technik (BHT), 2First Department of Cardiology, AHEPA University Hospital, Aristotle University of Thessaloniki, Greece, 3Lab of Medical Physics, Aristotle University of Thessaloniki, Greece, 4Beuth Univeristy Berlin, 5Beuth-University of Applied Sciences Berlin
   The Multilingual Microblog Translation Corpus: Improving and Evaluating Translation of User-Generated Text
[Paper] [Video]
Paul McNamee and Kevin Duh
Johns Hopkins University
   Multilingual and Multimodal Learning for Brazilian Portuguese
[Paper] [Video]
Júlia Sato1, Helena Caseli1, Lucia Specia2
1Federal University of São Carlos, 2Imperial College London
   LibriS2S: A German-English Speech-to-Speech Translation Corpus
[Paper] [Poster] [Video]
Pedro Jeuris1 and Jan Niehues2
1Department of Data Science and Knowledge Engineering,Maastricht University, Netherlands, 2Karlsruhe Institut of Technology
   A Linguistically Motivated Test Suite to Semi-Automatically Evaluate German--English Machine Translation Output
[Paper] [Poster] [Video]
Vivien Macketanz1, Eleftherios Avramidis1, Aljoscha Burchardt2, He Wang1, Renlong Ai2, Shushen Manakhimova1, Ursula Strohriegel1, Sebastian Möller3, Hans Uszkoreit4
1German Research Center for Artificial Intelligence (DFKI), 2DFKI, 3Quality and Usability Lab, TU Berlin, 4DFKI and Saarland University
   Cross-lingual Transfer of Monolingual Models
[Paper] [Poster] [Video]
Evangelia Gogoulou1, Ariel Ekgren2, Tim Isbister2, Magnus Sahlgren2
1RISE, 2AI Sweden
   Dataset of Student Solutions to Algorithm and Data Structure Programming Assignments
[Paper] [Poster] [Video]
Fynn Petersen-Frey1, Marcus Soll2, Louis Kobras1, Melf Johannsen1, Peter Kling1, Chris Biemann1
1Universität Hamburg, 2Autal 20, 22880 Wedel, Germany
   Language Patterns and Behaviour of the Peer Supporters in Multilingual Healthcare Conversational Forums
[Paper] [Poster] [Video]
Ishani Mondal1, Kalika Bali2, Mohit Jain3, Monojit Choudhury4, Jacki O'Neill5, Millicent Ochieng5, Kagnoya Awori5, Keshet Ronen6
1Microsoft, 2Microsoft Research Labs, 3Microsoft Research India, 4Microsoft Research, 5Microsoft Africa Research Institute, 6University of Washington
   Frame Shift Prediction
[Paper] [Poster] [Video]
Zheng Xin Yong1, Patrick Watson2, Tiago Timponi Torrent3, Oliver Czulo4, Collin Baker5
1Brown University, 2Minerva University, 3Federal University of Juiz de Fora, 4Universität Leipzig, 5International Computer Science Institute
15:15 - 16:35    Session: P8 - Speech Resources and Processing (1) - Poster Area 2
Chair: Burkhardt, Felix
   CLeLfPC: a Large Open Multi-Speaker Corpus of French Cued Speech
[Paper] [Poster] [Video]
Brigitte BIGI1, Maryvonne Zimmermann2, Carine André3
1LPL, CNRS, 2Association nationale pour la promotion et le développement de la Langue française Parlée Complétée, 3LPL, CNRS, Aix-Marseille Univ.
   Samrómur Children: An Icelandic Speech Corpus
[Paper] [Poster] [Video]
Carlos Daniel Hernandez Mena1, David Erik Mollberg2, Michal Borský3, Jón Guðnason3
1University of Reykjavík, 2Reykjavik University, 3Reykiavík University
   The Norwegian Parliamentary Speech Corpus
[Paper] [Poster] [Video]
Per Erik Solberg1 and Pablo Ortiz2
1National Library of Norway, 2Telenor Research
   A Speech Recognizer for Frisian/Dutch Council Meetings
[Paper] [Video]
Martijn Bentum1, Louis ten Bosch2, Henk van den Heuvel3, Simone Wills3, Domenique van der Niet4, Jelske Dijkstra5, Hans Van de Velde5
1Centre for Language Studies, Radboud University, 2Radboud University Nijmegen, 3CLS/CLST, Radboud University Nijmegen, 4Humainr, 5Fryske Akademy
   Elderly Conversational Speech Corpus with Cognitive Impairment Test and Pilot Dementia Detection Experiment Using Acoustic Characteristics of Speech in Japanese Dialects
[Paper] [Video]
Meiko Fukuda1, Ryota Nishimura1, Maina Umezawa2, Kazumasa Yamamoto3, Yurie Iribe4, Norihide Kitaoka5
1Tokushima university, 2Aichi Prefectural University,, 3Chubu University, 4Aichi Prefectural University, 5Toyohashi University of Technology
   A Spoken Drug Prescription Dataset in French for Spoken Language Understanding
[Paper] [Poster] [Video]
Ali Can Kocabiyikoglu1, François Portet2, Prudence Gibert3, Hervé Blanchon4, Jean-Marc Babouchkine5, Gaëtan Gavazzi3
1University of Grenoble Alpes, 2Univ Grenoble Alpes, Laboratoire d'Informatique de Grenoble, 3CHU Grenoble Alpes, 4Univ. Grenoble Alpes, 5Calystene
   Towards an Open-Source Dutch Speech Recognition System for the Healthcare Domain
[Paper] [Poster] [Video]
Cristian Tejedor-García1, Berrie van der Molen2, Henk van den Heuvel3, Arjan van Hessen4, Toine Pieters2
1CLST, Radboud University, 2Freudenthal Institute, Utrecht University, Utrecht, the Netherlands, 3CLS/CLST, Radboud University Nijmegen, 4University of Twente
   A Dataset for Speech Emotion Recognition in Greek Theatrical Plays
[Paper] [Poster] [Video]
Maria Moutti1, Sofia Eleftheriou2, Panagiotis Koromilas2, Theodoros Giannakopoulos2
1University of the Peloponnese, 2National Center for Scientific Research Demokritos
   Audiobook Dialogues as Training Data for Conversational Style Synthetic Voices
[Paper] [Poster] [Video]
Liisi Piits, Hille Pajupuu, Heete Sahkai, Rene Altrov, Liis Ermus, Kairi Tamuri, Indrek Hein, Meelis Mihkla, Indrek Kiissel, Egert Männisalu, Kristjan Suluste, Jaan Pajupuu
Institute of the Estonian Language
   Using a Knowledge Base to Automatically Annotate Speech Corpora and to Identify Sociolinguistic Variation
[Paper] [Video]
Yaru WU1, Fabian Suchanek2, Ioana Vasilescu3, Lori Lamel4, Martine Adda-Decker5
1CRISCO/EA4255, Université de Caen Normandie, 14000 Caen, France; Laboratoire de Phonétique et Phonologie (UMR7018, CNRS-Sorbonne Nouvelle), France, 2Telecom Paris, 3LIMSI-CNRS, 4CNRS/LIMSI, 5LPP (Lab. Phonétique & Phonologie) / LIMSI-CNRS
   Phone Inventories and Recognition for Every Language
[Paper] [Video]
Xinjian Li1, Florian Metze1, David R. Mortensen2, Alan W Black1, Shinji Watanabe1
1Carnegie Mellon University, 2Language Technologies Institute, Carnegie Mellon University
16:35 - 16:55    Coffee Break
16:55 - 18:15    Session O9: Bio-medical Corpora - Salle 120
Chair: Melero, Maite
Co-Chair: Bawden, Rachel
16:55 - 17:15    Constructing Parallel Corpora from COVID-19 News using MediSys Metadata
[Paper] [Slides] [Video]
Dimitrios Roussis1, Vassilis Papavassiliou1, Sokratis Sofianopoulos1, Prokopis Prokopidis1, Stelios Piperidis2
1ILSP/Athena RC, 2Athena RC/ILSP
17:15 - 17:35    A Distant Supervision Corpus for Extracting Biomedical Relationships Between Chemicals, Diseases and Genes
[Paper] [Slides] [Video]
Dongxu Zhang1, Sunil Mohan2, Michaela Torkar2, Andrew McCallum3
1University of Massachusetts, Amherst, 2Chan Zuckerberg Initiative, 3UMass Amherst
17:35 - 17:55    DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries
[Paper] [Slides] [Video]
Jayetri Bardhan1, Anthony Colas1, Kirk Roberts2, Daisy Wang1
1University of Florida, 2The University of Texas Health Science Center at Houston
17:55 - 18:15    Efficiently and Thoroughly Anonymizing a Transformer Language Model for Dutch Electronic Health Records: a Two-Step Method
[Paper] [Slides] [Video]
Stella Verkijk1 and Piek Vossen2
1Vrije Universiteit Amsterdam, 2VU University Amsterdam
16:55 - 18:15    Session O10: Parsing and Tagging - Salle 92
Chair: Simov, Kiril
Co-Chair: Gamba, Federica
16:55 - 17:15    BERTrade: Using Contextual Embeddings to Parse Old French
[Paper] [Slides] [Video]
Loïc Grobol1, Mathilde Regnault2, Pedro Ortiz Suarez3, Benoît Sagot4, Laurent Romary4, Benoit Crabbé5
1Université Paris Nanterre, 2Universität Stuttgart, 3Data and Web Science Group, University of Mannheim, 4Inria, 5University of Paris
17:15 - 17:35    Out-of-Domain Evaluation of Finnish Dependency Parsing
[Paper] [Slides] [Video]
Jenna Kanerva and Filip Ginter
University of Turku
17:35 - 17:55    TArC: Tunisian Arabish Corpus, First complete release
[Paper] [Slides] [Video]
elisa gugliotta1 and Marco Dinarelli2
1Sapienza University of Rome, 2LIG
17:55 - 18:15    Towards Universal Segmentations: UniSegments 1.0
[Paper] [Slides] [Video]
Zdeněk Žabokrtský1, Niyati Bafna1, Jan Bodnár1, Lukáš Kyjánek1, Emil Svoboda1, Magda Ševčíková1, Jonáš Vidra2
1Charles University, 2Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics
16:55 - 18:15    Session O11: Less-Resourced Languages - La Major
Chair: Bird, Steven
Co-Chair: Conforti, Costanza
16:55 - 17:15    TeDDi Sample: Text Data Diversity Sample for Language Comparison and Multilingual NLP
[Paper] [Slides] [Video]
Steven Moran1, Christian Bentz2, Ximena Gutierrez-Vasques3, Olga Sozinova3, Tanja Samardzic3
1University of Neuchâtel, 2University of Tübingen, 3University of Zurich
17:15 - 17:35    Leveraging a Bilingual Dictionary to Learn Wolastoqey Word Representations
[Paper] [Video]
Diego Bear and Paul Cook
University of New Brunswick
17:35 - 17:55    Unmasking the Myth of Effortless Big Data - Making an Open Source Multi-lingual Infrastructure and Building Language Resources from Scratch
[Paper] [Slides] [Video]
Linda Wiechetek1, Katri Hiovain-Asikainen1, Inga Lill Sigga Mikkelsen2, Sjur Moshagen2, Flammie Pirinen1, Trond Trosterud1, Børre Gaup1
1UiT Norgga árktalaš universitehta, 2UiT The Arctic University of Norway
17:55 - 18:15    Building and curating conversational corpora for diversity-aware language science and technology
[Paper] [Video]
Andreas Liesenfeld and Mark Dingemanse
Radboud University
16:55 - 18:15    Session O12: Corpus Creation, Use and Evaluation (1) - Auditorium
Chair: Tadić, Marko
Co-Chair: Paccosi, Teresa
16:55 - 17:15    EPIC UdS - Creation and Applications of a Simultaneous Interpreting Corpus
[Paper] [Slides] [Video]
Heike Przybyl1, Ekaterina Lapshinova-Koltunski2, Katrin Menzel3, Stefan Fischer1, Elke Teich2
1Saarland University, 2Universität des Saarlandes, 3Saarland University, Department of Language Science and Technology
17:15 - 17:35    Development of a Benchmark Corpus to Support Entity Recognition in Job Descriptions
[Paper] [Slides] [Video]
Thomas Green1, Diana Maynard1, Chenghua Lin2
1University of Sheffield, 2University of Aberdeen
17:35 - 17:55    CAMIO: A Corpus for OCR in Multiple Languages
[Paper] [Slides] [Video]
Michael Arrigo1, Stephanie Strassel2, Nolan King3, Thao Tran3, Lisa Mason3
1Linguistic Data Consortium, 2Linguistic Data Consortium, University of Pennsylvania, 3US DOD
17:55 - 18:15    FABRA: French Aggregator-Based Readability Assessment toolkit
[Paper] [Video]
Rodrigo Wilkens1, David Alfter2, Xiaoou Wang3, Alice Pintard2, Anaïs Tack4, Kevin P. Yancey5, Thomas François6
1Université catholique de Louvain, 2UCLouvain, 3University of Louvain, 4Stanford University, 5Duolingo, 6UCLouvain, CENTAL
16:55 - 18:15    Session: P9 - Dialogue and Conversational Systems (1) - Poster Area 1
Chair: Mou, Lili
   Towards Building a Spoken Dialogue System for Argument Exploration
[Paper] [Video]
Annalena Aicher1, Nadine Gerstenlauer1, Isabel Feustel1, Wolfgang Minker1, Stefan Ultes2
1Ulm University, 2Mercedes-Benz AG
   FreeTalky: Don’t Be Afraid! Conversations Made Easier by a Humanoid Robot using Persona-based Dialogue
[Paper] [Poster] [Video]
chanjun park1, Yoonna Jang2, Seolhwa Lee3, Sungjin Park4, Heuiseok Lim1
1korea university, 2Department of Computer Science and Engineering, Korea University, 3University of Copenhagen, 4NAVER Corp.
   Self-Contained Utterance Description Corpus for Japanese Dialog
[Paper] [Video]
Yuta Hayashibe
Megagon Labs, Tokyo, Japan, Recruit Co., Ltd.
   DialCrowd 2.0: A Quality-Focused Dialog System Crowdsourcing Toolkit
[Paper] [Poster] [Video]
Jessica Huynh1, Ting-Rui Chiang1, Jeffrey Bigham2, Maxine Eskenazi1
1Carnegie Mellon University, 2CMU/Apple
   A Brief Survey of Textual Dialogue Corpora
[Paper] [Poster] [Video]
Hugo Gonçalo Oliveira1, Patrícia Ferreira2, Daniel Martins3, Catarina Silva1, Ana Alves4
1CISUC, DEI, University of Coimbra, 2CISUC, University of Coimbra and ISEC, Instituto Politécnico de Coimbra, 3ISEC, Instituto Politécnico de Coimbra, 4CISUC - University of Coimbra and Polythecnic Institute of Coimbra
   A Unified Approach to Entity-Centric Context Tracking in Social Conversations
[Paper] [Video]
Ulrich Rückert1, Srinivas Sunkara1, Abhinav Rastogi1, Sushant Prakash2, Pranav Khaitan1
1Google Research, 2Google
   A Unifying View On Task-oriented Dialogue Annotation
[Paper] [Poster] [Video]
Vojtěch Hudeček1, leon-paul Schaub2, Daniel Stancl1, Patrick Paroubek3, Ondřej Dušek1
1Charles University, 2LIMSI-CNRS/AKIO, 3University Paris-Saclay - CNRS - LISN
   A Multi-source Graph Representation of the Movie Domain for Recommendation Dialogues Analysis
[Paper] [Poster] [Video]
Antonio Origlia1, Martina Di Bratto2, Maria Di Maro2, Sabrina Mennella3
1PRISCA Lab - Dept. of Electrical Engineering and Information Technology - University of Naples "Federico II", 2University of Naples Federico II, 3University of Catania
16:55 - 18:15    Session: P10 - Lexicons (1) - Poster Area 1
Chair: Olsen, Sussi
   SHARE: A Lexicon of Harmful Expressions by Spanish Speakers
[Paper] [Poster] [Video]
Flor Miriam Plaza-del-Arco1, Ana Belén Parras Portillo2, Pilar López Úbeda1, Beatriz Gil3, María-Teresa Martín-Valdivia4
1University of Jaén, 2Universidad de Jaén, 3Universidad de Alicante, 4Univeristy of Jaen
   Wiktextract: Wiktionary as Machine-Readable Structured Data
[Paper] [Video]
Tatu Ylonen
University of Helsinki
   NyLLex: A Novel Resource of Swedish Words Annotated with Reading Proficiency Level
[Paper] [Poster] [Video]
Daniel Holmer and Evelina Rennes
Linköping University
   Making a Semantic Event-type Ontology Multilingual
[Paper] [Poster] [Video]
Zdenka Uresova1, Karolina Zaczynska2, Peter Bourgonje3, Eva Fučíková1, Georg Rehm4, Jan Hajic1
1Charles University, 2German Research Center for Artificial Intelligence, 3Morningsun Technology, 4DFKI
   NomVallex: A Valency Lexicon of Czech Nouns and Adjectives
[Paper] [Poster] [Video]
Veronika Kolářová1 and Anna Vernerová2
1Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics, 2Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University
   TZOS: an Online Terminology Database Aimed at Working on Basque Academic Terminology Collaboratively
[Paper] [Poster] [Video]
Izaskun Aldezabal1, Jose Mari Arriola2, Arantxa Otegi3
1University of the Basque Country, 2UPV/EHU University of the Basque Country, 3University of the Basque Country UPV/EHU
   Animacy Denoting German Nouns: Annotation and Classification
[Paper] [Poster] [Video]
Manfred Klenner1 and Anne Göhring2
1Computational Linguistics, University of Zurich, 2University of Zurich
16:55 - 18:15    Session: P11 - Opinion Mining, Sentiment and Emotion (1) - Poster Area 1
Chair: Kruschwitz, Udo
   x-enVENT: A Corpus of Event Descriptions with Experiencer-specific Emotion and Appraisal Annotations
[Paper] [Poster] [Video]
Enrica Troiano, Laura Ana Maria Oberlaender, Maximilian Wegge, Roman Klinger
University of Stuttgart
   Polar Quantification of Actor Noun Phrases for German
[Paper] [Video]
Anne Göhring1 and Manfred Klenner2
1University of Zurich, 2Computational Linguistics, University of Zurich
   Czech Dataset for Cross-lingual Subjectivity Classification
[Paper] [Poster] [Video]
Pavel Přibáň1 and Josef Steinberger2
1University of West Bohemia, Faculty of Applied Sciences, 2University of West Bohemia
   RED v2: Enhancing RED Dataset for Multi-Label Emotion Detection
[Paper] [Poster] [Video]
Alexandra Ciobotaru1, Mihai Constantinescu2, Liviu P. Dinu1, Stefan Dumitrescu2
1University of Bucharest, 2Independent researcher
16:55 - 18:15    Session: P12 - Evaluation and Validation Methodologies (1) - Poster Area 1
Chair: Refaee, Eshrag Ali A.
   Fine-Grained Error Analysis and Fair Evaluation of Labeled Spans
[Paper] [Poster] [Video]
Katrin Ortmann
Ruhr-Universität Bochum
   Probing Pre-trained Auto-regressive Language Models for Named Entity Typing and Recognition
[Paper] [Video]
Elena V. Epure and Romain Hennequin
Deezer Research
   Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embeddings
[Paper] [Poster] [Video]
Rob van der Goot1, Max Müller-Eberstein1, Barbara Plank2
1IT University of Copenhagen, 2LMU Munich
   The Subject Annotations of the Danish Parliament Corpus (2009-2017) - Evaluated with Automatic Multi-label Classification
[Paper] [Poster] [Video]
Costanza Navarretta1 and Dorte Haltrup Hansen2
1University of Copenhagen, 2University od Copenhagen
   A Systematic Study Reveals Unexpected Interactions in Pre-Trained Neural Machine Translation
[Paper] [Poster] [Video]
Ashleigh Richardson and Janet Wiles
University of Queensland
   Holistic Evaluation of Automatic TimeML Annotators
[Paper] [Video]
Mustafa Ocal1, Adrian Perez2, Antonela Radas2, Mark Finlayson2
1Florida International University, 2FIU
   Measuring Uncertainty in Translation Quality Evaluation (TQE)
[Paper] [Poster] [Video]
Serge Gladkoff1, Irina Sorokina1, Lifeng Han2, Alexandra Alekseeva3
1Logrus Global, 2Dublin City University, 3ROKO lab
   Challenging the Transformer-based models with a Classical Arabic dataset: Quran and Hadith
[Paper] [Poster] [Video]
Shatha Altammami and Eric Atwell
university of leeds
   Question Modifiers in Visual Question Answering
[Paper] [Poster] [Video]
William Britton1, Somdeb Sarkhel2, Deepak Venugopal3
1University of Memphis, 2Adobe, 3The University of Memphis
16:55 - 18:15    Session: P13 - Multimodality and Cross-modality (1) - Poster Area 1
Chair: Favre, Benoit
   Multimodal Pipeline for Collection of Misinformation Data from Telegram
[Paper] [Poster] [Video]
Jose Sosa and Serge Sharoff
University of Leeds
   Identifying Tension in Holocaust Survivors’ Interview: Code-switching/Code-mixing as Cues
[Paper] [Poster] [Video]
Xinyuan Xia, Lu Xiao, Kun Yang, Yueyue Wang
Syracuse University
   Fine-tuning vs From Scratch: Do Vision & Language Models Have Similar Capabilities on Out-of-Distribution Visual Question Answering?
[Paper] [Poster] [Video]
Kristian Nørgaard Jensen1 and Barbara Plank2
1IT University of Copenhagen, 2LMU Munich
   Multilingual Image Corpus – Towards a Multimodal and Multilingual Dataset
[Paper] [Video]
Svetla Koeva1, Ivelina Stoyanova2, Jordan Kralev3
1Institute for Bulgarian Language "Prof. Lyubomir Andreychin", Bulgarian Academy of Sciences, 2Department of Computational Linguistics, IBL - BAS, 3Technical university, Sofia
   Sign Language Production With Avatar Layering: A Critical Use Case over Rare Words
[Paper] [Video]
Jung-Ho Kim1, Eui Jun Hwang2, Sukmin Cho1, Du Hui Lee3, Jong Park2
1Korea Advanced Institute of Science and Technology, 2KAIST, 3EQ4ALL
   The VoxWorld Platform for Multimodal Embodied Agents
[Paper] [Video]
Nikhil Krishnaswamy1, William Pickard1, Brittany Cates1, Nathaniel Blanchard1, James Pustejovsky2
1Colorado State University, 2Brandeis University
   MemoSen: A Multimodal Dataset for Sentiment Analysis of Memes
[Paper] [Poster] [Video]
Eftekhar Hossain1, Omar Sharif1, Mohammed Moshiul Hoque2
1Chittagong University of Engineering and Technology (CUET), 2Department of Computer Science & Engineering, Chittagong University of Engineering & Technology
   RUSAVIC Corpus: Russian Audio-Visual Speech in Cars
[Paper] [Video]
Denis Ivanko, Alexandr Axyonov, Dmitry Ryumin, Alexey Kashevnik, Alexey Karpov
SPC RAS
   A First Corpus of AZee Discourse Expressions
[Paper] [Poster] [Video]
Camille Challant1 and Michael Filhol2
1Université Paris-Saclay, CNRS, LISN, 2LISN, CNRS, Université Paris-Saclay
   BERTHA: Video Captioning Evaluation Via Transfer-Learned Human Assessment
[Paper] [Poster] [Video]
Luis Lebron1, Yvette Graham2, Kevin McGuinness1, Konstantinos Kouramas3, Noel O'Connor1
1Insight SFI Centre for Data Analytics @DCU, 2ADAPT, Trinity College Dublin, 3Collins Aerospace
   Abstract Meaning Representation for Gesture
[Paper] [Video]
Richard Brutti1, Lucia Donatelli2, Kenneth Lai1, James Pustejovsky1
1Brandeis University, 2Saarland University
18:20 - 19:30    ELRA General Meeting - Auditorium
[Video]
20:00    LREC 2022 Welcome Reception - Palais du Pharo
                         End of Day 1