TSAR 2022 : Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022)
Call For Papers
Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022)
Workshop and Shared Task at EMNLP 2022
Shared Task: https://taln.upf.edu/pages/tsar2022-st
Call for Papers
The web provides an abundance of knowledge and information that can reach large populations. However, the way in which a text is written (vocabulary, syntax, or text organization/structure), or presented, can make it inaccessible for many people, especially for non-native speakers, people with low literacy, and people with some type of cognitive or linguistic impairments. The results of the Adult Literacy Survey (OECD, 2013) indicate that approximately 16.7% of the adult population (averaged over 24 highly-developed countries) requires lexical, 50% syntactic, and 89.4% conceptual simplification of everyday texts (Štajner, 2021).
Research on automatic text simplification (TS), textual accessibility, and readability have the potential to improve social inclusion of marginalized populations. These related research areas have attracted attention in the past ten years, evidenced by the growing number of publications in NLP conferences. While only about 300 articles in Google Scholar mentioned TS in 2010, this number has increased to about 600 in 2015 and greater than 1000 in 2020 (Štajner, 2021).
Recent research in automatic text simplification has mostly focused on proposing the use of methods derived from the deep learning paradigm (Glavaš and Štajner, 2015; Paetzold and Specia, 2016; Nisioi et al., 2017; Zhang and Lapata, 2017; Martin et al., 2020; Maddela et al., 2021; Sheang and Saggion, 2021). However, there are many important aspects of automatic text simplification that need the attention of our community: the design of appropriate evaluation metrics, the development of context-aware simplification solutions, the creation of appropriate language resources to support research and evaluation, the deployment of simplification in real environments for real users, the study of discourse factors in text simplification, the identification of factors affecting the readability of a text, etc. To overcome those issues, there is a need for collaboration of CL/NLP researchers, machine learning and deep learning researchers, UI/UX and Accessibility professionals, as well as public organizations representatives (Štajner, 2021).
The proposed TSAR workshop builds upon the recent success of several regional workshops that covered a subset of our topics of interest, including READI Workshops at LREC 2022 and LREC 2022, SEPLN 2021 Workshop on Current Trends in Text Simplification (CTTS) and the SimpleText workshop at CLEF 2021, as well as the birds-of-a-feather events on Text Simplification at NAACL 2021 (over 50 participants) and ACL 2022.
The TSAR workshop aims to foster collaboration among all parties interested in making information more accessible to all people. Through the two invited talks, a shared task on lexical simplification, the round table discussion, oral and poster presentations of novel research, we will discuss recent trends and developments in the area of automatic text simplification, text accessibility, automatic readability assessment, language resources and evaluation for text simplification, etc.
We invite contributions on the following topics (among others):
Modular and end-to-end TS;
Sequence-to-sequence and zero-shot TS;
Text complexity assessment;
Complex word identification and lexical complexity prediction;
Corpora, lexical resources, and benchmarks for TS;
Evaluation of TS systems;
Domain specific/adaptable TS (e.g. health, legal);
Other related topics (e.g. empirical and eye-tracking studies);
Assistive technologies for improving readability and comprehension including those going beyond text.
Text Simplification in Languages other than English
Readability Controlled MT
We welcome two types of papers: long papers and short papers. Submissions should be made to the Softconf submission management system: https://softconf.com/emnlp2022/tsar. The papers should present novel research. The review will be double blind and thus all submissions should be anonymized.
Format: Paper submissions must use the official EMNLP template, which is available as an Overleaf template and also downloadable directly (Latex and Word) (see here: https://2022.emnlp.org/calls/style-and-formatting/). Authors may not modify these style files or use templates designed for other conferences. Submissions that do not conform to the required styles, including paper size, margin width, and font size restrictions, will be rejected without review.
Long Papers: Long papers must describe substantial, original, completed, and unpublished work. Wherever appropriate, concrete evaluation and analysis should be included. Long papers may consist of up to eight (8) pages of content, plus unlimited pages of references. Final versions of long papers will be given one additional page of content (up to 9 pages), so that reviewers’ comments can be taken into account. Long papers will be presented orally or as posters as determined by the program committee. The decisions as to which papers will be presented orally and which as poster presentations will be based on the nature rather than the quality of the work. There will be no distinction in the proceedings between long papers presented orally and long papers presented as posters.
Short Papers: Short paper submissions must describe original and unpublished work. Please note that a short paper is not a shortened long paper. Instead, short papers should have a point that can be made in a few pages. Some kinds of short papers include: a small, focused contribution; a negative result; an opinion piece; an interesting application nugget Short papers may consist of up to four (4) pages of content, plus unlimited pages of references. Final versions of short papers will be given one additional page of content (up to 5 pages), so that reviewers' comments can be taken into account. Short papers will be presented orally or as posters as determined by the program committee. While short papers will be distinguished from long papers in the proceedings, there will be no distinction in the proceedings between short papers presented orally and short papers presented as posters.
13 September 2022 (extended): Workshop paper submission deadline (Softconf)
2 October 2022: Workshop paper notification deadline
16 October 2022: Workshop paper camera ready deadline
8 December 2022: Workshop
All accepted papers will be included in the workshop proceedings and published in ACL Anthology. Extended versions of best papers will be invited for a special issue of Frontiers in Artificial Intelligence focused on: applied research for TS and readability assessment in the context of TS.
Sanja Štajner, NLP Researcher, Germany
Horacio Saggion, Chair in Computer Science and Artificial Intelligence and Head of the LaSTUS Lab in the TALN-DTIC, Universitat Pompeu Fabra
Wei Xu, Assistant Professor at School of Interactive Computing, Georgia Institute of Technology
Marcos Zampieri, Assistant Professor at the Rochester Institute of Technology
Matthew Shardlow, Senior Lecturer at Manchester Metropolitan University
Daniel Ferrés, Post-Doctoral Research Assistant at LaSTUS Lab. at TALN-DTIC, Universitat Pompeu Fabra
Kai North, Ph.D. student at the Rochester Institute of Technology
Kim Cheng Sheang, PhD student at LaSTUS Lab. at TALN-DTIC, Universitat Pompeu Fabra
Program committee (Tentative)
Rodrigo Alarcón (Universidad Carlos III, Spain)
Fernando Alva Manchego (University of Sheffield, UK)
Susana Bautista (Universidad Francisco de Vitoria, Spain)
Antoine Bordes (Facebook, UK)
Remi Cardon (Université Catholique de Louvain, Belgium)
Eric De la Clergerie (INRIA, France)
Felice Dell'Orletta (Istituto di Linguistica Computazionale “Antonio Zampolli”, Italy)
Thomas François (Université catholique de Louvain, Belgique)
Nuria Gala (Université Aix-Marseille, France)
Goran Glavaš (University of Mannheim, Germany)
Itziar Gonzalez-Dios (University of the Basque Country, Spain)
Natalia Grabar (Université de Lille, France)
Raquel Hervás (Universidad Complutense de Madrid, Spain)
Tomoyuki Kajiwara (Ehime University, Japan)
David Kauchak (Pomona College, USA)
Reno Kriz (University of Pennsylvania, USA)
Louis Martin (Facebook, UK)
Lourdes Moreno López (Universidad Carlos III, Spain)
Christina Niklaus (University St. Gallen, Switzerland)
Benoît Sagot (INRIA, France)
Giulia Venturi (Istituto di Linguistica Computazionale “Antonio Zampolli”, Italy)
Victoria Yaneva (National Board of Medical Examiners, USA)
Feel free to send us messages at: