Posted by user: shabnamt

CoCo4MT Shared Task 2023 : CoCo4MT Shared Task: First Call for Participation


When: Sep 4, 2023 - Sep 8, 2023
Where: Macau SAR, China
Submission Deadline: Jul 12, 2023

Call For Papers


We are excited to introduce a new shared task for this year’s CoCo4MT
workshop! Our aim is to encourage and facilitate research on corpus
construction for low-resource machine translation.

Corpus creation for machine translation is typically constrained by the
cost and availability of human translators. When a new dataset needs to be
created for a low-resource language or a specialized domain, the annotation
budget should be used efficiently and any sentences chosen for translation
should be of high quality and as useful for machine translation system
training as possible.

In this shared task, we ask participants to devise methods for identifying
such sentences for a target language that has no existing data.
Specifically, given a parallel corpus between high-resource languages, the
goal is to choose a subset of the high-resource corpus to be translated
into the low-resource language, so that the translated subset forms a good
training set for a machine translation system. The shared task winner will
be the team whose selected instances yield the best final system after
training.
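
As a rough illustration of what a selection method could look like, the
sketch below greedily picks sentences that add the most previously unseen
word types, up to a fixed translation budget. It is not the organizers'
baseline. The function name select_for_translation, the whitespace
tokenization, and the word-type coverage criterion are all assumptions made
for this example.

```python
def select_for_translation(sentences, budget):
    """Greedily pick up to `budget` sentence indices that add the most unseen word types.

    Hypothetical example only: the coverage criterion and whitespace
    tokenization are assumptions, not part of the shared task definition.
    """
    covered = set()                      # word types already covered by chosen sentences
    remaining = list(range(len(sentences)))
    chosen = []
    while remaining and len(chosen) < budget:
        # Score each candidate by the number of new word types it would contribute.
        best = max(
            remaining,
            key=lambda i: len(set(sentences[i].lower().split()) - covered),
        )
        chosen.append(best)
        covered |= set(sentences[best].lower().split())
        remaining.remove(best)
    return chosen


if __name__ == "__main__":
    corpus = ["the cat sat", "the cat sat", "a dog barked loudly", "the dog"]
    # With a budget of 2, the duplicate sentence is never selected twice.
    print(select_for_translation(corpus, 2))  # prints [2, 0]
```

A coverage heuristic like this favors lexical diversity and avoids spending
the budget on near-duplicates. Other criteria, such as sentence length or
model uncertainty, could be substituted for the scoring function.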




Important Dates

- May 19, 2023: Release of train, dev and test data
- May 30, 2023: Release of baselines
- July 12, 2023: Deadline to submit results
- July 20, 2023: System description papers due

Organizers (listed alphabetically)

- Ananya Ganesh, University of Colorado Boulder
- Constantine Lignos, Brandeis University
- John E. Ortega, Northeastern University
- Jonne Sälevä, Brandeis University
- Katharina Kann, University of Colorado Boulder
- Marine Carpuat, University of Maryland
- Rodolfo Zevallos, Universitat Pompeu Fabra
- Shabnam Tafreshi, University of Maryland
- William Chen, Carnegie Mellon University
