posted by organizer: bucc_cfp || 1520 views || tracked by 1 users: [display]

BUCC 2019 : Building and Using Comparable Corpora

FacebookTwitterLinkedInGoogle

Link: https://comparable.limsi.fr/bucc2019/
 
When Sep 5, 2019 - Sep 5, 2019
Where Varna, Bulgaria
Submission Deadline Jun 30, 2019
Notification Due Jul 28, 2019
 

Call For Papers

Submissions: https://www.softconf.com/ranlp2019/BUCC/

Comparable corpora with various degrees of comparability (from noisy parallel corpora to random web snapshots) have been used in a range of applications, including Information Retrieval, Machine Translation, Cross-lingual text classification, etc. We believe that the linguistic definitions and observations related to comparable corpora can improve methods to mine such corpora for applications of statistical NLP, for example to extract parallel corpora from comparable corpora for neural MT, see the BUCC shared tasks in the past years.

The special topic for this year is Neural Networks for Building and Using Comparable Corpora. The workshop is co-located with RANLP'19.

We solicit contributions to the following topics:
Building Comparable Corpora
• Automatic and semi-automatic methods
• Methods to mine parallel and non-parallel corpora from the Web
• Tools and criteria to evaluate the comparability of corpora
• Parallel vs non-parallel corpora, monolingual corpora
• Rare and minority languages, across language families
• Multi-media/multi-modal comparable corpora

Applications of comparable corpora
• Human translations
• Language learning
• Cross-language information retrieval & document categorization
• Bilingual projections
• Machine translation
• Writing assistance

Mining from Comparable Corpora
• Cross-language distributional semantics
• Extraction of parallel segments or paraphrases from comparable corpora
• Methods to extract parallel from non-parallel corpora (e.g. to provide for low-resource languages in neural machine translation)
• Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc.


Related Resources

BUCC 2025   18th Workshop on Building and Using Comparable Corpora workshop at COLING'25
CMC-Corpora 2025   12th International Conference on CMC and Social Media Corpora for the Humanities
MAS-GAIN 2025   1st International Workshop on Multi-Agent Systems using Generative Artificial INtelligence for Automated Software Engineering
IndiREAD 2025   investigating individual differences in reading using both experimental and computational approaches
CoUDP 2025   2025 International Conference on Urban Design and Planning (CoUDP 2025)
ICBSTS 2025   2025 6th International Conference on Building Science, Technology and Sustainability (ICBSTS 2025)
ICBMC 2026   2026 11th International Conference on Building Materials and Construction (ICBMC 2026)
SmartIoT 2025   The 9th IEEE International Conference on Smart Internet of Things
ICCESB 2025   Springer--2025 International Conference on Civil Engineering and Sustainable Building (ICCESB 2025)
ICBMM 2025   2025 The 9th International Conference on Building Materials and Materials Engineering (ICBMM 2025)