posted by user: bucc || 2626 views || tracked by 9 users: [display]

BUCC 2010 : 3rd Workshop on Building and Using Comparable Corpora

FacebookTwitterLinkedInGoogle

Link: http://www.fb06.uni-mainz.de/lk/bucc2010
 
When May 22, 2010 - May 22, 2010
Where Valletta, Malta
Submission Deadline Feb 19, 2010
Notification Due Mar 12, 2010
Final Version Due Mar 22, 2010
Categories    computational linguistics   language engineering
 

Call For Papers

==================================================================

1st Call for Papers

THIRD WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA

Applications of Parallel and Comparable Corpora in
Natural Language Engineering and the Humanities

LREC 2010 post-conference workshop, 22 May 2010

Mediterranean Conference Centre, La Valletta, Malta

Deadline for submissions: February 19, 2010

http://www.fb06.uni-mainz.de/lk/bucc2010

==================================================================

We are inviting submissions for the third edition of the Workshop
on Building and Using Comparable Corpora. Comparable corpora are
collections of documents that are comparable in content and form
in various degrees and dimensions. This definition includes many
types of parallel and non-parallel multilingual corpora, but also
sets of monolingual corpora that are used for comparative purposes.

By bringing together researchers from several disciplines, including
natural language engineering and the humanities, the workshop
aims at showing the full breadth of research on comparable corpora.

We solicit contributions on a range of topics, including but not
limited to:

* Building Comparable Corpora

- Automatic and semi-automatic methods
- Methods to mine parallel and non-parallel corpora from the Web
- Tools and criteria to evaluate the comparability of corpora
- Parallel vs non-parallel corpora, monolingual corpora
- Rare and minority languages
- Comparisons across language families
- Multi-media/multi-modal comparable corpora

* Applications of Comparable Corpora

- Human translations
- Language learning
- Cross-language information retrieval & document categorization
- Bilingual projections
- (Statistical) machine translation
- Writing assistance

* Mining from Comparable Corpora

- Extraction of parallel segments or paraphrases from
comparable corpora
- Extraction of bilingual and multilingual translations of
single words and multi-word expressions, proper names,
named entities, etc.


IMPORTANT DATES

February 19, 2010 Deadline for paper submissions
March 12, 2010 Notification of acceptance
March 22, 2010 Camera-ready copies due
May 22, 2010 Workshop


INVITED SPEAKER

Adam Kilgarriff (Lexical Computing Ltd)


WORKSHOP CO-CHAIRS

Reinhard Rapp (University of Tarragona, Spain)
Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)
Serge Sharoff (University of Leeds, UK)


PROGRAMME COMMITTEE

Srinivas Bangalore (AT&T Labs, USA)
Caroline Barrière (National Research Council Canada)
Chris Biemann (Microsoft / Powerset, San Francisco, USA)
Lynne Bowker (University of Ottawa, Canada)
Hervé Déjean (Xerox Research Centre Europe, Grenoble, France)
Andreas Eisele (DFKI Saarbrücken, Germany)
Éric Gaussier (Université Joseph Fourier, Grenoble, France)
Gregory Grefenstette (Exalead, Paris, France)
Pascale Fung (Hong Kong University of Science & Technology, China)
Silvia Hansen-Schirra (University of Mainz, Germany)
Hitoshi Isahara (NICT, Tokyo, Japan)
Kyo Kageura (University of Tokyo, Japan)
Min-Yen Kan (National University of Singapore)
Natalie Kübler (Université Paris Diderot, France)
Adam Kilgarriff (Lexical Computing Ltd, UK)
Philippe Langlais (Université de Montréal, Canada)
Emmanuel Morin (Université de Nantes, France)
Tony McEnery (Lancaster University, UK)
Dragos Stefan Munteanu (Language Weaver Inc., USA)
Carol Peters (ISTI-CNR, Pisa, Italy)
Reinhard Rapp (University of Tarragona, Spain)
Sujith Ravi (ISI, University of Southern California, USA)
Serge Sharoff (University of Leeds, UK)
Michel Simard (National Research Council Canada)
Richard Sproat (OGI School of Science and Technology, USA)
Michael Zock (LIF, CNRS Marseille, France)
Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)


SUBMISSIONS

Papers of no more than 10 pages and following the
LREC conference formatting details should be submitted
as a PDF file via the START conference manager at
https://www.softconf.com/lrec2010/BUCC2010/
Papers will be blind reviewed by at least two members
of the Programme Committee. Therefore, authors' names
and affiliations should not appear in the paper.
Accepted papers will be published in the workshop
proceedings.

This year, when submitting a paper from the START page,
authors will be asked to participate in a survey on the
use of language resources by providing essential information
about the resources (in a broad sense, i.e. also technologies,
standards, evaluation kits, etc.) that have been used for the
work described in the paper or are a new result of the research.
For further information on this new iniative, please refer to
http://www.lrec-conf.org/lrec2010/?LREC2010-Map-of-Language-Resources.


FURTHER INFORMATION

If you have questions, please consult the workshop website at
http://www.fb06.uni-mainz.de/lk/bucc2010 or contact
Reinhard Rapp at reinhardrapp AT gmx DOT de

Related Resources

BUCC 2018   11th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA
COLING 2018   The 27th International Conference on Computational Linguistics (COLING 2018)
LTA 2018   3rd International Workshop on Language Technologies and Applications
TMonJobAds 2018   Text Mining on Job Advertisements - Strategies for discovering valuable information from large corpora
OSACT3 2018   The 3rd Workshop on Open-Source Arabic Corpora and Processing Tools
*SEM 2018   7th Joint Conference on Lexical and Computational Semantics
ISAIR 2018   The 3rd International Symposium on Artificial Intelligence and Robotics 2018
LT 2018   Special Session on Language Technologies
SMC 2018   15th Sound & Music Computing Conference
QUALICO 2018   QUALICO 2018 - International Quantitative Linguistics Conference