posted by organizer: manandinfo || 7272 views || tracked by 2 users: [display]

DPIL @ FIRE 2016 : Shared Task on Detecting Paraphrase in Indian Languages (DPIL) held in conjunction with FIRE2016


When Dec 8, 2016 - Dec 10, 2016
Where ISI, Kolkatta
Submission Deadline TBD
Categories    natural language processing   indian languages   NLP   semantic similarity

Call For Papers

Call for Participation
CFP in Shared Task on Detecting Paraphrase in Indian Languages (DPIL) held in conjunction with the Forum for Information Retrieval Evaluation (FIRE) 2016 on 8th - 10th December 2016, Kolkata

Paraphrase can be defined as “the same meaning of a sentence is expressed in another sentence using different words”. Paraphrases can be identified, generated or extracted. The proposed task is focused on sentence level paraphrase identification for Indian languages (Tamil, Malayalam, Hindi and Punjabi). Identifying paraphrases in Indian languages is a difficult task, because evaluating the semantic similarity of the underlying content and the understanding the morphological variations of the language are more critical. Paraphrase identification is strongly connected with generation and extraction of paraphrases. The paraphrase identification systems improve the performance of a paraphrase generation in terms of choosing the best paraphrase candidate from the list of paraphrases candidates generated by paraphrases generation system. Paraphrase Identification is also used in validating the paraphrase extraction system and the machine translation system. Since there are no annotated corpora or automated semantic interpretation systems available for Indian languages till date, creating benchmark data for paraphrases and utilizing that data in open shared task competitions will motivate the research community for further research in Indian languages.

Sub Task 1:

Given a pair of sentences from news paper domain, the task is to classify them as paraphrases (P) or not paraphrases (NP).

Sub Task 2:

Given two sentences from news paper domain, the task is to identify whether they are completely equivalent (E) or roughly equivalent (RE) or not equivalent (NE). This task is similar to the subtask 1, but the main difference is 3-point scale tag in paraphrases.


We are planning to provide cash awards or travel grants for Top 2 winning teams . We glad to invite all researchers working on Paraphrase Identification, Plagiarism detection and Machine Translation Evaluation to participate in the Shared Task DPIL.

Important Dates
21th June 2016 : Registration Starts
10th July 2016 : Training data will be Released
15th August 2016 : Test Data will be Released
1st September 2016 : Deadline of Runs Submission
10th September 2016 : Results declared
10th October 2016 : Working Notes Due
December 8-10, 2016 : FIRE 2016 Conference

-------------------------- Registeration -------------------------
Registration is now open. You can register on the DPIL website:
Task website :
Fire website :

Contact email:
Track Organizers
Anand Kumar M, CEN, Amrita Vishwa Vidyapeetham, Coimbatore, India
Soman K P , CEN, Amrita Vishwa Vidyapeetham, Coimbatore, India

Related Resources

CoNLL Shared Task 2016   Multilingual Shallow Discourse Parsing (CoNLL 2016 Shared Task)
IJAIA 2016   International Journal of Artificial Intelligence & Applications
FIRE 2016   FIRE 2016 : Eigth meeting of the Forum for Information Retrieval Evaluation
ACL 2017   The 55th annual meeting of the Association for Computational Linguistics
The Great Fire: Reconsidered 2016   The Great Fire: Reconsidered
IJNLC 2016   International Journal on Natural Language Computing
MPP 2016   5th Workshop on Parallel Programming Models - Special Edition on Task Parallelism
MTAGS 2016   MTAGS16: 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers
EACL SRW 2017   EACL 2017 Student Research Workshop