posted by organizer: manandinfo || 9244 views || tracked by 1 users: [display]

DPIL @ FIRE 2016 : Shared Task on Detecting Paraphrase in Indian Languages (DPIL) held in conjunction with FIRE2016


When Dec 8, 2016 - Dec 10, 2016
Where ISI, Kolkatta
Submission Deadline TBD
Categories    natural language processing   indian languages   NLP   semantic similarity

Call For Papers

Call for Participation
CFP in Shared Task on Detecting Paraphrase in Indian Languages (DPIL) held in conjunction with the Forum for Information Retrieval Evaluation (FIRE) 2016 on 8th - 10th December 2016, Kolkata

Paraphrase can be defined as “the same meaning of a sentence is expressed in another sentence using different words”. Paraphrases can be identified, generated or extracted. The proposed task is focused on sentence level paraphrase identification for Indian languages (Tamil, Malayalam, Hindi and Punjabi). Identifying paraphrases in Indian languages is a difficult task, because evaluating the semantic similarity of the underlying content and the understanding the morphological variations of the language are more critical. Paraphrase identification is strongly connected with generation and extraction of paraphrases. The paraphrase identification systems improve the performance of a paraphrase generation in terms of choosing the best paraphrase candidate from the list of paraphrases candidates generated by paraphrases generation system. Paraphrase Identification is also used in validating the paraphrase extraction system and the machine translation system. Since there are no annotated corpora or automated semantic interpretation systems available for Indian languages till date, creating benchmark data for paraphrases and utilizing that data in open shared task competitions will motivate the research community for further research in Indian languages.

Sub Task 1:

Given a pair of sentences from news paper domain, the task is to classify them as paraphrases (P) or not paraphrases (NP).

Sub Task 2:

Given two sentences from news paper domain, the task is to identify whether they are completely equivalent (E) or roughly equivalent (RE) or not equivalent (NE). This task is similar to the subtask 1, but the main difference is 3-point scale tag in paraphrases.


We are planning to provide cash awards or travel grants for Top 2 winning teams . We glad to invite all researchers working on Paraphrase Identification, Plagiarism detection and Machine Translation Evaluation to participate in the Shared Task DPIL.

Important Dates
21th June 2016 : Registration Starts
10th July 2016 : Training data will be Released
15th August 2016 : Test Data will be Released
1st September 2016 : Deadline of Runs Submission
10th September 2016 : Results declared
10th October 2016 : Working Notes Due
December 8-10, 2016 : FIRE 2016 Conference

-------------------------- Registeration -------------------------
Registration is now open. You can register on the DPIL website:
Task website :
Fire website :

Contact email:
Track Organizers
Anand Kumar M, CEN, Amrita Vishwa Vidyapeetham, Coimbatore, India
Soman K P , CEN, Amrita Vishwa Vidyapeetham, Coimbatore, India

Related Resources

DMD 2018   Shared task on Detecting Malicious Domain names (DMD 2018)
IJSCMC 2018   International Journal of Soft Computing, Mathematics and Control
CL-SciSumm 2018   [CfP] CL-SciSumm Shared Task 2018 @SIGIR’2018: The Scientific Summarization Shared Task
NAACL-HLT 2019   Annual Conference of the North American Chapter of the Association for Computational Linguistics
IWSPA-AP 2018   First Security and Privacy Analytics Anti-Phishing Shared Task
JoL 2018   International Journal of Law
ST-VMWE 2018   Shared task on automatic identification of verbal multiword expressions – edition 1.1
IJCI 2018   International Journal on Cybernetics & Informatics
RUSSE 2018   A Shared Task on Word Sense Induction and Disambiguation for the Russian Language
IJNLC 2018   International Journal on Natural Language Computing