
FinTOC 2020 : Financial Document Structure Extraction 2020


Link: http://wp.lancs.ac.uk/cfie/fintoc2020/
 
When N/A
Where Barcelona
Submission Deadline TBD
Categories    NLP   document analysis   LAYOUT
 

Call For Papers

First Call For Participation

FinTOC’2 shared task
Held at COLING 2020 as part of the FNP-FNS 2020 workshop.
13 September, Barcelona, Spain.
====================

Shared Task URL: http://wp.lancs.ac.uk/cfie/fintoc2020/
Workshop URL: http://wp.lancs.ac.uk/cfie/fnp2020/
Participation Form: https://forms.gle/LFsVaw6DqYikhKHx9

_____________________________________________

The FinTOC’2 shared task aims to bring together researchers interested in Financial Document Processing and Document Layout Analysis to advance the state of the art in the automatic processing of financial documents. The task focuses on automatically generating a report's Table Of Contents (henceforth TOC), a key building block in the semantic analysis of financial documents. Generating the TOC requires detecting the span of every document section and subsection, identifying their titles, and organising them into a hierarchy. Extracting document structure is a key step in information processing. For example, sections can be used to delimit the areas where downstream algorithms such as Information Extraction are applied, thus reducing the false-positive rate and irrelevant noise.
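
As a rough illustration of the last of these steps (organising titles into a hierarchy), here is a minimal Python sketch. It assumes titles have already been detected and assigned a depth of 1 or more, in reading order. The names `TocNode`, `build_toc` and `render`, and the `(title, depth)` input, are hypothetical and are not the shared task's data format or scoring code.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class TocNode:
    title: str
    depth: int  # assumed convention: 1 = section, 2 = subsection, ...
    children: List["TocNode"] = field(default_factory=list)


def build_toc(titles: List[Tuple[str, int]]) -> TocNode:
    """Nest (title, depth) pairs, given in reading order, into a tree."""
    root = TocNode(title="ROOT", depth=0)  # artificial root, not a real title
    stack = [root]  # path from the root to the most recent node
    for title, depth in titles:
        # Pop until the top of the stack is shallower than the new title,
        # so the new title becomes a child of its nearest ancestor.
        while stack[-1].depth >= depth:
            stack.pop()
        node = TocNode(title=title, depth=depth)
        stack[-1].children.append(node)
        stack.append(node)
    return root


def render(node: TocNode, indent: int = 0) -> List[str]:
    """Return the TOC as indented lines, skipping the artificial root."""
    lines = []
    for child in node.children:
        lines.append("  " * indent + child.title)
        lines.extend(render(child, indent + 1))
    return lines


# Made-up example titles, for illustration only.
example = [
    ("Fund Overview", 1),
    ("Investment Objective", 2),
    ("Fees", 2),
    ("Risk Factors", 1),
]
print("\n".join(render(build_toc(example))))
```

The stack keeps only the path to the current title, so each new title is attached to its nearest shallower ancestor in a single pass over the input.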

This is the second edition of the FinTOC shared task, which will be held at COLING 2020 in Barcelona (Spain) as part of the FNP-FNS 2020 workshop. Last year’s edition received significant interest, particularly in the Title Detection track. Our aim this year is to increase interest by:
- lowering the barriers to entry for the TOC extraction track, and
- opening up the task to a new language: French. We are particularly interested in systems which can be applied to both English and French.

This second edition proposes two tracks, one per language, and will score systems on both Title Detection and TOC generation performance. We have revised the task and greatly simplified the data formats to make it as easy as possible for every interested researcher to participate and submit their systems’ outputs to FinTOC’2.

Each of the participating teams will be asked to submit a short paper describing their methods and solutions to be presented at the workshop.

_____________________________________________

To register your interest in participating in the FinTOC’2 shared task, please use the following Google form no later than April 6th, 2020: https://forms.gle/LFsVaw6DqYikhKHx9
__________________________________________

Important dates:
December 1st, 2019: Registration opens.
February 17th, 2020: Release of training set & scoring scripts.
March 23rd, 2020: Release of test set.
April 6th, 2020: Registration deadline.
April 13th, 2020: Submission deadline.
May 1st, 2020: Release of results.
September 13th, 2020: Workshop day.
_________________________________________

Contact:
For any questions about the shared task, please contact us at:
fin.toc.task@gmail.com
______________________________________

Shared task organizers:
- Najah-Imane BENTABET, Fortia Financial Solutions
- Ismail El Maarouf, Fortia Financial Solutions
- Mahmoud El-Haj, Lancaster University
- Rémi Juge, Fortia Financial Solutions
- Sira Ferradans, Fortia Financial Solutions
- Dialekti Valsamou-Stanislawski, Fortia Financial Solutions
