SIGIR eCom DC 2020 : The 2020 SIGIR Ecom Data Challenge : Multi-modal Product Data Classification and Retrieval challenge

posted by organizer: skallumadi || 11994 views || tracked by 3 users: [display]

SIGIR eCom DC 2020 : The 2020 SIGIR Ecom Data Challenge : Multi-modal Product Data Classification and Retrieval challenge

Link: https://sigir-ecom.github.io/data-task.html

When	Jul 30, 2020 - Jul 30, 2020
Where	Xi'an
Abstract Registration Due	Jul 15, 2020
Submission Deadline	Jul 22, 2020
Notification Due	Jul 23, 2020
Final Version Due	Jul 27, 2020

Categories ecommerce NLP information retrieval classification

Call For Papers

Call for Participation

The 2020 SIGIR workshop on eCommerce is hosting the Rakuten Multi-modal Product Data Classification and Retrieval challenge. This challenge focuses on two topics, namely large-scale multi-modal (text and image) classification and cross-modal retrieval. The goal of the multi-modal classification task is to predict each product’s 'type code' as defined in the catalog of Rakuten France. In the cross-modal retrieval task, presented with the text of the products, the goal is to retrieve the images corresponding to the products.

Challenge website: https://sigir-ecom.github.io/data-task.html

Important Dates:
Data Challenge opens: April 20, 2020
Final Leaderboard - July 22, 2020
SIGIR eCom Full day Workshop - July 30, 2020

Task Description:

The cataloging of product listings through some type of text or image categorization method is a fundamental problem for any e-commerce marketplace, with applications ranging from personalized search and recommendations to query understanding. Manual and rule-based approaches to categorization are not scalable since commercial products are organized in many and sometimes thousands of classes. When actual users categorize product data, it has often been observed that not only the text of the title and description of the product is useful but also its associated images.

Tasks:

The main tasks for this challenge are as follows:
1. Multi-modal classification. Given a training set of products and their product type codes, predict the corresponding product type codes for an unseen held out test set of products. The systems are free to use the available textual titles and/or descriptions whenever available and additionally the images to allow for true multi-modal learning.
2. Cross-modal retrieval. Given an held-out test set of product items with their titles and (possibly empty) descriptions, predict the best image from among a set of test images that correspond to the products in the test set.

Participation and Data:

The data challenge is open to everyone.

For this challenge, Rakuten France has released approximately 99K product listings in tsv format, including a training (84,916) and test set (13,812). The dataset consists of product titles, product descriptions, product images and their corresponding product type codes. The test set will be released towards the end of the data challenge. Furthermore, one can assume the test set has been generated from the same data distribution as the training set.

Details about evaluation metrics and other aspects of the task can be found at the website: https://sigir-ecom.github.io/data-task.html

Important Dates:
Data Challenge opens: April 20, 2020
Final Leaderboard - July 22, 2020
SIGIR eCom Full day Workshop - July 30, 2020

Related Resources

SIGIR 2024 The 47th International ACM SIGIR Conference on Research and Development in Information Retrieval

Ei/Scopus-AACIP 2024 2024 2nd Asia Conference on Algorithms, Computing and Image Processing (AACIP 2024)-EI Compendex

CyberHunt 2024 7th Annual Workshop on Cyber Threat Intelligence and Hunting in conjunction with the IEEE Big Data Conference

CST 2024 11th International Conference on Advances in Computer Science and Information Technology

IEEE Big Data PhD Forum 2024 IEEE Big Data 2024 - Ph.D. Forum

IEEE Big Data - MMAI 2024 IEEE Big Data 2024 Workshop on Multimodal AI

IEEE Big Data - AIMG 2024 IEEE Big Data 2024 Workshop on AI Music Generation

NLAI 2024 5th International Conference on NLP & Artificial Intelligence Techniques

DSIT 2024 2024 7th International Conference on Data Science and Information Technology (DSIT 2024)

AMLDS 2025 2025 International Conference on Advanced Machine Learning and Data Science