posted by organizer: trecvid

DVU 2021 : DVU-Challenge 2021 : Deep Video Understanding - ACM MM Grand Challenge


Link: https://sites.google.com/view/dvuchallenge2021/home
 
When Oct 20, 2021 - Oct 24, 2021
Where Chengdu, China
Submission Deadline Jul 11, 2021
Categories    computer vision   multimedia understanding   multimedia queries   knowledge graph
 

Call For Papers

Deep video understanding is a difficult task that requires systems to develop a deep analysis and understanding of the relationships between different entities in a video, to use known information to reason about other, more hidden information, and to populate a knowledge graph (KG) with all acquired information. To work on this task, a system should take into consideration all available modalities (speech, image/video, and in some cases text). The aim of this new challenge is to push the limits of multimodal extraction, fusion, and analysis techniques to address the problem of analyzing long-duration videos holistically and extracting useful knowledge that can be used to answer different types of queries. The target knowledge includes both visual and non-visual elements. As video and multimedia data become more popular and more widely used across different domains, the research, approaches, and techniques applied in this Grand Challenge will be highly relevant in the coming years.

Challenge Overview:
Interested participants are invited to apply their approaches and methods to an extended, novel Deep Video Understanding (DVU) dataset made available by the challenge organizers. The dataset includes the 10 Creative Commons licensed movies from the 2020 version of this challenge (HLVU), supplemented with the Land Girls TV series, licensed for use in this challenge by the BBC, and additional Creative Commons licensed movies added for the 2021 challenge. The dataset will be annotated by human assessors. Final ground truth will be provided to participating researchers for 50% of the dataset, for training and development of their systems, at two levels: the overall movie level (ontology of relations, entities, actions and events; knowledge graph; and names and images of all main characters) and the individual scene level (ontology of locations, people/entities, their attributes, and the interactions between them). The organizers will support evaluation and scoring for a hybrid of main query types, at the overall movie level and at the individual scene level, distributed with the dataset (please refer to the dataset webpage for more details):

Example Question types at Overall Movie Level:

1- Multiple choice question answering on part of the Knowledge Graph for selected movies.

2- Possible path analysis between persons / entities of interest in a Knowledge Graph extracted from selected movies.

3- Fill in the Graph Space - Given a partial graph, systems will be asked to fill in the graph space.
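
As an illustration of the path analysis query type (2), the following is a minimal Python sketch that lists the possible paths between two entities in a toy knowledge graph. The triple format, the character names, the relation labels, and the `find_paths` helper are all hypothetical and are not taken from the challenge dataset or its official query format.

```python
from collections import defaultdict

# Hypothetical toy knowledge graph as (subject, relation, object) triples.
# These names and relations are invented for illustration only.
TRIPLES = [
    ("Alice", "sister_of", "Bob"),
    ("Bob", "works_at", "Cafe"),
    ("Carol", "owns", "Cafe"),
    ("Alice", "friend_of", "Carol"),
]


def build_adjacency(triples):
    """Index triples in both directions; inverse edges get a '^-1' suffix."""
    adj = defaultdict(list)
    for subj, rel, obj in triples:
        adj[subj].append((rel, obj))
        adj[obj].append((rel + "^-1", subj))
    return adj


def find_paths(triples, source, target, max_hops=3):
    """Return all simple paths from source to target with at most max_hops edges."""
    adj = build_adjacency(triples)
    paths = []

    def dfs(node, visited, path):
        if node == target:
            paths.append(list(path))
            return
        if len(path) == max_hops:
            return
        for rel, nxt in adj[node]:
            if nxt not in visited:
                visited.add(nxt)
                path.append((node, rel, nxt))
                dfs(nxt, visited, path)
                path.pop()
                visited.remove(nxt)

    dfs(source, {source}, [])
    return sorted(paths, key=len)  # shortest paths first


if __name__ == "__main__":
    for p in find_paths(TRIPLES, "Alice", "Cafe"):
        print(" -> ".join(f"{s} [{r}] {o}" for s, r, o in p))
```

Treating every relation as traversable in both directions is a design choice of this sketch; a real system would follow whatever directionality the provided ontology defines.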

Example Question types at Individual Scene Level:

1- Find next or previous interaction, given two people, a specific scene, and the interaction between them.

2- Find a unique scene given a set of interactions and a scene list.

3- Fill in the Graph Space - Given a partial graph for a scene, systems will be asked to fill in the graph space.

4- Match selected scenes with a set of scene descriptions written in natural language.
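
To make the first scene-level query type (1) concrete, here is a small Python sketch that looks up the next or previous interaction between two people in one scene. It assumes, purely for illustration, that a scene is a time-ordered list of (person, interaction, person) tuples and that interactions are matched regardless of direction; the data and the `adjacent_interaction` helper are hypothetical and do not reflect the official annotation format.

```python
# Hypothetical scene annotation: interactions in temporal order.
SCENE = [
    ("Alice", "greets", "Bob"),
    ("Bob", "asks", "Alice"),
    ("Carol", "interrupts", "Bob"),
    ("Alice", "hugs", "Bob"),
]


def adjacent_interaction(scene, person_a, person_b, interaction, direction="next"):
    """Return the interaction right after (or before) the given one
    between the same two people, or None if there is none."""
    pair = {person_a, person_b}
    # Keep only interactions involving exactly these two people, in order.
    between = [label for (src, label, dst) in scene if {src, dst} == pair]
    if interaction not in between:
        return None
    idx = between.index(interaction)
    idx += 1 if direction == "next" else -1
    return between[idx] if 0 <= idx < len(between) else None


if __name__ == "__main__":
    print(adjacent_interaction(SCENE, "Alice", "Bob", "asks", "next"))
    print(adjacent_interaction(SCENE, "Alice", "Bob", "asks", "previous"))
```

The sketch takes the ground-truth interaction list as given; in the challenge itself, a system would first have to extract such interactions from the video.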

Important Dates:

Complete HLVU annotations for the development and testing data used in 2020 are available: drive.google.com/drive/u/0/folders/1q1Ca0aFJrF9tB8hsw-mrI9d4tzy5wlPZ

DVU development data release: April 19, 2021
Testing dataset release: May 1, 2021
Testing queries release: June 6, 2021
Run submissions due to organizers: July 11, 2021
Paper submission deadline: July 11, 2021
Results released back to participants: TBD
Notification to authors: TBD
Camera-ready submission: TBD
ACM Multimedia dates: October 20 - 24, 2021
