posted by user: trecvid || 419 views || tracked by 2 users: [display]

DVU 2020 : International Workshop on Deep Video Understanding


When Oct 25, 2020 - Oct 29, 2020
Where The Netherlands
Submission Deadline Jul 31, 2020
Notification Due Aug 10, 2020
Final Version Due Aug 17, 2020
Categories    multimedia analysis   video understanding   computer vision   multimodal interaction

Call For Papers

Deep video understanding is a difficult task which requires systems to develop a deep analysis and understanding of the relationships between different entities in video, to use known information to reason about other, more hidden information, and to populate a knowledge graph (KG) with all acquired information. To work on this task, a system should take into consideration all available modalities (speech, image/video, and in some cases text). The aim of this workshop is to push the limits of multimodal extraction, fusion, and analysis techniques to address the problem of analysing long duration videos holistically and extracting useful knowledge to utilize it in solving different types of queries. The target knowledge includes both visual and non-visual elements. As videos and multimedia data are getting more and more popular and usable by users in different domains, the research, approaches and techniques we aim to be applied in this workshop will be very relevant in the coming years and near future.
This workshop will support two tracks of research contributions:

1. Track 1: Interested authors are invited to apply their approaches and methods on a novel High-Level Video Understanding (HLVU) dataset being made available by the workshop organizers. These include 10 movies with a Creative Commons license. This dataset will be annotated by human assessors and ground truth (Ontology of relations, entities, actions & events, names and images of all main characters, Knowledge Graph for 50% of the movies) provided to participating researchers. The organizers will also support evaluation and scoring of two main query types distributed with the dataset:

-Multiple choice question answering on part of Knowledge Graph for selected movies.
-Possible path analysis between persons / entities of interest in a Knowledge Graph
extracted from selected movies.

2. Track 2: Contributions related (but not limited) to the following topics applied on the provided HLVU dataset or any external datasets are invited:

Multimodal feature extraction for movies and extended video
Multimodal fusion of computer vision, text/language processing and audio for
extended video / movie analysis
Machine Learning methods for movie-based multimodal interaction
Sentiment analysis and multimodal dialogue modeling for movies
Knowledge Graph generation, analysis, and extraction for movies and extended

Submission :

We invite submissions of long papers (up to 8 pages excluding references), short papers (up to 4 pages excluding references), and extended abstracts (up to 1 page excluding references), formatted according to the ACM template available here (, or directly from Overleaf ( Submissions shall be single blind, i.e. do not need to be anonymized. Workshop papers will be indexed by ACM Digital Library in an adjunct proceedings.

Papers submitted at ICMI 2020 must not have been published previously. A paper is considered to have been published previously if it has appeared in a peer-reviewed journal, magazine, book, or meeting proceedings that is reliably and permanently available afterward in print or electronic form to non-attendees, regardless of the language of that publication. A paper substantially similar in content to one submitted to ICMI 2020 should not be simultaneously under consideration for another conference or workshop.

ICMI 2020 does not consider a paper on as a dual submission.

All submissions will be handled electronically via EasyChair :

Important Dates:

HLVU movie dataset available including preliminary annotations: March 31, 2020
Complete HLVU annotations and development data available: April 24, 2020 *
Testing queries released: May 29, 2020 *
Run submissions due to organizers: June 29, 2020 *
Results released back to participants: July 13, 2020 *
Workshop paper submission deadline: July 31, 2020
Notification to authors: August 10, 2020
Workshop camera-ready submission: August 17, 2020
Workshop date: October 25 or 29, 2020

* = Track 1 only

Related Resources

IEEE-CVIV 2020   2020 2nd International Conference on Advances in Computer Vision, Image and Virtualization (CVIV 2020)
DVU-Challenge 2020   Deep Video Understanding - ACM MM Grand Challenge
ACM-CSSE 2020-Ei/Scopus 2020   2020 3nd International Conference on Computer Science and Software Engineering (CSSE 2020)
TRECVID 2020   Trec Video Retrieval Evaluation
ICMLC--ACM, Ei and Scopus 2020   ACM--2020 12th International Conference on Machine Learning and Computing (ICMLC 2020)--SCOPUS, Ei Compendex
ISAIR 2020   The 5th International Symposium on Artificial Intelligence and Robotics
IEEE JSTSP SI 2020   IEEE JSTSP Deep Learning for Image/Video Restoration and Compression (Special Issue on)
MNLP 2020   4th IEEE Conference on Machine Learning and Natural Language Processing
CCVPR 2020   2020 3rd International Joint Conference on Computer Vision and Pattern Recognition (CCVPR 2020)
AI ML Big Data Vision 2020   IEEE COINS | Artificial Intelligence | Machine Learning | Deep Learning | Machine Vision | Big Data Analytics | Video Analytics | Speech Recognition | NLP