posted by user: cchangyou

FOMO-VL 2022 : The 1st Workshop on Foundation Models for Vision and Language


Link: https://fomo-vl.github.io/icdm2022/
 
When: Nov 28, 2022
Where: Virtual/Florida
Submission Deadline: Oct 10, 2022
Notification Due: Oct 13, 2022
Categories: machine learning, foundation models, vision and language, deep learning
 

Call For Papers

The FOMO-VL 2022 workshop aims to bring together practitioners and researchers, with a specific focus on the emerging trends and industry needs associated with multimodal data analytics with foundation models. Both theoretical and experimental submissions are encouraged. Papers should elaborate on model pre-training and adaptation methods for multimodal data, opportunities and issues associated with foundation models, visualization and efficient large-scale training tools and methods, and novel applications or systems. Topics of interest include but are not limited to:

1. Theories and algorithms of self-supervised learning, e.g., generative and contrastive approaches
2. Scaling and generalization of pre-training including multi-task and modularized architectures
3. Efficient distributed training techniques for big multimodal data
4. Lightweight model adaptation on resource-limited devices and scenarios
5. Data-efficient model adaptation methods: zero-shot and few-shot
6. Vision-and-language (V+L) benchmarks and evaluation
7. Knowledge-enriched methods
8. Interactive AI agents with foundation models
9. Foundation models beyond V+L, e.g., structured data, multilingual, video and knowledge-graph
10. Data collection for foundation models
11. Risks and bias issues in foundation models
12. Novel applications in domains including retail, finance, and healthcare
13. Visions/Comments on the future of foundation models for V+L

Submission Guidelines

We welcome full research papers (limited to a maximum of 8 pages, excluding supplementary materials) as well as vision/demo/poster/industrial papers (up to 3 pages, excluding references and appendix). Submissions longer than 8 main pages will be rejected without review. You may include any number of pages for references and appendix. If you have an appendix, please combine it with the main pages into a single PDF file, as no additional files will be accepted in the submission system. All submissions will be reviewed by the Program Committee on the basis of technical quality, relevance to the scope of the conference, originality, significance, and clarity.

Panelists (random order):
-- Jianfeng Gao (MSR)
-- Trishul Chilimbi (Amazon)
-- Christoph Schuhmann (LAION)
-- Ruslan Salakhutdinov (CMU)
-- Ludwig Schmidt (UW)

Invited Speakers (random order):
-- Danqi Chen (Princeton)
-- Xifeng Yan (UCSB)
-- Tengyu Ma (Stanford)
-- Letitia Parcalabescu (University of Heidelberg)
-- Jiahui Yu (Google)
-- Lu Yuan (MSR)
-- Jiasen Lu (Allen Institute for AI)
-- Justin Lin (Alibaba)
