
DIFM 2023: First Workshop on Distributed Infrastructures for Foundation Models


Link: https://difm-conf.github.io/index.html
 
When: Dec 11, 2023 - Dec 15, 2023
Where: Bologna, Italy
Submission Deadline: Oct 9, 2023
Notification Due: Oct 20, 2023
Final Version Due: Oct 27, 2023
Categories: distributed systems, distributed frameworks, artificial intelligence, machine learning
 

Call For Papers

The DIFM workshop is co-located with ACM/IFIP Middleware 2023, which takes place from December 11-15 in Bologna, Italy.

Following the innovations in deep learning, foundation models (FMs) are the next evolution in machine learning. Foundation models, including large language models, are driving the recent breakthroughs in conversational AI chatbots (such as ChatGPT), image generation (such as Stable Diffusion), and code assistants (such as GitHub Copilot).

Foundation models across text, speech, and vision domains can be trained at scale using self-supervision techniques and applied to a broad set of downstream tasks. They address a limitation of deep learning models, which typically require large task-specific labeled datasets. There is tremendous activity in this space from large enterprises, such as Google, Microsoft, Amazon, and IBM, and from startups, such as OpenAI, Anthropic, Stability AI, and Cohere. There has also been a flurry of papers in this space from academia and the open source community, resulting in models such as Alpaca and efforts such as OpenAssistant.

Much of the attention has been on the models and datasets, with academic communities relying on the training and serving infrastructure provided by large cloud providers. Innovations on the infrastructure side have largely been taking place in closed enterprises and startups. This workshop will serve as a venue for academics and practitioners to share their findings, visions, and ideas around these infrastructure challenges and concerns. There are still challenging open problems that need attention. One piece of evidence of these challenges is OpenAI’s GPT-4 technical report. While this report is light on technical details, it includes an extensive Acknowledgements section that lists large dedicated teams focused on infrastructure aspects such as “Compute cluster scaling”, “Distributed training infrastructure”, “Hardware correctness”, “Training run babysitting”, “Deployment & post-training”, “Data infrastructure”, “Acceleration forecasting”, “Inference research”, “Inference infrastructure”, and “Reliability engineering”. This suggests the importance of middleware infrastructure to train and serve FMs, and the need for research in this space.

The scope of this workshop includes, but is not limited to:

Resource scheduling algorithms and optimizations for FM serving workloads, including batch, streaming, and synchronous invocation patterns.
Novel techniques to train large FMs.
Frameworks for fine-tuning FMs.
Programming model abstractions for FMs (such as LangChain); see the illustrative sketch after this list.
Case studies of FM middleware.
Novel debugging and logging techniques, both for black-box FMs available through an API and for locally available FMs.
Deployment of FMs in resource-constrained environments (such as edge platforms, web browsers, and mobile devices).
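
To make the programming-model-abstraction item above concrete, the following is a minimal, illustrative sketch of a prompt-chaining abstraction in Python. It is not LangChain's actual API; PromptTemplate, Chain, call_model, and fake_model are hypothetical names standing in for a prompt template, a composable model-call wrapper, and an FM inference endpoint.

from dataclasses import dataclass
from typing import Callable

@dataclass
class PromptTemplate:
    # e.g. "Summarize the following text:\n{text}"
    template: str

    def render(self, **kwargs: str) -> str:
        return self.template.format(**kwargs)

@dataclass
class Chain:
    prompt: PromptTemplate
    call_model: Callable[[str], str]  # wraps a local model or a remote FM API

    def run(self, **kwargs: str) -> str:
        # Render the prompt, then invoke whatever model backend was supplied.
        return self.call_model(self.prompt.render(**kwargs))

def fake_model(prompt: str) -> str:
    # Placeholder for a real FM inference endpoint (local or hosted).
    return "[model output for: " + prompt[:40] + "...]"

# Usage: compose two steps, feeding the output of one chain into the next.
summarize = Chain(PromptTemplate("Summarize:\n{text}"), fake_model)
translate = Chain(PromptTemplate("Translate to French:\n{text}"), fake_model)
print(translate.run(text=summarize.run(text="Foundation models are ...")))
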
Dates and location
Paper submissions: October 9, 2023
Notification to authors: October 20, 2023
Camera-ready copy due: October 27, 2023

Papers and Submissions
We are looking for the following types of submissions:
Research and industry papers (up to 8 pages): Reports on original results, including novel techniques, significant case studies, or surveys. Authors may include extra material beyond the page limit as a clearly marked appendix, which reviewers are not obliged to read but may consult.
Position papers (up to 4 pages): Reports identifying unaddressed problems and research challenges.
Abstracts (up to 1 page): An extended abstract describing preliminary or ongoing work.
Papers must be written in English and submitted in PDF format. All papers should follow ACM formatting instructions, specifically the ACM SIG Proceedings Standard Style. The author kit containing the templates for the required style can be found at http://www.acm.org/publications/proceedings-template.

Submissions should not be blinded for review. Please submit your papers via the submission site: https://difm23.hotcrp.com/

All accepted papers will appear in the Middleware 2023 companion proceedings, available in the ACM Digital Library. All accepted papers will also be presented at the workshop, and at least one author of each paper must register for the workshop.

Workshop Co-chairs
Bishwaranjan Bhattacharjee, IBM Research
Vatche Isahagian, IBM Research
Vinod Muthusamy, IBM Research
Program Committee (Tentative)
Parag Chandakkar, Walmart Labs
Ian Foster, Argonne National Laboratory and the University of Chicago
Matthew Hill, Dataminr
Mayoore Jaiswal, Nvidia
Gauri Joshi, Carnegie Mellon University
Jayaram K. R., IBM Research
Ruben Mayer, Technical University of Munich
Pietro Michiardi, Eurecom
Phuong Nguyen, eBay
Peter Pietzuch, Imperial College
Chuan Wu, University of Hong Kong
