![]() |
| |||||||||||
DEARING 2025 2025 : [CFP DEARING2025@ECMLPKDD2025] 2nd International Workshop on Data-Centric Artificial Intelligence (DEARING 2025) | |||||||||||
| |||||||||||
Call For Papers | |||||||||||
Please circulate within your networks. Apologies for multiple postings.
===============================================================================================================
2nd International Workshop on Data-Centric Artificial Intelligence (DEARING 2025)
September 15-19, 2025 - Porto, Portugal
https://dearing-workshop.github.io
CO-LOCATED with the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - (ECML PKDD 2025) =============================================================================================================== == CALL FOR PAPERS == Artificial Intelligence (AI) has historically relied on two components: data and algorithms. However, the conventional model-centric AI paradigm has historically prioritized algorithms, often treating data as static entities. Typically, data is initially collected, pre-processed, and held fixed, with a significant portion of development efforts dedicated to optimizing learned models. This conventional approach has led to the creation of increasingly intricate and opaque models, necessitating substantial training data. In contrast, the emerging data-centric AI paradigm is dedicated to systematically and algorithmically generating optimal data to feed Machine Learning (ML) models. The primary objective of data-centric AI approaches is to consistently enhance data quality, thereby achieving a level of model accuracy that was previously considered unattainable through model-centric techniques alone. This workshop aims to explore the transformative impact of recent advancements in the data-centric AI paradigm on the future of AI and ML. It serves as a platform for in-depth discussions and the exchange of scientific contributions, recent achievements, and open challenges. The goal is to inspire further innovations in data-centric AI research, ultimately shaping the future landscape of artificial intelligence. Accepted paper formats DEARING welcomes both research papers reporting results from mature work, as well as more speculative papers describing new ideas or preliminary exploratory work. Papers reporting industry experiences and case studies will also be encouraged. Submissions are accepted in two formats: • Regular research papers with 12 to 16 pages including references. • Short research papers of at most 6 pages including references, aiming at fostering discussion and collaboration. (eg. outlining new researching ideas). All submissions should be made in PDF using the Microsoft CMT platform (according to the conference guidelines) and must adhere to the Springer LNCS style. All workshop papers will be included in a joint Post-Workshop proceeding published by Springer Communications in Computer and Information Science in 1-2 volumes, organized by focused scope and possibly indexed by WOS. Paper authors will have the faculty to opt-in or opt-out. We suggest workshop papers be prepared and submitted in the LNCS format. To be published in the proceedings, research papers must be original, not published previously, and not submitted concurrently elsewhere. Important dates Concording with ECMLPKDD deadline: • Paper Submission deadline: 2025-06-14 • Paper Author notification: 2024-07-14 Workshop topics We welcome submissions that explore the opportunities, perspectives, and research directions within the data-centric AI paradigm. Potential topics include, but are not limited to: High-quality data preparation: • Data cleaning, denoising, and interpolation • Novel feature engineering pipelines • Label Errors and Confident Learning (CL) Selecting features and/or instances • Performing outlier detection and removal • Ensuring label consensus • Producing consistent and low-noise training data • Extracting smart data from raw data • Creating training datasets for small data problems • Handling rare classes and explaining important class coverage in big data problems • Incorporating human feedback into training datasets • Combining multi-view, multi-source, multi-objective datasets Data-centric ML and Deep Learning approaches • Active learning to identify the most valuable examples to label • Core-set learning to handle big data • Semi-supervised learning, few-shot learning, weak supervision, confident learning to take advantage of the limited amount of labels or handle label noise • Transfer learning and self-supervised learning algorithms to achieve rich data representations to be used with scarceness of labels • Concept drift detection and management • Adversarial learning to improve robustness and resilience Responsible and Ethical AI • Ensuring fairness, bias, ethics and diversity • Green AI design and evaluation • Scalable and reliable training • Privacy-preserving and secure learning • Reproducibility of AI Data benchmark creation • Creating licensed datasets based on public resources • Creating high-quality data from low-quality resources Data-centric Explainable AI • Novel XAI methods to identify possible data issues in the learning stage • XAI methods to generate features for machine learning problems Applications of novel data-centric AI solutions • Healthcare and Medical Applications: Ensuring data diversity and generating realistic patient data without exposing sensitive information • Automomous Vehicles and Smart Cities: Simulating representative scenarios for software testing • Cybersecurity and Fraud Detection: Detecting, exploring, or generating rare/edge cases and patterns for machine learning robustness • Manufacuring and Industrial Applications: Ensuring coverage in equipment failures for stress testing • Facial Recognition and Biometrics: Increasing diversity in images to reduce bias • Legal and Military Applications: Fostering fair and explainable systems Organizing Commitee Donato Malerba – University of Bari Aldo Moro Vincenzo Pasquadibisceglie – University of Bari Aldo Moro Antonella Poggi – Sapienza University of Rome Miriam Seoane Santos – University of Porto Web Chair Vito Recchia – University of Bari Aldo Moro Program Committee TBA |
|