posted by user: danilo_dessi || 67 views || tracked by 2 users: [display]

NSLP 2026 : 3rd International Workshop on Natural Scientific Language Processing (NSLP 2026)

FacebookTwitterLinkedInGoogle

Link: https://nfdi4ds.github.io/nslp2026
 
When May 12, 2026 - May 12, 2026
Where Palma, Mallorca (Spain)
Submission Deadline Feb 20, 2026
Notification Due Mar 13, 2026
Final Version Due Mar 30, 2026
Categories    scientific llms   research knowledge graphs   scholarly ir   information extraction
 

Call For Papers

3rd International Workshop on Natural Scientific Language Processing (NSLP 2026)

12 May 2026 – Co-located with LREC 2026
Palma, Mallorca (Spain)

NSLP 2026 features three shared tasks:
- ClimateCheck 2026: Scientific Fact-Checking of Social Media Claims
- SciVQA 2026: Scientific Visual Question Answering
- SOMD 2026: Software Mention Detection & Coreference Resolution

NSLP 2026 – important dates:
- Submission deadline: 20 February 2026
- Notifications: 13 March 2026
- Camera-ready: 30 March 2026

NSLP 2026 website (including the shared tasks):
https://nfdi4ds.github.io/nslp2026

Scientific research has witnessed a steep growth rate over the last decades. The number of scholarly publications is growing exponentially, and doubles every 15-17 years. Consequently, both general and specialised repositories, databases, knowledge graphs, and digital libraries have been developed to publish and manage scientific artifacts. Examples include the Open Research Knowledge Graph (ORKG), the Semantic Scholar Academic Graph (S2AG), PubMed Central and also the ACL Anthology. These resources enable the collection, reuse, tracking, and expansion of scientific findings, and facilitate downstream applications such as scientific search engines.

However, in order to develop robust systems that deal with scholarly text, various challenges need to be addressed. The current status quo of scientific communication mostly includes scholarly articles as unstructured PDF documents, which are not machine-readable in the sense that relevant scientific information can be extracted easily, thus making extracting and utilising this information as part of the scientific process a laborious and time-consuming task. Developing methods for converting unstructured information into structured formats is one of the major challenges in the field of Natural Scientific Language Processing (NSLP). This goal encompasses related challenges such as detecting, disambiguating, and linking mentions of scientific artifacts (e.g., software tools or specific datasets or language resources), and tracking state-of-the-art models and their evaluation scores (including new versions of existing models). Extracting and managing heterogeneous scientific knowledge effectively remains a challenging ongoing research area. Existing efforts are often fragmented, addressing separate issues with distinct datasets and conceptual approaches.

NSLP 2026 addresses current topics and issues in Natural Scientific Language Processing. It is proposed and organised with the support of NFDI for Data Science and Artificial Intelligence (NFDI4DS), a long-term project with approx. 20 partners who work towards building a German national research data infrastructure for DS and AI. The workshop aims to further bring together the international community of researchers who work on NSLP and related topics (including research knowledge graphs), to discuss current issues and possible solutions. NSLP 2026 includes two keynote speakers and presentations of accepted papers (oral and poster presentations), as well as three shared tasks.

Topics of interest include, but are not limited to

- Scientific LLMs – LLMs for NSLP
- Language resources (LRs) and Language technologies (LTs) for NSLP beyond LLMs
- Research Knowledge Graphs (RKGs), Scientific Knowledge Graphs (SKGs) and other forms of structured representation of research-related knowledge
- Information extraction from scholarly articles
- Extraction of research information from texts
- Detection and disambiguation of mentions of datasets, tasks, software or other methods
- Classification of scholarly articles (collections, single documents, parts of documents)
- Information extraction for RKGs
- Summarisation of scholarly articles
- Scholarly IR and scientific search engines
- Question answering over scientific knowledge
- Metadata and cataloging
- Cross-lingual and multilingual natural scientific language processing
- Adaptation of NLP methods for NSLP purposes

Important Dates

- Paper submission deadline: 20 February 2026 (not to be extended)
- Notification of acceptance: 13 March 2026
- Camera-ready submission: 30 March 2026
- Workshop: 12 May 2026

Submission Guidelines

The NSLP 2026 workshop invites submissions of: regular long papers; short papers; position papers. We especially encourage submissions from junior researchers and students from diverse backgrounds.

- Note that we will not accept work that is under review or has already been published in or accepted for publication in a journal, another conference, or another workshop.
- The workshop invites anonymous submissions of regular long papers (up to 8 pages without references and appendix); short papers as well as position papers (up to 4 pages without references and appendix) presenting, for example, negative results, in-progress projects, or demos.
- Authors are permitted to include an optional appendix of up to 2 pages. However, reviewers will not be mandated to review the appendix and all papers must be self-contained.
- Reviewing will be performed double-blind, i.e., submissions must be anonymous. Reviewers will not actively try to identify the authors.
- Submissions must be in PDF, formatted in the LREC 2026 style.
- The proceedings of this workshop will be published in the ACL Anthology (full Open Access) as part of the LREC 2026 proceedings.
- At least one author per contribution must register for the workshop for presentation.
- All submissions are done via START (Softconf) – link to be provided soon.

When submitting a paper through START, the authors will be asked to provide essential information about resources (in a broad sense, i.e., also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.) to enable their reuse and replicability of experiments (including evaluation ones).

################################

Keynote Speakers

Iryna Gurevych, TU Darmstadt, Germany
Yufang Hou, ITU Austria, Austria

###############################

Shared Tasks

ClimateCheck 2026: Scientific Fact-Checking of Social Media Claims

The rise of climate discourse on social media offers new channels for public engagement but also amplifies mis- and disinformation. As online platforms increasingly shape public understanding of science, tools that ground claims in trustworthy, peer-reviewed evidence are necessary. The new iteration of ClimateCheck builds on the results and insights from the 2025 iteration (run at SDP 2025/ACL 2025), offering the following subtasks:

Subtask 1: Abstract retrieval and claim verification: given a claim and corpus of publications, retrieve the top 10 most relevant abstracts and classify each claim-abstract pair as supports, refutes, or not enough information.

Subtask 2: Disinformation narrative classification: given a claim, predict which climate disinformation narrative exists according to a predefined taxonomy.

New training data will be released for both tasks, with task 1 having triple the amount of the last iteration. The new iteration will focus on sustainability, emphasising the need to build climate-friendly NLP systems with minimal environmental impact.

Shared task co-organisers: Raia Abu Ahmad, Aida Usmanova, Max Upravitelev, Georg Rehm




SciVQA 2026: Scientific Visual Question Answering

Scientific papers communicate information through unstructured text as well as (semi-)structured figures and tables. Jointly reasoning over both modalities benefits downstream applications such as visual question answering (VQA). SciVQA 2026 builds on the insights from SciVQA 2025 (run at SDP 2025/ACL 2025), shifting the focus toward evaluating the ability of multimodal LLMs to reason over combined modalities (figures, tables, text). SciVQA 2026 will include a new set of papers and entirely new annotations, featuring two subtasks:

Subtask 1: Context retrieval: Given a question, a paper, and its corpus of paragraphs and images, retrieve the relevant context (tables, figures, paragraphs from the main text) required to answer it.

Subtask 2: Answer generation: Given a question and the context retrieved from the first task, generate an answer.

Shared task co-organisers: Ekaterina Borisova, Georg Rehm





SOMD 2026: Software Mention Detection & Coreference Resolution

Understanding software mentions is crucial for reproducibility and to interpret experimental results. Citations of software are often informal, lacking the use of persistent identifiers, making it hard to infer and disambiguate knowledge about software efficiently. This task will build on SOMD 2025 (run at SDP 2025, co-located with ACL 2025) and focus on entity disambiguation as an under-investigated problem in this context. More precisely, we address the task of coreference resolution of software mentions across multiple documents, i.e. given a set of software mentions extracted from multiple scientific publications, cluster these mentions so that all software mentions in a particular cluster refer to the same real world software. We define three subtasks with varying challenges:

Subtask 1: Software coreference resolution over gold standard mentions. Addresses the task based on high-quality (gold standard) mentions of software that are expert-annotated in multiple publications.

Subtask 2: Software coreference resolution over predicted mentions. Addresses the task on software mentions that are automatically extracted using a baseline model, i.e. reflecting a typical information extraction scenario, where upstream pipelines (such as entity and metadata extraction) are imperfect.

Subtask 3: Software coreference resolution at scale. Addresses the task using predicted mentions of software and metadata at a larger scale. This challenges models to scale effectively, maintain accuracy, and distinguish among an increasingly dense field of similar or overlapping software mentions.

Shared task co-organisers: Sharmila Upadhyaya, Stefan Dietze, Frank Krüger, Wolfgang Otto


##############################

Workshop Organisers

- Georg Rehm (Deutsches Forschungszentrum für Künstliche Intelligenz & Humboldt-Universität zu Berlin, Germany) – main contact: (georg.rehm@dfki.de)
- Stefan Dietze (GESIS Leibniz Institut für Sozialwissenschaften, Cologne & Heinrich-Heine-University Düsseldorf, Germany)
- Danilo Dessí (University of Sharjah, UAE)
- Diana Maynard (University of Sheffield, UK)
- Sonja Schimmler (Technical University of Berlin & Fraunhofer FOKUS, Germany)

Programme Committee

- Marcel Ackermann, Lernzentrum Informatik (LZI), DBLP, Germany
- Raia Abu Ahmad, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Germany
- Tilahun Abedissa Taffa, University of Hamburg, Germany
- Ekaterina Borisova, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Germany
- Davide Buscaldi, LIPN, CNRS, University Paris 13, France
- Leyla Jael Castro, ZB MED Information Centre for Life Sciences, Germany
- Mathieu d’Aquin, Université de Lorraine, France
- Jennifer D’Souza, TIB Leibniz Information Centre for Science and Technology, Germany
- Catherine Faron, Université Côte d’Azur, France
- Dayne Freitag, SRI International, USA
- Paul Groth, University of Amsterdam, TheNetherlands
- Leonhard Hennig, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Germany
- Inma Hernandez, University of Seville, Spain
- Robert Jäschke, Humboldt University of Berlin, Germany
- Petr Knoth, Open University, UK
- Frank Krüger, Wismar University of Applied Sciences, Germany
- Julia Lane, NYU Wagner Graduate School of Public Service, USA
- Andrea Mannocci, CNR-ISTI, Italy
- Natalia Manola, OpenAIRE, Greece
- Mirko Marras, University of Cagliari, Italy
- Philipp Mayr-Schlegel, GESIS Leibniz-Institute for the Social Sciences, Germany
- Pedro Ortiz Suarez, Common Crawl Foundation, USA
- Wolfgang Otto, GESIS Leibniz-Institute for the Social Sciences, Germany
- Haris Papageorgiou, R.C. Athena, Greece
- Silvio Peroni, University of Bologna, Italy
- Simone Ponzetto, Univ. of Mannheim, Germany
- Diego Reforgiato Recupero, University of Cagliari, Italy
- Harald Sack, FIZ Karlsruhe, Germany
- Angelo Salatino, The Open University, UK
- Philipp Schaer, TH Köln (University of Applied Sciences), Germany
- Atsuhiro Takasu, University of Tokyo, Japan
- Stefani Tsaneva, WU Wien, Austria
- Ricardo Usbeck, Leuphana University, Germany
- Thanasis Vergoulis, R.C. Athena, Greece

Related Resources

3rd IFIMITAI 2026   Loxbridge High-Tech Forum series on IT & AI for business, management, finance, industries, innovation and education
TempWeb 2026   The 16th Temporal Web Analytics Workshop (TempWeb 2026)
OCSPOE 2026   2026 2nd International Conference on Optical Communication, Signal Processing, and Optical Engineering
3rd-IC 2025   The Withering Panopticon? Evidence from Everyday Life in North Korean Cities
NSLP 2025   2nd International Workshop on Natural Scientific Language Processing and Research Knowledge Graphs
Image Processing 2025   2025 International Conference on Image Processing
Digital Technology and Language Teaching 2025   Routledge-CRC Press-Taylor and Francis Call for Chapters: Digital Technology and Language Teaching: Exploring New Frontiers in Language Education
International Scientific Online Conferen 2025   International Scientific Online Conference on Business in Emerging Markets entitled: “The Fourth Industrial Revolution: Transforming Business Landscapes and Future Opportunities”
AI and World Language Learning 2025   Routledge Handbook of AI and World Language Learning
Routledge Handbook of AI and Language Le 2026   The Routledge Handbook of AI and Language Learning Call for Chapter Proposals (Updated)