RPM 2023 : 4th Workshop on Responsible Process Mining - Data Governance, Trust, and Privacy
Call For Papers
Process mining has been successfully applied in analysing and improving processes based on event logs in all kinds of environments. Responsible Process Mining (RPM) is highly relevant to our more and more data-driven society and has received less focus. FACT (Fair, Accurate, Confidential, and Transparent) and similar other principles for data science and machine learning have been proposed (cf. redasci.org) to guide the development and application of data science. Issues such as lacking data quality in event logs, identifiable personal data in event logs, biased event logs, learning, discovery techniques with opaque parameters, uncertain event data and many more aspects threaten the compliance to these principles in process analytics. However, process mining could also be applied to help with the “FACT-ful” application of machine learning and other data-driven techniques by bringing transparency. All aspects of RPM are in the scope of this workshop covering a wide range of concepts and challenges such as fairness, accuracy, confidentiality, privacy, transparency, explainability, trust, data governance, data quality, ethics, security, and other related topics. Thus, responsibility in process mining covers a wide range of topics and issues. In the following, we briefly explain some of the main aspects.
Ethics. The field of ethics analyses practices, principles, traditions, and habits. Although widely disseminated as so, the primary interest in ethics is not to define what is “good” or “evil”, “right” or “wrong”. Ethics rationally analyses actions, with Morality as the subject. While Morality fits a subjective perception, considering that each of us is a subject endowed with morals, Ethics is traditionally taken as objective. It involves balancing subjectivity and objectivity, combining personal interest and moral interest. Since computational techniques, in general, are not neutral and, to a large extent, they are embargoed from technological determinisms; ethical analysis is necessary when applying process mining techniques. For example, predictive analytics results through process mining can lead to a behaviour that improves process performance but is not acceptable from an ethical perspective.
Data Governance. One of the aspects of RPM is data governance, which is a new capability for organisations to implement a corporate wide strategy to ensure data is of value. It ties together areas such as data quality, data security, data architecture, and data integration and interoperability. Data quality is a key area in data governance. Usually, a significant amount of time and effort associated with process mining projects is spent on data preparation tasks. A recent survey within the process mining community (XES) shows that more than 60% of the overall effort is spent on data preparation, where challenges such as complex data structures, incomplete, and inconstant data are being addressed. Current approaches to data preparation (e.g., data transformation, data quality auditing and remedies for repairs) are mostly ad-hoc and manual. Thus, there is a need for systematic and preferably automated approaches to event data transformation that will speed up the production of high-quality process data for decision-making purposes.
Privacy. Privacy relates to the concern that event logs may contain personal data of both customers and employees and the challenge of protecting the information about individuals while still being useful for process mining (e.g., differential privacy, k-anonymity, homomorphic encryption, secure multi-party computing). Often, security aspects (e.g., encryption) are closely connected when processing personal data cannot be avoided.
Fairness. Fairness in process mining relates to avoid making unfair decisions based on process mining results even if they are true based on the underlying data. For instance, a decision point in a process model can be equipped with a prediction model that learns from biased data and makes unfair predictions. Fairness concerns need to be addressed in a pre-processing phase, by altering the main algorithms, or in a post-processing phase.
Transparency. Transparency has become one of the main concerns in data science. Particularly, in the field of deep learning where decisions are made based on complicated learning models. In that regard, process mining techniques can be used to provide transparency by explaining the details with a process model language.
Trust. Trust is considered from two perspectives. First, the perspective of trust in organisational and technological measures that event logs are not misused (e.g., for worker surveillance) and the fairness of possibly automated decision taken. Second, from the perspective of trustworthy and transparent results of methods for process analytics, i.e., insights should faithfully reflect reality whereas data usually provides a biased viewpoint on it. This perspective includes aspects such as (the lack of) data quality, ensuring traceability and auditability (e.g., through blockchains).
There exist interdependencies among different aspects of responsible process mining. For example, improving privacy always comes with data utility/quality loss, or providing confidentiality using encryption-based techniques may affect the transparency of results. Moreover, improving some issues such as trust often requires considering several aspects at the same time, e.g., security, privacy, and fairness. These interdependencies are challenging to address, yet, at the same time, they provide a big opportunity for collaboration among researchers to promote the responsibility in process mining as a whole.
The main objective of the RPM workshop is to give a forum where researchers and practitioners can meet each other and start new collaboration points to promote responsible process analytics. We also consider topics from the ethics aspect to clarify real ethical issues for the process mining community with respect to the rules and regulations.
The topics of interest for this workshop, but not limited to:
Privacy-Preserving Methods for Process Mining
Trust and Transparency in Process Mining
Fairness in Process Mining
Data Quality in Process Mining
Data Governance Frameworks for event data
Techniques to detect and repair data quality issues in event data
Automated approaches to data pre-processing for process mining
Uncertainty in Process Mining
Ethics in Process Mining
Explainable Predictive/Prescriptive Process Mining
Federated Learning for Process Mining
Blockchains in Process Mining and Business Process Management
GDPR Compliance of Process Analytics
Process Mining for GDPR Compliance Checking
Process Mining for FACT Compliance
Privacy by Design in Process Mining
Privacy Engineering for Event Logs / Private Data Sharing
Explainable AI and Process Mining
Trustworthy AI and Process Mining
Process Mining in Cybersecurity
Frameworks to detect and quantify the quality of an event log
Case studies highlighting data preparation challenges and lessons learned
Methodologies and best practices for data preparation for process mining
Sustainability and Process Mining