Object-centric Process Mining Model: Difference between revisions

From QPR ProcessAnalyzer Wiki
 
(50 intermediate revisions by the same user not shown)
Line 1: Line 1:
<div style="border:1px solid #dfdfdf;padding:0.5em 1em 0.5em 1em;background-color:#E7EAEC;margin:10px 0px 0px 10px;">
QPR ProcessAnalyzer supports object-centric process mining (OCPM) based on the OCEL 2.0 standard (https://www.ocel-standard.org). To use object-centric functionality, you need to transform data into the [[#Object-centric_model_structure|suitable format]] for the [[#Create_object-centric_model|object-centric model]]. Object-centric models can be analyzed in the object-centric flowchart and with (case-centric) charts because the object-centric model can be converted into a case-centric eventlog using [[#Object-centric_perspectives|perspectives]]. To use the OCPM functionality, Snowflake needs to be used as the calculation engine.
Note: This page describes functionality that hasn't fully been released.
</div>


QPR ProcessAnalyzer supports object-centric process mining models (OCPM) based on the OCEL 2.0 standard (https://www.ocel-standard.org).
== Create object-centric model ==
== Create OCPM model ==
Create a new object-centric model as follows:
New OCPM model is created as follows:
# In the Workspace, open the project where to create the model.
# In the Workspace, open the project where to created the model.
# Select '''NEW''' in the top right menu and select '''model'''.
# Select "NEW" in top right menu and select "OCPM model"
# Define a name for the new model.
# Define a name for the new model and click '''Create'''.
# Set '''Model type''' as '''Object-centric'''.
# Click '''Create'''.


== Configure OCPM model datatables ==
== Configure object-centric model datatables ==
Datatables for the OCPM model can be configured as follows:
Datatables for the object-centric model need to exist in the same project as the model. Datatables can be set for the model as follows:
# In the Workspace, select the OCPM model and click '''Properties'''.
# In the Workspace, select the object-centric model and click '''Properties'''.
# In the dialog, open the '''Datasource''' tab.
# In the model properties dialog, open the '''Datasource''' tab.
# Add following kind of configuration to the textbox:
# Add a following kind of json configuration to the textbox:
<pre>
<pre>
{
{
   "Objects": "ocpm_model - objects",
   "Objects": "OCPM: objects",
   "Events": "ocpm_model - events",
   "Events": "OCPM: events",
   "ObjectToObject": "ocpm_model - object-object",
   "ObjectToObject": "OCPM: object-object",
   "EventToObject": "ocpm_model - event-object",
   "EventToObject": "OCPM: event-object",
   "ObjectTypes": {
   "ObjectTypes": {
     "Invoice": "ocpm_model - objecttype-Invoice",
     "Invoice": "OCPM object: Invoice",
     "Payment": "ocpm_model - objecttype-Payment",
     "Payment": "OCPM object: Payment",
     "Purchase Order": "ocpm_model - objecttype-Purchase Order"
     "Purchase Order": "OCPM object: Purchase Order"
   },
   },
   "EventTypes": {  
   "EventTypes": {  
     "Approve Purchase Requisition": "ocpm_model - eventtype-Approve Purchase Requisition",
     "Approve Purchase Requisition": "OCPM event: Approve Purchase Requisition",
     "Change PO Quantity": "ocpm_model - eventtype-Change PO Quantity",
     "Change PO Quantity": "OCPM event: Change PO Quantity",
     "Create Purchase Order": "ocpm_model - eventtype-Create Purchase Order",
     "Create Purchase Order": "OCPM event: Create Purchase Order",
     "Insert Invoice": "ocpm_model - eventtype-Insert Invoice",
     "Insert Invoice": "OCPM event: Insert Invoice",
     "Insert Payment": "ocpm_model - eventtype-Insert Payment"
     "Insert Payment": "OCPM event: Insert Payment"
   }
   }
}
}
</pre>
</pre>


== Import from OCEL file ==
The json configuration needs to have following properties:
* '''Objects''': Objects datatable name.
* '''Events''': Events datatable name.
* '''ObjectToObject''': Object-to-object relation datatable name.
* '''EventToObject''': Event-to-object relation datatable name.
* '''ObjectTypes''': Key-value-pairs of object type datatable names. Note that object names need to match with object names in the objects datatable.
* '''EventTypes''': Key-value-pairs of event type datatable names. Note that event names need to match with event names in the events datatable.


== OCPM model structure ==
== Import from OCEL 2.0 JSON file ==
OCPM model uses datatables described in the table below. Datatables can be named freely because the model configuration selects the datatable for each type of data. The datatables need to use the column names specified in the table below because those are the column names assumed by the OCPM calculation.
An object-centric model can be imported from an OCEL 2.0 JSON file as follows:
# In the Workspace, open the project where to import the model.
# Select '''NEW''' in top right menu and select '''Import Model'''.
# Select the OCEL 2.0 JSON file from the disk and click '''Open'''.
 
An object-centric model and a list of datatables is created.
 
Example OCEL 2.0 eventlogs: https://www.ocel-standard.org/event-logs/overview/ (download the json format supported by QPR ProcessAnalyzer)
 
== Object-centric model structure ==
An object-centric model contains the datatables described in the table below. Datatables can be named freely, because the model JSON configuration defines which datatable is used for each type of data. Column names, however, cannot be chosen freely: the datatables need to use the column names specified in the table below, because the object-centric calculation assumes those names.


{| class="wikitable"
{| class="wikitable"
!'''Datatable'''
!'''Datatable role'''
!'''Content'''
!'''Contained data'''
! '''Columns'''
! '''Datatable columns'''
|-
|-
||Objects
||Objects
Line 50: Line 65:
||
||
* '''OcelObjectId''': Unique id for the object (among all objects in the model).
* '''OcelObjectId''': Unique id for the object (among all objects in the model).
* '''OcelObjectType''': Object type name (such as Order, Invoice, Delivery).
* '''OcelObjectType''': Object type name (such as Order, Invoice, Delivery). Note that the model JSON configuration needs to use the same object type names.
|-
|-
||Events
||Events
Line 56: Line 71:
||
||
* '''OcelEventId''': Unique id for the event (among all events in the model).
* '''OcelEventId''': Unique id for the event (among all events in the model).
* '''OcelEventType''': Event type name (such as Order created, Invoice sent).
* '''OcelEventType''': Event type name (such as Order created, Invoice sent). Note that the model JSON configuration needs to use the same event type names.
* '''OcelEventTime''': Event timestamp.
* '''OcelEventTime''': Event timestamp.
|-
|-
||Object-object relations
||Object-object relations
||Describes relations between objects and objects (one row per relation).
||Relations between objects (one row per relation).
||
||
* '''OcelObjectObjectSourceId''': Source object id for the relation.
* '''OcelObjectObjectSourceId''': Source object id in the relation.
* '''OcelObjectObjectTargetId''': Target object id for the relation.
* '''OcelObjectObjectTargetId''': Target object id in the relation.
* '''OcelObjectObjectQualifier''': Describes the type of the relation (not used currently).
* '''OcelObjectObjectQualifier''': Describes the type of the relation as free-form text (not used currently).
|-
|-
||Event-object relations
||Event-object relations
||Describes relations between events and objects (one row per relation).
||Relations between events and objects (one row per relation).
||
||
* '''OcelEventObjectSourceId''': Object id for the relation.
* '''OcelEventObjectSourceId''': Object id in the relation.
* '''OcelEventObjectTargetId''': Event id for the relation.
* '''OcelEventObjectTargetId''': Event id in the relation.
* '''OcelEventObjectQualifier''': Describes the type of the relation (not used currently).
* '''OcelEventObjectQualifier''': Describes the type of the relation as free-form text (not used currently).
|-
|-
||Object attributes (several tables)
||Object attributes (several datatables)
||Object attribute values, each object type in a separate table (one row per object).
||Object attribute values, each object type in a separate table (one row per object).
||
||
* '''OcelObjectTypeObjectId''': Object id.
* '''OcelObjectTypeObjectId''': Object id. Matches to the objects datatable ''OcelObjectId'' column.
* '''OcelObjectTypeTime''': Timestamp starting from the attribute value is valid (not used currently).
* '''OcelObjectTypeTime''': Timestamp which the attribute value is valid from (not used currently).
* '''OcelObjectTypeChangedField''': Changed object attribute name (not used currently).
* '''OcelObjectTypeChangedField''': Changed object attribute name (not used currently).
* '''<Object attributes>''': Columns for each of the object attributes.
* '''<Object attributes>''': Columns for each of the object attribute values (column name is the object attribute name).
|-
|-
||Event attributes (several tables)
||Event attributes (several datatables)
||Event attribute values, each event type in a separate table (one row per event).
||Event attribute values, each event type in a separate table (one row per event).
||
||
* '''OcelEventTypeEventId''': Event id.
* '''OcelEventTypeEventId''': Event id. Matches to the events datatable ''OcelEventId'' column.
* '''<Event attributes>''': Columns for each of the event attributes.
* '''<Event attributes>''': Columns for each of the event attribute values (column name is the event attribute name).
|}
|}


== OCPM Perspectives ==
== Object-centric perspectives ==
Perspectives convert an OCPM model into the traditional case-centric model, allowing to analyze OCPM models in charts in dashboards. A single perspective is not able describe the OCPM model entirely, but just from a certain limited "perspective". By using several perspectives, it's possible to get a more complete picture of the OCPM model. The perspective starts from a certain object type and traverses the object-object relations as many steps as desired.
Perspectives convert an object-centric model into a traditional case-centric eventlog, which makes it possible to view and analyze object-centric models with the analyses provided by charts. A single perspective cannot describe the object-centric model entirely; it shows the model only from a certain limited viewpoint. By using analyses with several perspectives, it's possible to get a more complete picture of the object-centric model. A perspective starts from a certain object type and traverses the object-object relations as many steps as desired.
 
To define a perspective, the following settings are defined in the chart settings:
* '''Base Object type''': Object of this type will be cases in the projected case-centric eventlog.
* '''Object Relation Steps''': Specifies how many object-object relations will be traversed in order to find events connected to the base objects. Value zero means that only those events are returned that are directly connected to the base objects.
* '''Show Event Types''': List of event type names which are included into the perspective eventlog. If no events are explicitly defined, all events will be included, but their event attributes are not included.
 
The resulting perspective eventlog will have the following columns:
* '''OcelObjectId''' (mapped to case id)
* '''OcelEventType''' (mapped to event type)
* '''OcelEventTime''' (mapped to timestamp)
* '''OcelEventId'''
* Object attributes of the base object type. Note that the object attribute values are "repeated" for all events belonging to the same object.
* Event attributes of the selected event types. Values are null for events that don't have the attribute.
 
The attributes of the base object type are available as case attributes. As object attribute values may change over time in the OCEL 2.0 data, the last attribute value is used as the case attribute value. Note that the attributes of other object types are not available as case attributes, so the object type whose attributes are needed has to be set as the base object type.


To define a perspective, the following settings are defined:
== Save perspective to filter ==
* '''Object type''': The object type that all events should be projected to.
It's possible to include an object-centric perspective in a stored filter. When a filter is selected, the perspective in the filter is also applied to the dashboard. This allows you to quickly change perspectives for the entire dashboard. A chart-specific perspective overrides the dashboard-level perspective, so the dashboard-level perspective is only applied to charts that don't have a chart-specific perspective defined.
* '''Event types''': An array of event type names whose attributes are to be included into the perspective. If not defined, all the event types will be included, but their type specific attributes are not included.
* '''Number of relation steps''': Specifies how many object-object relations will be traversed in order to find events connected to the specified object type. Default value is 0, which means that only those events are returned that are directly connected to objects of the specified object type.


Generates a projection where resulting rows, each representing one event, are mapped to objects of type "Payment". If an event in OCEL model is not connected to Payment-object, even after performing two iterations of searches via ObjectToObject-table rows), that event will not show up in the result.  
Perspective can be added to a filter as follows:
# Go to the ''Process Discovery'' dashboard.
# Open the ''Session variables'' dialog in the dots menu on top right.
# Paste the filter json to the ''Value'' of the ''Filter'' variable (it might be easiest to start with a filter without filter rules, and then add the filter rules using the UI).
# Click ''Done'' button for the dialog.
# Save the filter by hovering the ''Unsaved filter'' (filters dropdown list) in the header and click ''Save as new filter''.


The resulting perspective eventlog will have the following columns:
Example: Filter json without any filter rules:
* OcelEventId
<pre>
* OcelObjectId (mapped to case id)
{
* OcelEventTime (mapped to timestamp)
  "Items": [],
* All the object type attributes in the perspective's select object type.
  "Perspective": {
* All the event type attributes in all the perspective's select event types.
    "ObjectType": "Container",
    "RecursionDepth": 0
  }
}
</pre>
 
Example: Filter json with a filter rule:
<pre>
{
  "Items": [
    {
      "Type": "IncludeEvents",
      "Items": [
        {
          "Type": "Attribute",
          "Attribute": "OcelEventId",
          "StringifiedValues": [ "0Event 1" ]
        }
      ]
    }
  ],
  "Perspective": {
    "ObjectType":  "Container",
    "RecursionDepth": 0
  }
}
</pre>


== Differences to OCEL 2.0 standard ==
== Differences to OCEL 2.0 standard ==
The OCPM models in QPR ProcessAnalyzer are mainly following the OCEL 2.0 standard, but there are the following exceptions:
Object-centric models in QPR ProcessAnalyzer are mainly following the OCEL 2.0 standard, but there are the following differences:
* Changing of object attributes values over time is not supported.
* Changing of object attributes values over time is not supported.
* ocel_time field of each event type table is moved to events table as every event anyways has a timestemp.  
* The ''ocel_time'' field of each event type table is moved to the events datatable (as every event has a timestamp).
* *_map_type are not needed as the model settings are used for the same purpose.  
* ''*_map_type'' columns are not needed as the model settings are used for the same purpose.  
* Object type tables: If OcelObjectTypeChangedField is not null, all the other field values are copied from the previous entry having the same OcelObjectTypeObjectId, except:  
* Object type tables: If OcelObjectTypeChangedField is not null, all the other field values are copied from the previous entry except:  
** OcelObjectTypeChangedField, which has the name(s) of the changed field(s) as comma separated string.  
** ''OcelObjectTypeChangedField'' which has the names of the changed fields as a comma separated string.  
** The actual changed field, which has the new value.  
** The actual changed field which has the new value.  
** OcelObjectTypeTime, which has the timestamp when the value changed.
** ''OcelObjectTypeTime'' which has the timestamp when the value changed.

Latest revision as of 08:44, 14 November 2024

QPR ProcessAnalyzer supports object-centric process mining (OCPM) based on the OCEL 2.0 standard (https://www.ocel-standard.org). To use object-centric functionality, you need to transform data into the suitable format for the object-centric model. Object-centric models can be analyzed in the object-centric flowchart and with (case-centric) charts because the object-centric model can be converted into a case-centric eventlog using perspectives. To use the OCPM functionality, Snowflake needs to be used as the calculation engine.

Create object-centric model

Create a new object-centric model as follows:

  1. In the Workspace, open the project in which to create the model.
  2. Select NEW in the top right menu and select model.
  3. Define a name for the new model.
  4. Set Model type as Object-centric.
  5. Click Create.

Configure object-centric model datatables

Datatables for the object-centric model need to exist in the same project as the model. Datatables can be set for the model as follows:

  1. In the Workspace, select the object-centric model and click Properties.
  2. In the model properties dialog, open the Datasource tab.
  3. Add a JSON configuration like the following to the textbox:
{
  "Objects": "OCPM: objects",
  "Events": "OCPM: events",
  "ObjectToObject": "OCPM: object-object",
  "EventToObject": "OCPM: event-object",
  "ObjectTypes": {
    "Invoice": "OCPM object: Invoice",
    "Payment": "OCPM object: Payment",
    "Purchase Order": "OCPM object: Purchase Order"
  },
  "EventTypes": { 
    "Approve Purchase Requisition": "OCPM event: Approve Purchase Requisition",
    "Change PO Quantity": "OCPM event: Change PO Quantity",
    "Create Purchase Order": "OCPM event: Create Purchase Order",
    "Insert Invoice": "OCPM event: Insert Invoice",
    "Insert Payment": "OCPM event: Insert Payment"
  }
}

The JSON configuration needs to have the following properties:

  • Objects: Objects datatable name.
  • Events: Events datatable name.
  • ObjectToObject: Object-to-object relation datatable name.
  • EventToObject: Event-to-object relation datatable name.
  • ObjectTypes: Key-value pairs that map each object type name to the name of its datatable. Note that the object type names need to match the object type names in the objects datatable.
  • EventTypes: Key-value pairs that map each event type name to the name of its datatable. Note that the event type names need to match the event type names in the events datatable.
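
The property list above can be checked mechanically before pasting a configuration into the Datasource tab. The following is a minimal sketch in Python, not part of QPR ProcessAnalyzer: the helper name check_datasource_config is hypothetical, and the only rules it applies are the ones listed above (six properties, with ObjectTypes and EventTypes being name-to-datatable mappings).

```python
import json

# Property names taken from the list above.
REQUIRED_KEYS = ("Objects", "Events", "ObjectToObject", "EventToObject",
                 "ObjectTypes", "EventTypes")


def check_datasource_config(text):
    """Return a list of problems found in a datasource configuration string.

    Hypothetical helper for illustration; an empty list means that all six
    properties are present and the two type mappings look well-formed.
    """
    config = json.loads(text)
    problems = []
    for key in REQUIRED_KEYS:
        if key not in config:
            problems.append(f"missing property: {key}")
    for key in ("ObjectTypes", "EventTypes"):
        mapping = config.get(key, {})
        if not isinstance(mapping, dict):
            problems.append(f"{key} must map type names to datatable names")
            continue
        for type_name, table in mapping.items():
            if not isinstance(table, str) or not table:
                problems.append(f"{key}[{type_name!r}] must be a datatable name")
    return problems


example = json.dumps({
    "Objects": "OCPM: objects",
    "Events": "OCPM: events",
    "ObjectToObject": "OCPM: object-object",
    "EventToObject": "OCPM: event-object",
    "ObjectTypes": {"Invoice": "OCPM object: Invoice"},
    "EventTypes": {"Insert Invoice": "OCPM event: Insert Invoice"},
})
print(check_datasource_config(example))
```

The check does not verify that the named datatables exist in the project or that the type names match the data; that would require access to the datatables themselves.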

Import from OCEL 2.0 JSON file

An object-centric model can be imported from an OCEL 2.0 JSON file as follows:

  1. In the Workspace, open the project into which to import the model.
  2. Select NEW in the top right menu and select Import Model.
  3. Select the OCEL 2.0 JSON file from the disk and click Open.

An object-centric model and a list of datatables are created.

Example OCEL 2.0 eventlogs: https://www.ocel-standard.org/event-logs/overview/ (download the json format supported by QPR ProcessAnalyzer)

Object-centric model structure

An object-centric model contains the datatables described in the table below. Datatables can be named freely, because the model JSON configuration defines which datatable is used for each type of data. Column names, however, cannot be chosen freely: the datatables need to use the column names specified in the table below, because the object-centric calculation assumes those names.

Datatable role: Contained data, followed by the datatable columns.

Objects: Objects in the model (one row per object).
  • OcelObjectId: Unique id for the object (among all objects in the model).
  • OcelObjectType: Object type name (such as Order, Invoice, Delivery). Note that the model JSON configuration needs to use the same object type names.

Events: Events in the model (one row per event).
  • OcelEventId: Unique id for the event (among all events in the model).
  • OcelEventType: Event type name (such as Order created, Invoice sent). Note that the model JSON configuration needs to use the same event type names.
  • OcelEventTime: Event timestamp.

Object-object relations: Relations between objects (one row per relation).
  • OcelObjectObjectSourceId: Source object id in the relation.
  • OcelObjectObjectTargetId: Target object id in the relation.
  • OcelObjectObjectQualifier: Describes the type of the relation as free-form text (not used currently).

Event-object relations: Relations between events and objects (one row per relation).
  • OcelEventObjectSourceId: Object id in the relation.
  • OcelEventObjectTargetId: Event id in the relation.
  • OcelEventObjectQualifier: Describes the type of the relation as free-form text (not used currently).

Object attributes (several datatables): Object attribute values, each object type in a separate datatable (one row per object).
  • OcelObjectTypeObjectId: Object id. Matches the OcelObjectId column of the objects datatable.
  • OcelObjectTypeTime: Timestamp from which the attribute value is valid (not used currently).
  • OcelObjectTypeChangedField: Changed object attribute name (not used currently).
  • <Object attributes>: One column for each object attribute (the column name is the object attribute name).

Event attributes (several datatables): Event attribute values, each event type in a separate datatable (one row per event).
  • OcelEventTypeEventId: Event id. Matches the OcelEventId column of the events datatable.
  • <Event attributes>: One column for each event attribute (the column name is the event attribute name).
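
Because the column names are fixed, it can help to check a datatable's columns against the list above before loading data. The sketch below is illustrative Python only; the helper missing_columns and the role keys (borrowed from the model JSON configuration) are assumptions. The page does not say whether the columns marked "not used currently" may be omitted, so the sketch simply treats every listed column as expected.

```python
# Expected column names per datatable role, copied from the table above.
# Attribute columns are free-form and therefore not listed here.
EXPECTED_COLUMNS = {
    "Objects": {"OcelObjectId", "OcelObjectType"},
    "Events": {"OcelEventId", "OcelEventType", "OcelEventTime"},
    "ObjectToObject": {"OcelObjectObjectSourceId", "OcelObjectObjectTargetId",
                       "OcelObjectObjectQualifier"},
    "EventToObject": {"OcelEventObjectSourceId", "OcelEventObjectTargetId",
                      "OcelEventObjectQualifier"},
    "ObjectTypes": {"OcelObjectTypeObjectId", "OcelObjectTypeTime",
                    "OcelObjectTypeChangedField"},
    "EventTypes": {"OcelEventTypeEventId"},
}


def missing_columns(role, columns):
    """Return the expected column names that are absent, in sorted order."""
    return sorted(EXPECTED_COLUMNS[role] - set(columns))


# An events datatable that lacks its timestamp column.
print(missing_columns("Events", ["OcelEventId", "OcelEventType"]))
# An invoice datatable with one extra attribute column, "Amount".
print(missing_columns("ObjectTypes", ["OcelObjectTypeObjectId", "OcelObjectTypeTime",
                                      "OcelObjectTypeChangedField", "Amount"]))
```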

Object-centric perspectives

Perspectives convert an object-centric model into a traditional case-centric eventlog, which makes it possible to view and analyze object-centric models with the analyses provided by charts. A single perspective cannot describe the object-centric model entirely; it shows the model only from a certain limited viewpoint. By using analyses with several perspectives, it's possible to get a more complete picture of the object-centric model. A perspective starts from a certain object type and traverses the object-object relations as many steps as desired.

A perspective is defined with the following chart settings:

  • Base Object type: Objects of this type will be the cases in the projected case-centric eventlog.
  • Object Relation Steps: Specifies how many object-object relations will be traversed in order to find events connected to the base objects. The value zero means that only events directly connected to the base objects are returned.
  • Show Event Types: List of event type names that are included in the perspective eventlog. If no event types are explicitly defined, all events will be included, but their event attributes are not included.

The resulting perspective eventlog will have the following columns:

  • OcelObjectId (mapped to case id)
  • OcelEventType (mapped to event type)
  • OcelEventTime (mapped to timestamp)
  • OcelEventId
  • Object attributes of the base object type. Note that the object attribute values are "repeated" for all events belonging to the same object.
  • Event attributes of the selected event types. Values are null for events that don't have the attribute.

The attributes of the base object type are available as case attributes. As object attribute values may change over time in the OCEL 2.0 data, the last attribute value is used as the case attribute value. Note that the attributes of other object types are not available as case attributes, so the object type whose attributes are needed has to be set as the base object type.
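
The projection described in this section can be illustrated with a small Python sketch. It is not QPR ProcessAnalyzer's implementation (the calculation runs in Snowflake), and it rests on assumptions that this page does not confirm: object-object relations are followed in both directions, attributes and the Show Event Types setting are left out, and the function name project_perspective and the in-memory data layout are invented for the example.

```python
from collections import defaultdict


def project_perspective(objects, events, object_to_object, event_to_object,
                        base_object_type, relation_steps=0):
    """Project an object-centric model to a case-centric eventlog.

    objects: {object id: object type}
    events: {event id: (event type, timestamp)}
    object_to_object: list of (source object id, target object id)
    event_to_object: list of (object id, event id)
    Returns rows of (case id, event type, timestamp, event id).
    """
    neighbours = defaultdict(set)
    for source, target in object_to_object:
        neighbours[source].add(target)
        neighbours[target].add(source)  # assumption: relations are undirected

    events_of = defaultdict(set)
    for object_id, event_id in event_to_object:
        events_of[object_id].add(event_id)

    rows = []
    for base_id, object_type in objects.items():
        if object_type != base_object_type:
            continue
        # Collect the objects reachable within the given number of steps.
        reached = {base_id}
        frontier = {base_id}
        for _ in range(relation_steps):
            frontier = {n for o in frontier for n in neighbours[o]} - reached
            reached |= frontier
        # Every event of a reached object becomes an event of the base object.
        for event_id in {e for o in reached for e in events_of[o]}:
            event_type, time = events[event_id]
            rows.append((base_id, event_type, time, event_id))
    return sorted(rows, key=lambda row: (row[0], row[2], row[3]))


objects = {"po1": "Purchase Order", "inv1": "Invoice", "pay1": "Payment"}
events = {
    "e1": ("Create Purchase Order", "2024-01-01"),
    "e2": ("Insert Invoice", "2024-01-05"),
    "e3": ("Insert Payment", "2024-01-09"),
}
object_to_object = [("po1", "inv1"), ("inv1", "pay1")]
event_to_object = [("po1", "e1"), ("inv1", "e2"), ("pay1", "e3")]

for steps in (0, 1, 2):
    print(steps, project_perspective(objects, events, object_to_object,
                                     event_to_object, "Payment", steps))
```

With Payment as the base object type, zero steps yields only the payment's own event, one step adds the invoice event, and two steps add the purchase order event as well, which is why an event that cannot be reached within the chosen number of steps is absent from the resulting eventlog.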

Save perspective to filter

It's possible to include an object-centric perspective in a stored filter. When a filter is selected, the perspective in the filter is also applied to the dashboard. This allows you to quickly change perspectives for the entire dashboard. A chart-specific perspective overrides the dashboard-level perspective, so the dashboard-level perspective is only applied to charts that don't have a chart-specific perspective defined.

A perspective can be added to a filter as follows:

  1. Go to the Process Discovery dashboard.
  2. Open the Session variables dialog from the dots menu in the top right.
  3. Paste the filter JSON into the Value of the Filter variable (it might be easiest to start with a filter without filter rules, and then add the filter rules using the UI).
  4. Click the Done button in the dialog.
  5. Save the filter by hovering over the Unsaved filter (filters dropdown list) in the header and clicking Save as new filter.

Example: Filter json without any filter rules:

{
  "Items": [],
  "Perspective": {
    "ObjectType": "Container",
    "RecursionDepth": 0
  }
}

Example: Filter json with a filter rule:

{
  "Items": [
    {
      "Type": "IncludeEvents",
      "Items": [
        {
          "Type": "Attribute",
          "Attribute": "OcelEventId",
          "StringifiedValues": [ "0Event 1" ]
        }
      ]
    }
  ],
  "Perspective": {
    "ObjectType":  "Container",
    "RecursionDepth": 0
  }
}
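
When several perspectives are needed, the filter JSON can be generated instead of edited by hand. The Python sketch below only reproduces the structure of the two examples above; the helper name make_perspective_filter is hypothetical, and the sketch makes no claim about filter properties that the examples do not show.

```python
import json


def make_perspective_filter(object_type, recursion_depth=0, items=None):
    """Build a filter dictionary shaped like the examples above.

    items is an optional list of filter rules; it is copied unchanged.
    """
    return {
        "Items": list(items or []),
        "Perspective": {
            "ObjectType": object_type,
            "RecursionDepth": recursion_depth,
        },
    }


# Same content as the first example: no filter rules, "Container" perspective.
print(json.dumps(make_perspective_filter("Container"), indent=2))
```

The printed text can then be pasted into the Value of the Filter variable as described in the steps above.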

Differences to OCEL 2.0 standard

Object-centric models in QPR ProcessAnalyzer mainly follow the OCEL 2.0 standard, with the following differences:

  • Changes of object attribute values over time are not supported.
  • The ocel_time field of each event type table is moved to the events datatable (as every event has a timestamp).
  • The *_map_type columns are not needed, as the model settings are used for the same purpose.
  • Object type tables: If OcelObjectTypeChangedField is not null, all the other field values are copied from the previous entry except:
    • OcelObjectTypeChangedField which has the names of the changed fields as a comma-separated string.
    • The actual changed field which has the new value.
    • OcelObjectTypeTime which has the timestamp when the value changed.
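
The copy rule for object type tables can be sketched as follows. This is an illustrative Python example under assumptions: the rows belong to a single object and are already ordered by OcelObjectTypeTime, a change row carries a value only for the changed fields, and the helper name fill_change_rows is invented here.

```python
def fill_change_rows(rows, attribute_names):
    """Copy unchanged attribute values forward from the previous entry.

    rows: dictionaries for ONE object, ordered by OcelObjectTypeTime.
    attribute_names: names of the object attribute columns.
    """
    filled = []
    previous = None
    for row in rows:
        current = dict(row)
        changed = row.get("OcelObjectTypeChangedField")
        if changed is not None and previous is not None:
            changed_names = {name.strip() for name in changed.split(",")}
            for name in attribute_names:
                if name not in changed_names:
                    # Unchanged attribute: take the value of the previous entry.
                    current[name] = previous[name]
        filled.append(current)
        previous = current
    return filled


rows = [
    {"OcelObjectTypeObjectId": "inv1", "OcelObjectTypeTime": "2024-01-01",
     "OcelObjectTypeChangedField": None, "Amount": 100, "Status": "Open"},
    {"OcelObjectTypeObjectId": "inv1", "OcelObjectTypeTime": "2024-02-01",
     "OcelObjectTypeChangedField": "Status", "Amount": None, "Status": "Paid"},
]
filled = fill_change_rows(rows, ["Amount", "Status"])
print(filled[-1])
```

In this example the last filled row keeps Amount from the first entry and takes the new Status, so it holds the latest value of every attribute for the object.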