Object-centric Process Mining Model: Difference between revisions

From QPR ProcessAnalyzer Wiki

Latest revision as of 21:57, 15 September 2024

Note: This page describes functionality that will be available in QPR ProcessAnalyzer 2024.6 (release date 2024-09-17).

QPR ProcessAnalyzer supports object-centric process mining (OCPM) based on the OCEL 2.0 standard (https://www.ocel-standard.org). To use object-centric functionality, you need to transform the data into a suitable format for the object-centric model. Object-centric models can be analyzed in the object-centric flowchart and with (case-centric) charts, because the object-centric model can be converted into a case-centric eventlog using perspectives. To use the OCPM functionality, Snowflake needs to be used as the calculation engine.

Create object-centric model

Create a new object-centric model as follows:

  1. In the Workspace, open the project in which to create the model.
  2. Select NEW in the top right menu and select model.
  3. Define a name for the new model.
  4. Set Model type as Object-centric.
  5. Click Create.

Configure object-centric model datatables

Datatables for the object-centric model need to exist in the same project as the model. Datatables can be set for the model as follows:

  1. In the Workspace, select the object-centric model and click Properties.
  2. In the model properties dialog, open the Datasource tab.
  3. Add a JSON configuration like the following to the textbox:
{
  "Objects": "OCPM: objects",
  "Events": "OCPM: events",
  "ObjectToObject": "OCPM: object-object",
  "EventToObject": "OCPM: event-object",
  "ObjectTypes": {
    "Invoice": "OCPM object: Invoice",
    "Payment": "OCPM object: Payment",
    "Purchase Order": "OCPM object: Purchase Order"
  },
  "EventTypes": { 
    "Approve Purchase Requisition": "OCPM event: Approve Purchase Requisition",
    "Change PO Quantity": "OCPM event: Change PO Quantity",
    "Create Purchase Order": "OCPM event: Create Purchase Order",
    "Insert Invoice": "OCPM event: Insert Invoice",
    "Insert Payment": "OCPM event: Insert Payment"
  }
}

The JSON configuration needs to have the following properties:

  • Objects: Objects datatable name.
  • Events: Events datatable name.
  • ObjectToObject: Object-to-object relation datatable name.
  • EventToObject: Event-to-object relation datatable name.
  • ObjectTypes: Key-value pairs that map each object type name to its datatable name. The object type names need to match the object type names in the objects datatable.
  • EventTypes: Key-value pairs that map each event type name to its datatable name. The event type names need to match the event type names in the events datatable.
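
The property list above can be checked mechanically before pasting a configuration into the Datasource tab. The following is a minimal sketch in Python; the function name check_model_config is hypothetical and not part of QPR ProcessAnalyzer, and the check only covers the presence and basic shape of the six properties, not whether the named datatables actually exist in the project.

```python
import json

# Properties whose value is a single datatable name.
REQUIRED_TABLE_KEYS = ("Objects", "Events", "ObjectToObject", "EventToObject")
# Properties whose value maps a type name to a datatable name.
REQUIRED_MAP_KEYS = ("ObjectTypes", "EventTypes")


def check_model_config(config_text):
    """Return a list of problems found in the configuration (empty if none).

    Hypothetical helper for illustration only.
    """
    config = json.loads(config_text)
    problems = []
    for key in REQUIRED_TABLE_KEYS:
        value = config.get(key)
        if not isinstance(value, str) or not value:
            problems.append(f"{key}: expected a datatable name")
    for key in REQUIRED_MAP_KEYS:
        mapping = config.get(key)
        if not isinstance(mapping, dict):
            problems.append(f"{key}: expected key-value pairs")
            continue
        for type_name, table_name in mapping.items():
            if not isinstance(table_name, str) or not table_name:
                problems.append(f"{key}[{type_name}]: expected a datatable name")
    return problems


example = """
{
  "Objects": "OCPM: objects",
  "Events": "OCPM: events",
  "ObjectToObject": "OCPM: object-object",
  "EventToObject": "OCPM: event-object",
  "ObjectTypes": {"Invoice": "OCPM object: Invoice"},
  "EventTypes": {"Insert Invoice": "OCPM event: Insert Invoice"}
}
"""

print(check_model_config(example))
```

An empty list means that all six properties are present and have the expected shape.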

Import from OCEL 2.0 JSON file

An object-centric model can be imported from an OCEL 2.0 JSON file as follows:

  1. In the Workspace, open the project into which to import the model.
  2. Select NEW in the top right menu and select Import Model.
  3. Select the OCEL 2.0 JSON file from the disk and click Open.

An object-centric model and a list of datatables are created.

Example OCEL 2.0 eventlogs: https://www.ocel-standard.org/event-logs/overview/ (download the json format supported by QPR ProcessAnalyzer)

Object-centric model structure

An object-centric model contains the datatables described in the table below. Datatables can be named freely, because the model JSON configuration defines which datatable is used for each type of data. The datatables need to use the column names specified in the table below, because those are the column names assumed by the object-centric calculation (i.e., column names cannot be selected freely).

Datatable role: Objects
Contained data: Objects in the model (one row per object).
Datatable columns:
  • OcelObjectId: Unique id for the object (among all objects in the model).
  • OcelObjectType: Object type name (such as Order, Invoice, Delivery). Note that the model JSON configuration needs to use the same object type names.

Datatable role: Events
Contained data: Events in the model (one row per event).
Datatable columns:
  • OcelEventId: Unique id for the event (among all events in the model).
  • OcelEventType: Event type name (such as Order created, Invoice sent). Note that the model JSON configuration needs to use the same event type names.
  • OcelEventTime: Event timestamp.

Datatable role: Object-object relations
Contained data: Relations between objects (one row per relation).
Datatable columns:
  • OcelObjectObjectSourceId: Source object id in the relation.
  • OcelObjectObjectTargetId: Target object id in the relation.
  • OcelObjectObjectQualifier: Describes the type of the relation as free-form text (not used currently).

Datatable role: Event-object relations
Contained data: Relations between events and objects (one row per relation).
Datatable columns:
  • OcelEventObjectSourceId: Object id in the relation.
  • OcelEventObjectTargetId: Event id in the relation.
  • OcelEventObjectQualifier: Describes the type of the relation as free-form text (not used currently).

Datatable role: Object attributes (several datatables)
Contained data: Object attribute values, each object type in a separate datatable (one row per object).
Datatable columns:
  • OcelObjectTypeObjectId: Object id. Matches the OcelObjectId column of the objects datatable.
  • OcelObjectTypeTime: Timestamp from which the attribute value is valid (not used currently).
  • OcelObjectTypeChangedField: Changed object attribute name (not used currently).
  • <Object attributes>: One column for each object attribute (the column name is the object attribute name).

Datatable role: Event attributes (several datatables)
Contained data: Event attribute values, each event type in a separate datatable (one row per event).
Datatable columns:
  • OcelEventTypeEventId: Event id. Matches the OcelEventId column of the events datatable.
  • <Event attributes>: One column for each event attribute (the column name is the event attribute name).
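
To make the structure above concrete, here is a small sketch in Python that holds each datatable as a list of rows using the required column names and checks that the ids refer to each other consistently. The sample rows, the attribute columns Amount and Clerk, and the function find_problems are invented for illustration; they are not part of QPR ProcessAnalyzer.

```python
# Invented sample data; only the Ocel* column names come from the structure above.
objects = [
    {"OcelObjectId": "po-1", "OcelObjectType": "Purchase Order"},
    {"OcelObjectId": "inv-1", "OcelObjectType": "Invoice"},
]
events = [
    {"OcelEventId": "e1", "OcelEventType": "Create Purchase Order",
     "OcelEventTime": "2024-01-02T09:00:00"},
    {"OcelEventId": "e2", "OcelEventType": "Insert Invoice",
     "OcelEventTime": "2024-01-05T14:30:00"},
]
object_to_object = [
    {"OcelObjectObjectSourceId": "inv-1", "OcelObjectObjectTargetId": "po-1",
     "OcelObjectObjectQualifier": "invoice for"},
]
event_to_object = [
    {"OcelEventObjectSourceId": "po-1", "OcelEventObjectTargetId": "e1",
     "OcelEventObjectQualifier": None},
    {"OcelEventObjectSourceId": "inv-1", "OcelEventObjectTargetId": "e2",
     "OcelEventObjectQualifier": None},
]
# One table per object type / event type, keyed by the type name.
object_type_tables = {
    "Invoice": [{"OcelObjectTypeObjectId": "inv-1", "OcelObjectTypeTime": None,
                 "OcelObjectTypeChangedField": None, "Amount": 120.0}],
}
event_type_tables = {
    "Insert Invoice": [{"OcelEventTypeEventId": "e2", "Clerk": "A. Smith"}],
}


def find_problems(objects, events, object_to_object, event_to_object,
                  object_type_tables, event_type_tables):
    """Return a list of id references that do not resolve (empty if consistent)."""
    object_types = {o["OcelObjectId"]: o["OcelObjectType"] for o in objects}
    event_types = {e["OcelEventId"]: e["OcelEventType"] for e in events}
    problems = []
    for rel in object_to_object:
        for column in ("OcelObjectObjectSourceId", "OcelObjectObjectTargetId"):
            if rel[column] not in object_types:
                problems.append(f"{column}: unknown object {rel[column]}")
    for rel in event_to_object:
        if rel["OcelEventObjectSourceId"] not in object_types:
            problems.append(
                f"OcelEventObjectSourceId: unknown object {rel['OcelEventObjectSourceId']}")
        if rel["OcelEventObjectTargetId"] not in event_types:
            problems.append(
                f"OcelEventObjectTargetId: unknown event {rel['OcelEventObjectTargetId']}")
    for type_name, rows in object_type_tables.items():
        for row in rows:
            if object_types.get(row["OcelObjectTypeObjectId"]) != type_name:
                problems.append(
                    f"{type_name}: {row['OcelObjectTypeObjectId']} is not an object of this type")
    for type_name, rows in event_type_tables.items():
        for row in rows:
            if event_types.get(row["OcelEventTypeEventId"]) != type_name:
                problems.append(
                    f"{type_name}: {row['OcelEventTypeEventId']} is not an event of this type")
    return problems


print(find_problems(objects, events, object_to_object, event_to_object,
                    object_type_tables, event_type_tables))
```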

Object-centric perspectives

Perspectives convert an object-centric model into a traditional case-centric eventlog, which makes it possible to view and analyze object-centric models in the analyses provided by charts. A single perspective cannot describe the object-centric model entirely, only from a certain limited viewpoint. By using analyses with several perspectives, it's possible to get a more complete picture of the object-centric model. A perspective starts from a certain object type and traverses the object-object relations as many steps as desired.

A perspective is defined with the following chart settings:

  • Base Object type: Objects of this type become the cases in the projected case-centric eventlog.
  • Object Relation Steps: Specifies how many object-object relations are traversed in order to find events connected to the base objects. The value zero means that only events directly connected to the base objects are returned.
  • Show Event Types: List of event type names that are included in the perspective eventlog. If no event types are explicitly defined, all event types are included, but their event attributes are not.

The resulting perspective eventlog will have the following columns:

  • OcelObjectId (mapped to case id)
  • OcelEventType (mapped to event type)
  • OcelEventTime (mapped to timestamp)
  • OcelEventId
  • Object attributes of the base object type. Note that the object attribute values are "repeated" for all events belonging to the same object.
  • Event attributes of the selected event types. Values are null for events that don't have the attribute.

Note that there are no case attributes in the perspective eventlog.
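
The projection described in this section can be sketched in a few lines of Python. This is only an illustration of the idea, not the calculation QPR ProcessAnalyzer runs in Snowflake: the function build_perspective and the sample rows are hypothetical, object-object relations are assumed to be traversed in both directions (the page does not specify the direction), and object and event attribute columns are left out for brevity.

```python
def build_perspective(objects, events, object_to_object, event_to_object,
                      base_object_type, relation_steps=0, show_event_types=None):
    """Project the model onto base objects; returns one row per (case, event)."""
    # Assumption: relations are followed regardless of source/target direction.
    neighbors = {}
    for rel in object_to_object:
        source = rel["OcelObjectObjectSourceId"]
        target = rel["OcelObjectObjectTargetId"]
        neighbors.setdefault(source, set()).add(target)
        neighbors.setdefault(target, set()).add(source)

    events_by_object = {}
    for rel in event_to_object:
        events_by_object.setdefault(rel["OcelEventObjectSourceId"], set()).add(
            rel["OcelEventObjectTargetId"])
    event_by_id = {e["OcelEventId"]: e for e in events}

    rows = []
    for obj in objects:
        if obj["OcelObjectType"] != base_object_type:
            continue
        # Collect the base object and everything within relation_steps of it.
        reached = {obj["OcelObjectId"]}
        frontier = set(reached)
        for _ in range(relation_steps):
            frontier = {n for o in frontier for n in neighbors.get(o, ())} - reached
            reached |= frontier
        event_ids = {eid for o in reached for eid in events_by_object.get(o, ())}
        for event_id in event_ids:
            event = event_by_id.get(event_id)
            if event is None:
                continue  # relation points at an event that does not exist
            if show_event_types and event["OcelEventType"] not in show_event_types:
                continue
            rows.append({
                "OcelObjectId": obj["OcelObjectId"],      # case id
                "OcelEventType": event["OcelEventType"],  # event type
                "OcelEventTime": event["OcelEventTime"],  # timestamp
                "OcelEventId": event["OcelEventId"],
            })
    # ISO 8601 strings sort chronologically, so plain string ordering is enough here.
    rows.sort(key=lambda r: (r["OcelObjectId"], r["OcelEventTime"], r["OcelEventId"]))
    return rows


# Invented sample data: one purchase order with a related invoice.
objects = [
    {"OcelObjectId": "po-1", "OcelObjectType": "Purchase Order"},
    {"OcelObjectId": "inv-1", "OcelObjectType": "Invoice"},
]
events = [
    {"OcelEventId": "e1", "OcelEventType": "Create Purchase Order",
     "OcelEventTime": "2024-01-02T09:00:00"},
    {"OcelEventId": "e2", "OcelEventType": "Insert Invoice",
     "OcelEventTime": "2024-01-05T14:30:00"},
]
object_to_object = [
    {"OcelObjectObjectSourceId": "inv-1", "OcelObjectObjectTargetId": "po-1"},
]
event_to_object = [
    {"OcelEventObjectSourceId": "po-1", "OcelEventObjectTargetId": "e1"},
    {"OcelEventObjectSourceId": "inv-1", "OcelEventObjectTargetId": "e2"},
]

for steps in (0, 1):
    rows = build_perspective(objects, events, object_to_object, event_to_object,
                             "Purchase Order", relation_steps=steps)
    print(steps, [row["OcelEventId"] for row in rows])
```

With zero relation steps, only the event attached directly to the purchase order appears in its case. With one step, the invoice event reached through the object-object relation is included as well.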

Differences to OCEL 2.0 standard

Object-centric models in QPR ProcessAnalyzer mainly follow the OCEL 2.0 standard, with the following differences:

  • Changes of object attribute values over time are not supported.
  • The ocel_time field of each event type table is moved to the events datatable (as every event has a timestamp).
  • The *_map_type columns are not needed, as the model settings serve the same purpose.
  • Object type tables: If OcelObjectTypeChangedField is not null, all the other field values are copied from the previous entry of the same object, except:
    • OcelObjectTypeChangedField, which has the names of the changed fields as a comma-separated string.
    • The actual changed field, which has the new value.
    • OcelObjectTypeTime, which has the timestamp when the value changed.
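
The row convention for object type tables in the last item can be illustrated with a short Python sketch. The helper make_change_row and the attribute columns Amount and Status are hypothetical, and since these columns are not used currently, the sketch only shows how such a row would be laid out in the data.

```python
def make_change_row(previous_row, changes, changed_at):
    """Build the next row for an object after some of its attributes changed.

    Hypothetical helper: copies all values from the previous entry, then
    overwrites the changed fields, the changed-field list and the timestamp.
    """
    new_row = dict(previous_row)
    new_row.update(changes)
    # Names of the changed fields as a comma-separated string.
    new_row["OcelObjectTypeChangedField"] = ",".join(changes)
    new_row["OcelObjectTypeTime"] = changed_at
    return new_row


initial = {
    "OcelObjectTypeObjectId": "inv-1",
    "OcelObjectTypeTime": None,
    "OcelObjectTypeChangedField": None,
    "Amount": 120.0,
    "Status": "Open",
}
changed = make_change_row(initial, {"Status": "Paid"}, "2024-02-01T00:00:00")
print(changed)
```

The unchanged Amount value is carried over from the previous entry, while Status, OcelObjectTypeChangedField and OcelObjectTypeTime describe the change.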