Create Object-centric Model from Relational Schema: Difference between revisions

From QPR ProcessAnalyzer Wiki
Jump to navigation Jump to search
 
(36 intermediate revisions by the same user not shown)
Line 1: Line 1:
This page instructs how to create an [[Object-centric_Process_Mining_Model|object-centric model]] from a any relational data schema. The relational data schema contains tables for each object and relations between the tables are described using foreign keys. Other datatables needed by the object-centric model are automatically created by a script from the object tables. The object-centric model is built with following principles:
This page describes how to create an [[Object-centric_Process_Mining_Model|object-centric model]] from a relational data schema. Tables in the relational data are either objects or events in the object-centric model, and relations between the tables are described using foreign keys (which will become the object-to-object and event-to-object relations). The rest of the datatables needed by the object-centric model are created automatically by a script from the object type and event type tables.
* Each table is an object type.
* Each table row is an object.
* Each timestamp in the object table is an event.
** The event type name is the column name.
** The event is linked to the object where the timestamp is as an object attribute (only one object is linked to each event).
* Foreign keys define the object-to-object relations.


== Model creation principle ==
The object-centric model is built with following principles:
* Each datatable is either an [[Object-centric_Process_Mining_Model#Object-centric_model_structure|object type or event type datatable]].
* For object type tables:
** Each table row is an object.
** ''OcelObjectTypeObjectId'' column contains the object ID.
** Each timestamp in the table is an event.
*** The event type name is the column name.
*** The event is linked to the object where the timestamp is as an object attribute (only one object is linked to each event).
** Foreign keys between the object type tables define the object-to-object relations.
* For event type tables:
** Each row is an event
** The left-most date type of column contains the event timestamp.
** ''OcelEventTypeEventId'' column contains the event ID.
** Column with a foreign key to an object type table's ''OcelObjectTypeObjectId'' column is the link to the object.
== Model creation steps ==
To create an object-centric model using this method, proceed as follows:
To create an object-centric model using this method, proceed as follows:
# Create the object tables.
# Create the object types and event types tables. Each object type table needs to have the column named '''OcelObjectTypeObjectId''' containing unique ID's for objects. Each event type table needs to have the column named '''OcelEventTypeEventId''' containing unique ID's for events.  
# Define relations between the object tables using [[#Define_foreign_keys|foreign keys]].
# Define relations between the object type and event type tables using [[#Define_foreign_keys|foreign keys]].
# Create an [[Object-centric_Process_Mining_Model#Create_object-centric_model|object-centric model]] and [[Object-centric_Process_Mining_Model#Configure_object-centric_model|define datatables]] for the model. For the ''ObjectTypes'' tables, define the existing tables created previously. In addition, define the ''Objects'', ''Events'', ''ObjectToObject'', and ''EventToObject'' tables. These datatables don't exist at this point but they are created automatically by the script.
# Create an [[Object-centric_Process_Mining_Model#Create_object-centric_model|object-centric model]] and [[Object-centric_Process_Mining_Model#Configure_object-centric_model|define datatables]] for the model. For the ''Object types'' and ''Event types'' tables, define the tables created previously. In addition, define names for the ''Objects'', ''Events'', ''ObjectToObject'', and ''EventToObject'' datatables (these datatables will be created by the script).
# Run the [[#Update_object-centric_model_tables|script]] to update the object-centric model datatables.
# Run the [[#Update_object-centric_model_datatables|script]] to create the rest of the object-centric model datatables.


== Define foreign keys ==
== Define foreign keys ==
Foreign keys define how the object tables (and thus individual objects) are linked together. The object-to-object relations in the object-centric model will be based on them. Foreign keys can be defined using the expression language script. Here is an example script that sets some foreign keys:
Foreign keys define how the object type tables (and thus individual objects) are linked together. The object-to-object relations in the object-centric model will be based on them. Foreign keys can be defined using the expression language. Here is an example script that sets some foreign keys:
<syntaxhighlight lang="typescript" line>
<syntaxhighlight lang="typescript" line>
// Set foreign key from datatable "Sales Order Item" to datatable "Sales Order Header". Source and target datatable column name is "VBELN".
// Set foreign key from datatable "Sales Order Item" to datatable "Sales Order Header". Source and target datatable column name is "VBELN".
Line 41: Line 52:
     ],
     ],
     "TargetDatatable": "Sales Order Item"
     "TargetDatatable": "Sales Order Item"
  }
]);
Project.DatatableByName("Sales Order Changed").SetForeignKeys([
  #{
    "Columns": [
      #{ "Source": "Sales Order ID", "Target": "OcelObjectTypeObjectId" }
    ],
    "TargetDatatable": "Sales Order Header"
   }
   }
]);
]);
</syntaxhighlight>
</syntaxhighlight>


== Update object-centric model datatables ==
== Create object-centric model datatables ==
Following script uses the ''object types'' datatables defined for the object-centric model as the source data, and creates the ''events'', ''objects'', ''event-to-object'', and ''object-to-object'' tables (if they don't exist). The script can also be used to update existing datatables when the data in the object types datatables changes. Note that the ''event types tables'' are not needed at all and thus they are not created by this script. The script assumes that there is only one object-centric model in the project where the script is run.
This script uses the ''object types'' and ''event types'' datatables defined for the object-centric model as the source data to create the ''events'', ''objects'', ''event-to-object'', and ''object-to-object'' tables (if they don't exist). The script can also be used to update existing datatables when the data in the object types datatables changes. The script assumes that there is only one object-centric model in the project where the script is run.


<syntaxhighlight lang="typescript" line>
<syntaxhighlight lang="typescript" line>
let modelObject = Project.Models.Where(IsOcelModel);
let modelObject = Project.Models.Where(IsOcelModel);
if (Count(modelObject) == 0) {
if (Count(modelObject) == 0) {
   throw "There is no object-centric model in the project.";
   throw "There is no object-centric model in the project - there should be one model.";
} else if (Count(modelObject) > 1) {
} else if (Count(modelObject) > 1) {
   throw "There are several object-centric models in the project.";
   throw "There are several object-centric models in the project - there should be one model.";
}  
}
modelObject = modelObject[0];
modelObject = modelObject[0];
let objectTypeNames = modelObject.Configuration.OcelDataSource.ObjectTypes.Keys;
let COLUMN_EVENTS_EVENT_ID = "OcelEventId";
let COLUMN_EVENTS_TYPE = "OcelEventType";
let COLUMN_EVENTS_TIME = "OcelEventTime";
let COLUMN_OBJECTS_OBJECT_ID = "OcelObjectId";
let COLUMN_OBJECTS_TYPE = "OcelObjectType";
let COLUMN_EVENT_TYPES_EVENT_ID = "OcelEventTypeEventId";
let COLUMN_OBJECT_TYPES_OBJECT_ID = "OcelObjectTypeObjectId";
let COLUMN_EVENT_OBJECT_EVENT_ID = "OcelEventToObjectSourceId";
let COLUMN_EVENT_OBJECT_OBJECT_ID = "OcelEventToObjectTargetId";
let COLUMN_EVENT_OBJECT_QUALIFIER = "OcelEventToObjectQualifier";
let COLUMN_OBJECT_OBJECT_SOURCE_ID = "OcelObjectToObjectSourceId";
let COLUMN_OBJECT_OBJECT_TARGET_ID = "OcelObjectToObjectTargetId";
let COLUMN_OBJECT_OBJECT_QUALIFIER = "OcelObjectToObjectQualifier";
let COLUMN_OBJECT_TYPE = "Object Type";
let COLUMN_OBJECT_ID = "Object ID";


function getDatatableAndCreateIfNeeded(datatableName) {
function createOrGetDatatable(datatableName) {
   let dt = Project.DatatableByName(datatableName);
   let dt = Project.DatatableByName(datatableName);
   if (dt == null) {
   if (dt == null) {
Line 69: Line 103:
}
}


// Create Objects table
function importDataframeToDatatable(datatableName, df) {
  createOrGetDatatable(datatableName)
    .Truncate()
    .Import(df, #{ "ImportExistingColumnOnly": false });
}
 
// Add data to Objects table from ObjectTypes tables.
let objectTypeNames = modelObject.Configuration.OcelDataSource.ObjectTypes.Keys;
let allObjectsDataFrame;
let allObjectsDataFrame;
for (let i = 0; i < Count(objectTypeNames); i++) {
for (let i = 0; i < Count(objectTypeNames); i++) {
   let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeNames[i]];
   let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeNames[i]];
   let objectsDataFrame = getDatatableAndCreateIfNeeded(objectsDatatableName)
   let objectsDatatable = Project.DatatableByName(objectsDatatableName);
  if (Count(objectsDatatable.Columns.Where(Name == COLUMN_OBJECT_TYPES_OBJECT_ID)) == 0) {
    throw "Objects datatable \"" + objectsDatatableName + "\" should have a column named " + COLUMN_OBJECT_TYPES_OBJECT_ID + ".";
  }
  let objectsDataFrame = objectsDatatable
     .SqlDataFrame
     .SqlDataFrame
     .Select(["OcelObjectTypeObjectId"])
     .Select([COLUMN_OBJECT_TYPES_OBJECT_ID])
     .RenameColumns(["OcelObjectId": "OcelObjectTypeObjectId"])
     .RenameColumns([COLUMN_OBJECTS_OBJECT_ID: COLUMN_OBJECT_TYPES_OBJECT_ID])
     .WithColumn("OcelObjectType", #expr{objectTypeNames[i]})
     .WithColumn(COLUMN_OBJECTS_TYPE, #expr{ objectTypeNames[i] })
     .Select(["OcelObjectId", "OcelObjectType"]);
     .Select([COLUMN_OBJECTS_OBJECT_ID, COLUMN_OBJECTS_TYPE]);
   if (allObjectsDataFrame == _EMPTY) {
   if (allObjectsDataFrame == _EMPTY) {
    allObjectsDataFrame = objectsDataFrame;
    allObjectsDataFrame = objectsDataFrame;
   } else {
   } else {
     allObjectsDataFrame = allObjectsDataFrame.Append(objectsDataFrame);
     allObjectsDataFrame = allObjectsDataFrame.Append(objectsDataFrame);
   }
   }
}
}
getDatatableAndCreateIfNeeded(modelObject.Configuration.OcelDataSource.Objects)
  .Truncate()
  .Import(allObjectsDataFrame, #{ "ImportExistingColumnOnly": false });


// Create Events table
// Add data to Events table from ObjectTypes tables.
let allEventsDataFrame;
let allEventsDataFrame;
for (let i = 0; i < Count(objectTypeNames); i++) {
for (let i = 0; i < Count(objectTypeNames); i++) {
   let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeNames[i]];
  let objectTypeName = objectTypeNames[i];
   let objectsDatatable = getDatatableAndCreateIfNeeded(objectsDatatableName);
   let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeName];
   let objectsDatatable = Project.DatatableByName(objectsDatatableName);
   let dateColumns = objectsDatatable.Columns.Where(DataType == "DateTime").Name;
   let dateColumns = objectsDatatable.Columns.Where(DataType == "DateTime").Name;
   for (let j = 0; j < Count(dateColumns); j++) {
   for (let j = 0; j < Count(dateColumns); j++) {
Line 99: Line 142:
     let eventsDataFrame = objectsDatatable
     let eventsDataFrame = objectsDatatable
       .SqlDataFrame
       .SqlDataFrame
       .Where(Column(#expr{dateColumn}) != null)
       .Where(Column(#expr{ dateColumn }) != null)
       .Select(["OcelObjectTypeObjectId", #expr{dateColumn}])
       .Select([COLUMN_OBJECT_TYPES_OBJECT_ID, #expr{ dateColumn }])
       .WithColumn("OcelEventId", Concat(Column("OcelObjectTypeObjectId"), "_", #expr{dateColumn}))
       .WithColumn(COLUMN_EVENTS_EVENT_ID, Concat(Column(COLUMN_OBJECT_TYPES_OBJECT_ID), "_", #expr{ dateColumn }))
       .WithColumn("OcelEventType", #expr{dateColumn})
       .WithColumn(COLUMN_EVENTS_TYPE, #expr{ dateColumn })
       .RenameColumns(["OcelEventTime": #expr{dateColumn}])
       .RenameColumns([COLUMN_EVENTS_TIME: #expr{ dateColumn }])
       .WithColumn("Object Type", #expr{objectTypeNames[i]})
       .WithColumn(COLUMN_OBJECT_TYPE, #expr{ objectTypeName })
       .WithColumn("Object ID", Column("OcelObjectTypeObjectId"))
       .WithColumn(COLUMN_OBJECT_ID, Column(COLUMN_OBJECT_TYPES_OBJECT_ID))
       .Select(["OcelEventId", "OcelEventType", "OcelEventTime", "Object Type", "Object ID"]);
       .Select([COLUMN_EVENTS_EVENT_ID, COLUMN_EVENTS_TYPE, COLUMN_EVENTS_TIME, COLUMN_OBJECT_TYPE, COLUMN_OBJECT_ID]);
     if (allEventsDataFrame == _EMPTY) {
     if (allEventsDataFrame == _EMPTY) {
       allEventsDataFrame = eventsDataFrame;
       allEventsDataFrame = eventsDataFrame;
Line 114: Line 157:
   }
   }
}
}
getDatatableAndCreateIfNeeded(modelObject.Configuration.OcelDataSource.Events)
  .Truncate()
  .Import(allEventsDataFrame, #{ "ImportExistingColumnOnly": false });


// Create Event-to-Object table
// Add data to Events table from EventTypes tables.
let eventTypeNames = modelObject.Configuration.OcelDataSource.TryGetValue("EventTypes").Keys;
for (let i = 0; i < Count(eventTypeNames); i++) {
  let eventTypeName = eventTypeNames[i];
  let eventsDatatableName = modelObject.Configuration.OcelDataSource.EventTypes[eventTypeName];
  let eventsDatatable = Project.DatatableByName(eventsDatatableName);
  if (Count(eventsDatatable.Columns.Where(Name == COLUMN_EVENT_TYPES_EVENT_ID)) == 0) {
    throw "Events datatable \"" + eventsDatatableName + "\" should have a column named " + COLUMN_EVENT_TYPES_EVENT_ID + ".";
  }
  let dateColumns = eventsDatatable.Columns.Where(DataType == "DateTime").Name;
  if (Count(dateColumns) == 0) {
    throw "EventTypes datatable \"" + eventsDatatableName + "\" should have a date type of column for timestamps.";
  }
 
  let foreignKeys = eventsDatatable.ForeignKeys;
  if (Count(foreignKeys) == 0) {
    throw "Foreign key to the ObjectType table is missing from EventTypes datatable \"" + eventsDatatableName + "\".";
  }
  for (let j = 0; j < Count(foreignKeys); j++) {
    if (Count(foreignKeys[j].Columns) > 1) {
      throw "Multi-column foreign keys not supported for event type tables.";
    }
    let objectsTable = foreignKeys[j].TargetDatatable;
    let sourceColumn = foreignKeys[j].Columns[0].Source;
    let targetColumn = foreignKeys[j].Columns[0].Target;
    let objectsDatatableNames = modelObject.Configuration.OcelDataSource.ObjectTypes.Values;
    if (!objectsTable.In(objectsDatatableNames)) {
      throw "Foreign key in the EventTypes datatable \"" + eventsDatatableName + "\" is not referring to an object type table.";
    }
    if (targetColumn != COLUMN_OBJECT_TYPES_OBJECT_ID) {
      throw "Foreign key in the EventTypes datatable \"" + eventsDatatableName + "\" is not referring to an OcelObjectTypeObjectId column.";
    }
 
    // Define expression to generate the event type name. Examples (add between #sql{ ... }):
    // #expr{eventTypeName}
    // #expr{dateColumns[0]}
    // Column("OtherColumn")
    // Concat( Column("OtherColumn"), ": ", #expr{eventTypeName} )
    let eventTypeNameExpression = ToSqlExpression(#sql{ #expr{eventTypeName} });
 
    let eventsDataFrame = eventsDatatable
      .SqlDataFrame
      .RenameColumns([COLUMN_EVENTS_EVENT_ID: COLUMN_EVENT_TYPES_EVENT_ID])
      .WithColumn(COLUMN_EVENTS_TYPE, #expr{ eventTypeNameExpression })
      .WithColumn(COLUMN_EVENTS_TIME, Column(dateColumns[0]))
      .WithColumn(COLUMN_OBJECT_TYPE, #expr{ objectsTable })
      .WithColumn(COLUMN_OBJECT_ID, Column(sourceColumn))
      .Select([COLUMN_EVENTS_EVENT_ID, COLUMN_EVENTS_TYPE, COLUMN_EVENTS_TIME, COLUMN_OBJECT_TYPE, COLUMN_OBJECT_ID]);
    if (allEventsDataFrame == _EMPTY) {
      allEventsDataFrame = eventsDataFrame;
    } else {
      allEventsDataFrame = allEventsDataFrame.Append(eventsDataFrame);
    }
  }
}
 
// Add event type tables columns as common event attributes.
for (let i = 0; i < Count(eventTypeNames); i++) {
  let eventsDatatableName = modelObject.Configuration.OcelDataSource.EventTypes[eventTypeNames[i]];
  let eventsDatatable = Project.DatatableByName(eventsDatatableName);
 
  let foreignKeys = eventsDatatable.ForeignKeys;
  let eventAttributeColumns = eventsDatatable.Columns.Name;
  for (let j = 0; j < Count(foreignKeys); j++) {
    eventAttributeColumns = Except(eventAttributeColumns, foreignKeys[j].Columns[0].Source);
  }
  eventAttributeColumns = Except(eventAttributeColumns, [(eventsDatatable.Columns.Where(DataType == "DateTime").Name)[0]]);
 
  if (Count(eventAttributeColumns) > 0) {
    let eventTypeDataFrame = eventsDatatable.SqlDataFrame
      .Select(eventAttributeColumns);
    allEventsDataFrame = allEventsDataFrame
      .Join(eventTypeDataFrame, [COLUMN_EVENTS_EVENT_ID: COLUMN_EVENT_TYPES_EVENT_ID], "leftouter")
      .RemoveColumns([COLUMN_EVENT_TYPES_EVENT_ID]);
  }
}
 
// Add data to Event-to-Object table from ObjectTypes tables.
let allEventToObjectDataFrame;
let allEventToObjectDataFrame;
for (let i = 0; i < Count(objectTypeNames); i++) {
for (let i = 0; i < Count(objectTypeNames); i++) {
   let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeNames[i]];
   let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeNames[i]];
   let objectsDatatable = getDatatableAndCreateIfNeeded(objectsDatatableName);
   let objectsDatatable = Project.DatatableByName(objectsDatatableName);
   let dateColumns = objectsDatatable.Columns.Where(DataType == "DateTime").Name;
   let dateColumns = objectsDatatable.Columns.Where(DataType == "DateTime").Name;
   for (let j = 0; j < Count(dateColumns); j++) {
   for (let j = 0; j < Count(dateColumns); j++) {
Line 128: Line 245:
     let eventToObjectDataFrame = objectsDatatable
     let eventToObjectDataFrame = objectsDatatable
       .SqlDataFrame
       .SqlDataFrame
       .Where(Column(#expr{dateColumn}) != null)
       .Where(Column(#expr{ dateColumn }) != null)
       .Select(["OcelObjectTypeObjectId"])
       .Select([COLUMN_OBJECT_TYPES_OBJECT_ID])
       .WithColumn("OcelEventToObjectSourceId", Concat(Column("OcelObjectTypeObjectId"), "_", #expr{dateColumn}))
       .WithColumn(COLUMN_EVENT_OBJECT_EVENT_ID, Concat(Column(COLUMN_OBJECT_TYPES_OBJECT_ID), "_", #expr{ dateColumn }))
       .RenameColumns(["OcelEventToObjectTargetId": "OcelObjectTypeObjectId"])
       .RenameColumns([COLUMN_EVENT_OBJECT_OBJECT_ID: COLUMN_OBJECT_TYPES_OBJECT_ID])
       .WithColumn("OcelEventToObjectQualifier", "")
       .WithColumn(COLUMN_EVENT_OBJECT_QUALIFIER, "")
       .Select(["OcelEventToObjectSourceId", "OcelEventToObjectTargetId", "OcelEventToObjectQualifier"]);
       .Select([COLUMN_EVENT_OBJECT_EVENT_ID, COLUMN_EVENT_OBJECT_OBJECT_ID, COLUMN_EVENT_OBJECT_QUALIFIER]);
     if (allEventToObjectDataFrame == _EMPTY) {
     if (allEventToObjectDataFrame == _EMPTY) {
       allEventToObjectDataFrame = eventToObjectDataFrame;
       allEventToObjectDataFrame = eventToObjectDataFrame;
Line 141: Line 258:
   }
   }
}
}
getDatatableAndCreateIfNeeded(modelObject.Configuration.OcelDataSource.EventToObject)
  .Truncate()
  .Import(allEventToObjectDataFrame, #{ "ImportExistingColumnOnly": false });


// Create Object-to-object table
// Add data to Event-to-Object table from EventTypes tables.
for (let i = 0; i < Count(eventTypeNames); i++) {
  let eventsDatatableName = modelObject.Configuration.OcelDataSource.EventTypes[eventTypeNames[i]];
  let eventsDatatable = Project.DatatableByName(eventsDatatableName);
  let foreignKeys = eventsDatatable.ForeignKeys;
  for (let j = 0; j < Count(foreignKeys); j++) {
    let objectIdColumn = foreignKeys[j].Columns[0].Source;
    let eventToObjectDataFrame = eventsDatatable
      .SqlDataFrame
      .RenameColumns([COLUMN_EVENT_OBJECT_EVENT_ID: COLUMN_EVENT_TYPES_EVENT_ID])
      .WithColumn(COLUMN_EVENT_OBJECT_OBJECT_ID, Column(objectIdColumn))
      .WithColumn(COLUMN_EVENT_OBJECT_QUALIFIER, "")
      .Select([COLUMN_EVENT_OBJECT_EVENT_ID, COLUMN_EVENT_OBJECT_OBJECT_ID, COLUMN_EVENT_OBJECT_QUALIFIER]);
    if (allEventToObjectDataFrame == _EMPTY) {
      allEventToObjectDataFrame = eventToObjectDataFrame;
    } else {
      allEventToObjectDataFrame = allEventToObjectDataFrame.Append(eventToObjectDataFrame);
    }
  }
}
 
// Add data to Object-to-object table from ObjectTypes tables.
let TARGET_COLUMN_PREFIX = "Target";
let allObjectToObjectDataFrame;
let allObjectToObjectDataFrame;
for (let i = 0; i < Count(objectTypeNames); i++) {
for (let i = 0; i < Count(objectTypeNames); i++) {
   let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeNames[i]];
   let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeNames[i]];
   let objectsDatatable = getDatatableAndCreateIfNeeded(objectsDatatableName);
   let objectsDatatable = Project.DatatableByName(objectsDatatableName);
   let foreignKeys = objectsDatatable.ForeignKeys;
   let foreignKeys = objectsDatatable.ForeignKeys;
   for (let j = 0; j < Count(foreignKeys); j++) {
   for (let j = 0; j < Count(foreignKeys); j++) {
     let joinedColumns = foreignKeys[j].Columns;
     let joinedColumns = foreignKeys[j].Columns;
     let targetDataFrame = Project.DatatableByName(foreignKeys[j].TargetDatatable).SqlDataFrame
     let targetDataFrame = Project.DatatableByName(foreignKeys[j].TargetDatatable).SqlDataFrame
       .Select(Flatten(["OcelObjectTypeObjectId", joinedColumns.Target]))
       .Select(Distinct(Flatten([COLUMN_OBJECT_TYPES_OBJECT_ID, joinedColumns.Target])))
       .RenameColumns(Flatten(["OcelObjectTypeObjectId", joinedColumns.Target]).{let col = _; [("Target" + col): col]}[0]);
       .RenameColumns(Flatten([COLUMN_OBJECT_TYPES_OBJECT_ID, joinedColumns.Target]).{ let col = _;[(TARGET_COLUMN_PREFIX + col): col] }[0]);
     let objectToObjectDataFrame = objectsDatatable.SqlDataFrame
     let objectToObjectDataFrame = objectsDatatable.SqlDataFrame
       .Select(Flatten(["OcelObjectTypeObjectId", joinedColumns.Source]))
       .Select(Distinct(Flatten([COLUMN_OBJECT_TYPES_OBJECT_ID, joinedColumns.Source])))
       .Join(targetDataFrame, joinedColumns.{let col = _; [col.Source: ("Target" + col.Target)]}[0], "inner")
       .Join(targetDataFrame, joinedColumns.{ let col = _;[col.Source: (TARGET_COLUMN_PREFIX + col.Target)] }[0], "inner")
       .WithColumn("OcelObjectToObjectSourceId", Column("OcelObjectTypeObjectId"))
       .WithColumn(COLUMN_OBJECT_OBJECT_SOURCE_ID, Column(COLUMN_OBJECT_TYPES_OBJECT_ID))
       .WithColumn("OcelObjectToObjectTargetId", Column("TargetOcelObjectTypeObjectId"))
       .WithColumn(COLUMN_OBJECT_OBJECT_TARGET_ID, Column(TARGET_COLUMN_PREFIX + COLUMN_OBJECT_TYPES_OBJECT_ID))
       .WithColumn("OcelObjectToObjectQualifier", "")
       .WithColumn(COLUMN_OBJECT_OBJECT_QUALIFIER, "")
       .Select(["OcelObjectToObjectSourceId", "OcelObjectToObjectTargetId", "OcelObjectToObjectQualifier"]);
       .Select([COLUMN_OBJECT_OBJECT_SOURCE_ID, COLUMN_OBJECT_OBJECT_TARGET_ID, COLUMN_OBJECT_OBJECT_QUALIFIER]);
     if (allObjectToObjectDataFrame == _EMPTY) {
     if (allObjectToObjectDataFrame == _EMPTY) {
       allObjectToObjectDataFrame = objectToObjectDataFrame;
       allObjectToObjectDataFrame = objectToObjectDataFrame;
     } else {
     } else {
       allObjectToObjectDataFrame = allObjectToObjectDataFrame.Append(objectToObjectDataFrame);
       allObjectToObjectDataFrame = allObjectToObjectDataFrame.Append(objectToObjectDataFrame);
  }
    }
   }
   }
}
}
getDatatableAndCreateIfNeeded(modelObject.Configuration.OcelDataSource.ObjectToObject)
 
  .Truncate()
importDataframeToDatatable(modelObject.Configuration.OcelDataSource.Objects, allObjectsDataFrame);
  .Import(allObjectToObjectDataFrame, #{ "ImportExistingColumnOnly": false });
importDataframeToDatatable(modelObject.Configuration.OcelDataSource.Events, allEventsDataFrame);
importDataframeToDatatable(modelObject.Configuration.OcelDataSource.EventToObject, allEventToObjectDataFrame);
importDataframeToDatatable(modelObject.Configuration.OcelDataSource.ObjectToObject, allObjectToObjectDataFrame);
</syntaxhighlight>
</syntaxhighlight>

Latest revision as of 18:22, 30 October 2025

This page describes how to create an object-centric model from a relational data schema. Tables in the relational data are either objects or events in the object-centric model, and relations between the tables are described using foreign keys (which will become the object-to-object and event-to-object relations). The rest of the datatables needed by the object-centric model are created automatically by a script from the object type and event type tables.

Model creation principle

The object-centric model is built with following principles:

  • Each datatable is either an object type or event type datatable.
  • For object type tables:
    • Each table row is an object.
    • OcelObjectTypeObjectId column contains the object ID.
    • Each timestamp in the table is an event.
      • The event type name is the column name.
      • The event is linked to the object where the timestamp is as an object attribute (only one object is linked to each event).
    • Foreign keys between the object type tables define the object-to-object relations.
  • For event type tables:
    • Each row is an event
    • The left-most date type of column contains the event timestamp.
    • OcelEventTypeEventId column contains the event ID.
    • Column with a foreign key to an object type table's OcelObjectTypeObjectId column is the link to the object.

Model creation steps

To create an object-centric model using this method, proceed as follows:

  1. Create the object types and event types tables. Each object type table needs to have the column named OcelObjectTypeObjectId containing unique ID's for objects. Each event type table needs to have the column named OcelEventTypeEventId containing unique ID's for events.
  2. Define relations between the object type and event type tables using foreign keys.
  3. Create an object-centric model and define datatables for the model. For the Object types and Event types tables, define the tables created previously. In addition, define names for the Objects, Events, ObjectToObject, and EventToObject datatables (these datatables will be created by the script).
  4. Run the script to create the rest of the object-centric model datatables.

Define foreign keys

Foreign keys define how the object type tables (and thus individual objects) are linked together. The object-to-object relations in the object-centric model will be based on them. Foreign keys can be defined using the expression language. Here is an example script that sets some foreign keys:

// Set foreign key from datatable "Sales Order Item" to datatable "Sales Order Header". Source and target datatable column name is "VBELN".
Project.DatatableByName("Sales Order Item").SetForeignKeys([
  #{
    "Columns": [
      #{ "Source": "VBELN", "Target": "VBELN" }
    ],
    "TargetDatatable": "Sales Order Header"
  }
]);

// Set foreign key from datatable "Delivery Item" to datatable "Delivery Header". Source and target datatable column name is "VBELN".
// Set foreign key from datatable "Delivery Item" to datatable "Sales Order Item". Source datatable columns are "VGBEL" and "VGPOS", and corresponding columns in the target datatable are "VBELN" and "POSNR".
Project.DatatableByName("Delivery Item").SetForeignKeys([
  #{
    "Columns": [
      #{ "Source": "VBELN", "Target": "VBELN" }
    ],
    "TargetDatatable": "Delivery Header"
  },
  #{
    "Columns": [
      #{ "Source": "VGBEL", "Target": "VBELN" },
      #{ "Source": "VGPOS", "Target": "POSNR" }
    ],
    "TargetDatatable": "Sales Order Item"
  }
]);

Project.DatatableByName("Sales Order Changed").SetForeignKeys([
  #{
    "Columns": [
      #{ "Source": "Sales Order ID", "Target": "OcelObjectTypeObjectId" }
    ],
    "TargetDatatable": "Sales Order Header"
  }
]);

Create object-centric model datatables

This script uses the object types and event types datatables defined for the object-centric model as the source data to create the events, objects, event-to-object, and object-to-object tables (if they don't exist). The script can also be used to update existing datatables when the data in the object types datatables changes. The script assumes that there is only one object-centric model in the project where the script is run.

let modelObject = Project.Models.Where(IsOcelModel);
if (Count(modelObject) == 0) {
  throw "There is no object-centric model in the project - there should be one model.";
} else if (Count(modelObject) > 1) {
  throw "There are several object-centric models in the project - there should be one model.";
}
modelObject = modelObject[0];
let COLUMN_EVENTS_EVENT_ID = "OcelEventId";
let COLUMN_EVENTS_TYPE = "OcelEventType";
let COLUMN_EVENTS_TIME = "OcelEventTime";
let COLUMN_OBJECTS_OBJECT_ID = "OcelObjectId";
let COLUMN_OBJECTS_TYPE = "OcelObjectType";
let COLUMN_EVENT_TYPES_EVENT_ID = "OcelEventTypeEventId";
let COLUMN_OBJECT_TYPES_OBJECT_ID = "OcelObjectTypeObjectId";
let COLUMN_EVENT_OBJECT_EVENT_ID = "OcelEventToObjectSourceId";
let COLUMN_EVENT_OBJECT_OBJECT_ID = "OcelEventToObjectTargetId";
let COLUMN_EVENT_OBJECT_QUALIFIER = "OcelEventToObjectQualifier";
let COLUMN_OBJECT_OBJECT_SOURCE_ID = "OcelObjectToObjectSourceId";
let COLUMN_OBJECT_OBJECT_TARGET_ID = "OcelObjectToObjectTargetId";
let COLUMN_OBJECT_OBJECT_QUALIFIER = "OcelObjectToObjectQualifier";
let COLUMN_OBJECT_TYPE = "Object Type";
let COLUMN_OBJECT_ID = "Object ID";

function createOrGetDatatable(datatableName) {
  let dt = Project.DatatableByName(datatableName);
  if (dt == null) {
    dt = Project.CreateDatatable(#{
      "Name": datatableName,
      "Type": "Snowflake"
    });
  }
  return dt;
}

function importDataframeToDatatable(datatableName, df) {
  createOrGetDatatable(datatableName)
    .Truncate()
    .Import(df, #{ "ImportExistingColumnOnly": false });
}

// Add data to Objects table from ObjectTypes tables.
let objectTypeNames = modelObject.Configuration.OcelDataSource.ObjectTypes.Keys;
let allObjectsDataFrame;
for (let i = 0; i < Count(objectTypeNames); i++) {
  let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeNames[i]];
  let objectsDatatable = Project.DatatableByName(objectsDatatableName);
  if (Count(objectsDatatable.Columns.Where(Name == COLUMN_OBJECT_TYPES_OBJECT_ID)) == 0) {
    throw "Objects datatable \"" + objectsDatatableName + "\" should have a column named " + COLUMN_OBJECT_TYPES_OBJECT_ID + ".";
  }
  let objectsDataFrame = objectsDatatable
    .SqlDataFrame
    .Select([COLUMN_OBJECT_TYPES_OBJECT_ID])
    .RenameColumns([COLUMN_OBJECTS_OBJECT_ID: COLUMN_OBJECT_TYPES_OBJECT_ID])
    .WithColumn(COLUMN_OBJECTS_TYPE, #expr{ objectTypeNames[i] })
    .Select([COLUMN_OBJECTS_OBJECT_ID, COLUMN_OBJECTS_TYPE]);
  if (allObjectsDataFrame == _EMPTY) {
    allObjectsDataFrame = objectsDataFrame;
  } else {
    allObjectsDataFrame = allObjectsDataFrame.Append(objectsDataFrame);
  }
}

// Add data to Events table from ObjectTypes tables.
let allEventsDataFrame;
for (let i = 0; i < Count(objectTypeNames); i++) {
  let objectTypeName = objectTypeNames[i];
  let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeName];
  let objectsDatatable = Project.DatatableByName(objectsDatatableName);
  let dateColumns = objectsDatatable.Columns.Where(DataType == "DateTime").Name;
  for (let j = 0; j < Count(dateColumns); j++) {
    let dateColumn = dateColumns[j];
    let eventsDataFrame = objectsDatatable
      .SqlDataFrame
      .Where(Column(#expr{ dateColumn }) != null)
      .Select([COLUMN_OBJECT_TYPES_OBJECT_ID, #expr{ dateColumn }])
      .WithColumn(COLUMN_EVENTS_EVENT_ID, Concat(Column(COLUMN_OBJECT_TYPES_OBJECT_ID), "_", #expr{ dateColumn }))
      .WithColumn(COLUMN_EVENTS_TYPE, #expr{ dateColumn })
      .RenameColumns([COLUMN_EVENTS_TIME: #expr{ dateColumn }])
      .WithColumn(COLUMN_OBJECT_TYPE, #expr{ objectTypeName })
      .WithColumn(COLUMN_OBJECT_ID, Column(COLUMN_OBJECT_TYPES_OBJECT_ID))
      .Select([COLUMN_EVENTS_EVENT_ID, COLUMN_EVENTS_TYPE, COLUMN_EVENTS_TIME, COLUMN_OBJECT_TYPE, COLUMN_OBJECT_ID]);
    if (allEventsDataFrame == _EMPTY) {
      allEventsDataFrame = eventsDataFrame;
    } else {
      allEventsDataFrame = allEventsDataFrame.Append(eventsDataFrame);
    }
  }
}

// Add data to Events table from EventTypes tables.
let eventTypeNames = modelObject.Configuration.OcelDataSource.TryGetValue("EventTypes").Keys;
for (let i = 0; i < Count(eventTypeNames); i++) {
  let eventTypeName = eventTypeNames[i];
  let eventsDatatableName = modelObject.Configuration.OcelDataSource.EventTypes[eventTypeName];
  let eventsDatatable = Project.DatatableByName(eventsDatatableName);
  if (Count(eventsDatatable.Columns.Where(Name == COLUMN_EVENT_TYPES_EVENT_ID)) == 0) {
    throw "Events datatable \"" + eventsDatatableName + "\" should have a column named " + COLUMN_EVENT_TYPES_EVENT_ID + ".";
  }
  let dateColumns = eventsDatatable.Columns.Where(DataType == "DateTime").Name;
  if (Count(dateColumns) == 0) {
    throw "EventTypes datatable \"" + eventsDatatableName + "\" should have a date type of column for timestamps.";
  }

  let foreignKeys = eventsDatatable.ForeignKeys;
  if (Count(foreignKeys) == 0) {
    throw "Foreign key to the ObjectType table is missing from EventTypes datatable \"" + eventsDatatableName + "\".";
  }
  for (let j = 0; j < Count(foreignKeys); j++) {
    if (Count(foreignKeys[j].Columns) > 1) {
      throw "Multi-column foreign keys not supported for event type tables.";
    }
    let objectsTable = foreignKeys[j].TargetDatatable;
    let sourceColumn = foreignKeys[j].Columns[0].Source;
    let targetColumn = foreignKeys[j].Columns[0].Target;
    let objectsDatatableNames = modelObject.Configuration.OcelDataSource.ObjectTypes.Values;
    if (!objectsTable.In(objectsDatatableNames)) {
      throw "Foreign key in the EventTypes datatable \"" + eventsDatatableName + "\" is not referring to an object type table.";
    }
    if (targetColumn != COLUMN_OBJECT_TYPES_OBJECT_ID) {
      throw "Foreign key in the EventTypes datatable \"" + eventsDatatableName + "\" is not referring to an OcelObjectTypeObjectId column.";
    }

    // Define expression to generate the event type name. Examples (add between #sql{ ... }):
    // #expr{eventTypeName}
    // #expr{dateColumns[0]}
    // Column("OtherColumn")
    // Concat( Column("OtherColumn"), ": ", #expr{eventTypeName} )
    let eventTypeNameExpression = ToSqlExpression(#sql{ #expr{eventTypeName} });

    let eventsDataFrame = eventsDatatable
      .SqlDataFrame
      .RenameColumns([COLUMN_EVENTS_EVENT_ID: COLUMN_EVENT_TYPES_EVENT_ID])
      .WithColumn(COLUMN_EVENTS_TYPE, #expr{ eventTypeNameExpression })
      .WithColumn(COLUMN_EVENTS_TIME, Column(dateColumns[0]))
      .WithColumn(COLUMN_OBJECT_TYPE, #expr{ objectsTable })
      .WithColumn(COLUMN_OBJECT_ID, Column(sourceColumn))
      .Select([COLUMN_EVENTS_EVENT_ID, COLUMN_EVENTS_TYPE, COLUMN_EVENTS_TIME, COLUMN_OBJECT_TYPE, COLUMN_OBJECT_ID]);
    if (allEventsDataFrame == _EMPTY) {
      allEventsDataFrame = eventsDataFrame;
    } else {
      allEventsDataFrame = allEventsDataFrame.Append(eventsDataFrame);
    }
  }
}

// Add event type tables columns as common event attributes.
for (let i = 0; i < Count(eventTypeNames); i++) {
  let eventsDatatableName = modelObject.Configuration.OcelDataSource.EventTypes[eventTypeNames[i]];
  let eventsDatatable = Project.DatatableByName(eventsDatatableName);

  let foreignKeys = eventsDatatable.ForeignKeys;
  let eventAttributeColumns = eventsDatatable.Columns.Name;
  for (let j = 0; j < Count(foreignKeys); j++) {
    eventAttributeColumns = Except(eventAttributeColumns, foreignKeys[j].Columns[0].Source);
  }
  eventAttributeColumns = Except(eventAttributeColumns, [(eventsDatatable.Columns.Where(DataType == "DateTime").Name)[0]]);

  if (Count(eventAttributeColumns) > 0) {
    let eventTypeDataFrame = eventsDatatable.SqlDataFrame
      .Select(eventAttributeColumns);
    allEventsDataFrame = allEventsDataFrame
      .Join(eventTypeDataFrame, [COLUMN_EVENTS_EVENT_ID: COLUMN_EVENT_TYPES_EVENT_ID], "leftouter")
      .RemoveColumns([COLUMN_EVENT_TYPES_EVENT_ID]);
  }
}

// Add data to Event-to-Object table from ObjectTypes tables.
let allEventToObjectDataFrame;
for (let i = 0; i < Count(objectTypeNames); i++) {
  let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeNames[i]];
  let objectsDatatable = Project.DatatableByName(objectsDatatableName);
  let dateColumns = objectsDatatable.Columns.Where(DataType == "DateTime").Name;
  for (let j = 0; j < Count(dateColumns); j++) {
    let dateColumn = dateColumns[j];
    let eventToObjectDataFrame = objectsDatatable
      .SqlDataFrame
      .Where(Column(#expr{ dateColumn }) != null)
      .Select([COLUMN_OBJECT_TYPES_OBJECT_ID])
      .WithColumn(COLUMN_EVENT_OBJECT_EVENT_ID, Concat(Column(COLUMN_OBJECT_TYPES_OBJECT_ID), "_", #expr{ dateColumn }))
      .RenameColumns([COLUMN_EVENT_OBJECT_OBJECT_ID: COLUMN_OBJECT_TYPES_OBJECT_ID])
      .WithColumn(COLUMN_EVENT_OBJECT_QUALIFIER, "")
      .Select([COLUMN_EVENT_OBJECT_EVENT_ID, COLUMN_EVENT_OBJECT_OBJECT_ID, COLUMN_EVENT_OBJECT_QUALIFIER]);
    if (allEventToObjectDataFrame == _EMPTY) {
      allEventToObjectDataFrame = eventToObjectDataFrame;
    } else {
      allEventToObjectDataFrame = allEventToObjectDataFrame.Append(eventToObjectDataFrame);
    }
  }
}

// Add data to Event-to-Object table from EventTypes tables.
for (let i = 0; i < Count(eventTypeNames); i++) {
  let eventsDatatableName = modelObject.Configuration.OcelDataSource.EventTypes[eventTypeNames[i]];
  let eventsDatatable = Project.DatatableByName(eventsDatatableName);
  let foreignKeys = eventsDatatable.ForeignKeys;
  for (let j = 0; j < Count(foreignKeys); j++) {
    let objectIdColumn = foreignKeys[j].Columns[0].Source;
    let eventToObjectDataFrame = eventsDatatable
      .SqlDataFrame
      .RenameColumns([COLUMN_EVENT_OBJECT_EVENT_ID: COLUMN_EVENT_TYPES_EVENT_ID])
      .WithColumn(COLUMN_EVENT_OBJECT_OBJECT_ID, Column(objectIdColumn))
      .WithColumn(COLUMN_EVENT_OBJECT_QUALIFIER, "")
      .Select([COLUMN_EVENT_OBJECT_EVENT_ID, COLUMN_EVENT_OBJECT_OBJECT_ID, COLUMN_EVENT_OBJECT_QUALIFIER]);
    if (allEventToObjectDataFrame == _EMPTY) {
      allEventToObjectDataFrame = eventToObjectDataFrame;
    } else {
      allEventToObjectDataFrame = allEventToObjectDataFrame.Append(eventToObjectDataFrame);
    }
  }
}

// Add data to Object-to-object table from ObjectTypes tables.
let TARGET_COLUMN_PREFIX = "Target";
let allObjectToObjectDataFrame;
for (let i = 0; i < Count(objectTypeNames); i++) {
  let objectsDatatableName = modelObject.Configuration.OcelDataSource.ObjectTypes[objectTypeNames[i]];
  let objectsDatatable = Project.DatatableByName(objectsDatatableName);
  let foreignKeys = objectsDatatable.ForeignKeys;
  for (let j = 0; j < Count(foreignKeys); j++) {
    let joinedColumns = foreignKeys[j].Columns;
    let targetDataFrame = Project.DatatableByName(foreignKeys[j].TargetDatatable).SqlDataFrame
      .Select(Distinct(Flatten([COLUMN_OBJECT_TYPES_OBJECT_ID, joinedColumns.Target])))
      .RenameColumns(Flatten([COLUMN_OBJECT_TYPES_OBJECT_ID, joinedColumns.Target]).{ let col = _;[(TARGET_COLUMN_PREFIX + col): col] }[0]);
    let objectToObjectDataFrame = objectsDatatable.SqlDataFrame
      .Select(Distinct(Flatten([COLUMN_OBJECT_TYPES_OBJECT_ID, joinedColumns.Source])))
      .Join(targetDataFrame, joinedColumns.{ let col = _;[col.Source: (TARGET_COLUMN_PREFIX + col.Target)] }[0], "inner")
      .WithColumn(COLUMN_OBJECT_OBJECT_SOURCE_ID, Column(COLUMN_OBJECT_TYPES_OBJECT_ID))
      .WithColumn(COLUMN_OBJECT_OBJECT_TARGET_ID, Column(TARGET_COLUMN_PREFIX + COLUMN_OBJECT_TYPES_OBJECT_ID))
      .WithColumn(COLUMN_OBJECT_OBJECT_QUALIFIER, "")
      .Select([COLUMN_OBJECT_OBJECT_SOURCE_ID, COLUMN_OBJECT_OBJECT_TARGET_ID, COLUMN_OBJECT_OBJECT_QUALIFIER]);
    if (allObjectToObjectDataFrame == _EMPTY) {
      allObjectToObjectDataFrame = objectToObjectDataFrame;
    } else {
      allObjectToObjectDataFrame = allObjectToObjectDataFrame.Append(objectToObjectDataFrame);
    }
  }
}

importDataframeToDatatable(modelObject.Configuration.OcelDataSource.Objects, allObjectsDataFrame);
importDataframeToDatatable(modelObject.Configuration.OcelDataSource.Events, allEventsDataFrame);
importDataframeToDatatable(modelObject.Configuration.OcelDataSource.EventToObject, allEventToObjectDataFrame);
importDataframeToDatatable(modelObject.Configuration.OcelDataSource.ObjectToObject, allObjectToObjectDataFrame);