DataFlow in Expression Language: Difference between revisions

From QPR ProcessAnalyzer Wiki
Jump to navigation Jump to search
No edit summary
No edit summary
Line 1: Line 1:
DataFlow is an object representing a stream for tabular data. Difference to the DataFrame is that the DataFrame contains all its contents stored to the system memory. If there is lot of data, also lot of memory is required when using the DataFrame. On the other hand, in the DataFlow, the tabular contents "flows" from a source and is stored to a destination. Data can be manipulated, while having only a small portion of the entire data in memory at the same time. Thus DataFlows are suitable for ETL where the data volumes are large.
DataFlow is an object representing a stream of tabular data. DataFlow contains data with the similar structure as DataFrame, but difference is that in the DataFrame all its contents is stored to the system memory. If there is lot of data, also lot of memory is required when using DataFrames. On the other hand in the DataFlow, contents "flows" from the source to the destination, and data can be manipulated, while having only a small portion of the entire data in memory at the same time. Thus, DataFlows are suitable for ETL where data volumes are high.


{| class="wikitable"
{| class="wikitable"

Revision as of 16:03, 7 December 2022

DataFlow is an object representing a stream of tabular data. DataFlow contains data with the similar structure as DataFrame, but difference is that in the DataFrame all its contents is stored to the system memory. If there is lot of data, also lot of memory is required when using DataFrames. On the other hand in the DataFlow, contents "flows" from the source to the destination, and data can be manipulated, while having only a small portion of the entire data in memory at the same time. Thus, DataFlows are suitable for ETL where data volumes are high.

Function Punctions Description
Persist (String*) Datatable
  1. Datatable name (String)
  2. Additional parameters (Dictionary)
Writes DataFlow into datatable. Works similarly as the same function in the DataFrames.