Importing Data to Datatable from CSV File: Difference between revisions

From QPR ProcessAnalyzer Wiki
Jump to navigation Jump to search
(Created page with "Data can be imported to datatables from CSV files using the Workspace screen. Import is done as follows: # On the left side projects...")
 
No edit summary
Line 14: Line 14:


== Data types ==
== Data types ==
It's very important to set the most suitable data types for each column, as it greatly affects the usage of the data in the analyses. Incorrect data type usually make the analysis impossible. The following data types are available:
{| class="wikitable"
! '''Data type'''
! '''Use'''
! '''Settings'''
|-
||String
||Data type for textual data. Also suitable for distinct values that are not numerical, e.g. low, medium high.
||
|-
||Integer
||Data type for whole numbers, i.e. numbers that don't need decimals.
||Unit
|-
||Decimal number
||
||
|-
||Date
||Data type for precise timestamps and
||
|-
||Boolean
||Data type that contains only two values, true and false. The ''null' value is also available (like in other data types) which in practise is the third possible value for boolean type of data. If no more values are needed, boolean is the most efficient data type to use. Note that in dashboards, the true and false values can be translated into user understandable texts, such as "yes" and "no".
||
|-
||Timespan
||
||
|}


== Supported CSV Formats ==
== Supported CSV Formats ==

Revision as of 13:25, 21 June 2021

Data can be imported to datatables from CSV files using the Workspace screen. Import is done as follows:

  1. On the left side projects hierarchy, select the project where the target datatable is located.
  2. Open the Datatables tab.
  3. Select the datatable where to import data.
  4. Click the Import button.
  5. Select the CSV file to be imported, adjust data type and conversion settings if needed, and click Start import.

Import Settings

The data import dialog has the following settings:

  • Select CSV file: Button to select the CSV file. You can use this button to change the CSV file if you want to change the file.
  • Column separator: Column separator character used by the CSV file. The column separator is automatically detected when the file is selected. In some cases, the automatic detection does not find the correct column separator, and this selection allows to change the column separator. If you want to use space as a column separator, select <space>, or if you want to use tabulator, select <tab>. Note that when changing the column separator, all column specific settings in the table are reset.
  • Skip errors: Allows to continue the import when encountering errors and import null values in place of cells where the conversion could not be made. If error skipping is not in use, the import will stop to the first encountered error. The data conversions are checked before importing actual data to the datatables, so data conversion errors will not lead to partially done imported.
  • Character encoding: Character encoding used by the CSV file. Most text files use UTF-8 encoding which is the default. If some of the characters in the CSV file look incorrect in the table preview, try to change other character encoding to find the correct one. Note that there is no automatic detection of the character encoding. Note also that changing the character encoding will reset all column specific settings

Data types

It's very important to set the most suitable data types for each column, as it greatly affects the usage of the data in the analyses. Incorrect data type usually make the analysis impossible. The following data types are available:

Data type Use Settings
String Data type for textual data. Also suitable for distinct values that are not numerical, e.g. low, medium high.
Integer Data type for whole numbers, i.e. numbers that don't need decimals. Unit
Decimal number
Date Data type for precise timestamps and
Boolean Data type that contains only two values, true and false. The null' value is also available (like in other data types) which in practise is the third possible value for boolean type of data. If no more values are needed, boolean is the most efficient data type to use. Note that in dashboards, the true and false values can be translated into user understandable texts, such as "yes" and "no".
Timespan

Supported CSV Formats