Revision as of 17:04, 26 May 2023

Introduction to AMI Historical Database

The AMI Historical Table is a columnar-based on-disk table engine with support for partitioning. The introduction section will briefly describe the core concepts of the AMI Historical database: Partitioning, Data storage and optimizations.

Partitioning

Partitioning is used to organize data for performance. To understand why partitioning is critical for large datasets, let’s first consider a non-partitioned standard table: as the number or rows increases, the performance (time to insert, update, query and alter) will decline. Partitioning provides an elegant solution by breaking up rows into separate isolated “virtual tables” called partitions. Once data is broken into partitions, the row count of one partition will not affect the performance of other partitions. In order to dictate how rows are organized into partitions, the AMI Historical Table introduces a special PARTITION column type. The value in the PARTITION column(s) determines which partition a given row will be assigned to. In other words, all rows for a given partition will have the same value in the PARTITION column(s).

By way of example, data is often partitioned by a date: Imagine we have the following historical table, partitioned by the column “date”

When inserting row(s) whose date match(es) the existing partition key(s), the query engine will automatically navigate to the corresponding partition according to the partition key and append the rows accordingly.

Note that adding another day of data will automatically create a new partition without affecting existing days’ data: For example, if we want to insert two new rows into the table whose “Date” == 20221203

To create a historical table add USE PersistEngine="HISTORICAL" and include the PARTITION keyword after the column(s) that the rows should be partitioned by. Ex:

CREATE PUBLIC TABLE PartitionDemo(date LONG PARTITION, symbol STRING, qty INTEGER, comments STRING) use PersistEngine="HISTORICAL";

The file structure of AMIHDB is such that each table has its own directory and each partition is assigned its own subdirectory. Each column and index has its own file(s) per partition directory.

<amiroot>/hdb/Table_Name.3fhtab/Partition_Id.dfhpar/<Files within a partition>

● Use ami.hdb.root.dir property to override location of hdb root

● Use ami.hdb.filehandles.max property to control how many files can be open at a time

● Table_Name - is the name of the table

● Partition_Id - An auto-incrementing id (starting at 0001)

● Files within a partition:

○ KEY.3fhkey - This stores the (plaintext) key for this partition

○ ColumnName.3fhcol - Stores data for the column

○ IndexName.3fhidx - Stores index data

@@ Line 31: / Line 31: @@
 <amiroot>/hdb/Table_Name.3fhtab/Partition_Id.dfhpar/<Files within a partition> <br>
-&nbsp;&nbsp;&nbsp;● Use '''ami.hdb.root.dir''' property to override location of hdb root<br>
+: ● Use '''ami.hdb.root.dir''' property to override location of hdb root<br>
-&nbsp;&nbsp;&nbsp;● Use '''ami.hdb.filehandles.max''' property to control how many files can be open at a time<br>
+: ● Use '''ami.hdb.filehandles.max''' property to control how many files can be open at a time<br>
-&nbsp;&nbsp;&nbsp;● Table_Name - is the name of the table<br>
+: ● Table_Name - is the name of the table<br>
-● Partition_Id - An auto-incrementing id (starting at 0001) <br>
+: ● Partition_Id - An auto-incrementing id (starting at 0001) <br>
-● Files within a partition:<br>
+: ● Files within a partition:<br>
-○ KEY.3fhkey - This stores the (plaintext) key for this partition<br>
+:: ○ KEY.3fhkey - This stores the (plaintext) key for this partition<br>
-○ ColumnName.3fhcol - Stores data for the column<br>
+:: ○ ColumnName.3fhcol - Stores data for the column<br>
-○ IndexName.3fhidx - Stores index data<br>
+:: ○ IndexName.3fhidx - Stores index data<br>

Difference between revisions of "AMI Historical Database"

Revision as of 17:04, 26 May 2023

Introduction to AMI Historical Database

Partitioning

Navigation menu

Search