Historical Data Tracking in PostgreSQL - Part 2: Trigger Functions

Jaime Martínez Verdú

Engineer and Data Analyst | Expert in Digital Transformation and Data Strategy

Published: August 4, 2023

Introduction

In the first article of this series, we explored the concept of historical data tracking in #PostgreSQL and discussed the importance of maintaining historical records in certain scenarios. To achieve this functionality in PostgreSQL, where native support for #DataHistorification is absent, we rely on #TriggerFunctions. In this article, we will delve into the definition and implementation of trigger functions, which play a crucial role in managing historical data.

Defining Trigger Functions

Trigger functions in PostgreSQL are user-defined functions that are automatically executed in response to specified events, such as INSERT, UPDATE, or DELETE operations on a table. These functions allow us to capture these events and perform custom actions, making them a powerful tool for #DataTracking.

Let's take a look at the trigger functions we need in our historification process:

stg.trg_fnc_products_delete()

This function captures DELETE operations on the 'stg.products' table and updates the corresponding historical record in 'stg.products_hist' by setting the 'end_date' to the current timestamp and marking it as 'deleted = true'. Here's the code breakdown:

CREATE OR REPLACE FUNCTION stg.trg_fnc_products_delete()
RETURNS TRIGGER
LANGUAGE plpgsql
AS $function$
BEGIN
    -- Close the active version of the deleted row and flag it as deleted.
    UPDATE stg.products_hist SET end_date = NOW(), deleted = TRUE
    WHERE id = OLD.id AND end_date IS NULL;
    RETURN OLD;
END;
$function$;

The 'UPDATE' statement within the function sets the 'end_date' column to the current timestamp using 'NOW()' (in PostgreSQL, the start time of the current transaction), which marks the end of validity of the historical record. It also flags the record as deleted by setting 'deleted = TRUE'. The condition 'WHERE id = OLD.id AND end_date IS NULL' ensures that only the active historical record associated with the deleted row is updated. The 'OLD' keyword refers to the row being deleted from the 'stg.products' table.
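To take effect, the function has to be bound to the table with a trigger. The following is a minimal sketch and not necessarily the definition used in Part 1 of this series: the trigger name 'trg_products_delete' and the AFTER timing are assumptions, and 'EXECUTE FUNCTION' requires PostgreSQL 11 or later (older versions use 'EXECUTE PROCEDURE').

```sql
-- Sketch only: the trigger name and AFTER timing are assumptions.
-- Assumes stg.products and stg.trg_fnc_products_delete() already exist.
DROP TRIGGER IF EXISTS trg_products_delete ON stg.products;

CREATE TRIGGER trg_products_delete
    AFTER DELETE ON stg.products
    FOR EACH ROW
    EXECUTE FUNCTION stg.trg_fnc_products_delete();
```

In an AFTER row-level trigger the return value is ignored. If the trigger were declared BEFORE DELETE instead, returning OLD lets the delete proceed, while returning NULL would cancel it.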

stg.trg_fnc_products_insert()

This function captures INSERT operations on the 'stg.products' table and copies the newly inserted row into the 'stg.products_hist' table, thereby creating a historical record. The code is as follows:

CREATE OR REPLACE FUNCTION stg.trg_fnc_products_insert()
RETURNS TRIGGER
LANGUAGE plpgsql
AS $function$
BEGIN
	INSERT INTO stg.products_hist SELECT NOW(), NULL, FALSE, NEW.*;
	RETURN NEW;
END;
$function$;

The 'INSERT INTO' statement inserts a new row into the historical table. The 'SELECT NOW(), NULL, FALSE, NEW.*' statement prepends three values to the newly inserted row ('NEW.*'): the current timestamp for 'start_date', NULL for 'end_date', and FALSE for 'deleted'. Because the statement names no target columns, the values are matched by position, so 'stg.products_hist' must list those three columns first, followed by the columns of 'stg.products' in the same order. The 'NEW' keyword refers to the row inserted into the 'stg.products' table.
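As an illustration of the layout this positional insert relies on, here is a hedged sketch of the two tables. Only 'id', 'start_date', 'end_date', and 'deleted' appear in the article; the 'name' and 'price' columns and all data types are hypothetical placeholders, and the real tables from Part 1 may differ.

```sql
-- Hypothetical layout: "name", "price", and the data types are assumptions.
CREATE SCHEMA IF NOT EXISTS stg;

CREATE TABLE IF NOT EXISTS stg.products (
    id    integer PRIMARY KEY,
    name  text,
    price numeric(10, 2)
);

-- The three tracking columns come first, followed by the source columns
-- in the same order as stg.products, so that
-- "SELECT NOW(), NULL, FALSE, NEW.*" lines up position by position.
CREATE TABLE IF NOT EXISTS stg.products_hist (
    start_date timestamptz NOT NULL,
    end_date   timestamptz,
    deleted    boolean     NOT NULL DEFAULT FALSE,
    id         integer     NOT NULL,
    name       text,
    price      numeric(10, 2)
);
```

With this ordering, a column added later to the end of both tables keeps the positions aligned and the trigger function does not need to change.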

stg.trg_fnc_products_update()

This function handles UPDATE operations on the 'products' table by updating the 'end_date' of the previous version and inserting the updated row as a new version in the 'products_hist' table, ensuring historical tracking of data changes:

CREATE OR REPLACE FUNCTION stg.trg_fnc_products_update()
RETURNS TRIGGER
LANGUAGE plpgsql
AS $function$
BEGIN
    -- Close the currently active version of the row.
    UPDATE stg.products_hist SET end_date = NOW()
    WHERE id = NEW.id AND end_date IS NULL;
    -- Store the updated row as the new active version.
    INSERT INTO stg.products_hist SELECT NOW(), NULL, FALSE, NEW.*;
    RETURN NEW;
END;
$function$;

The function consists of two main parts:

The 'UPDATE' statement closes the active historical record: it sets 'end_date' to the current timestamp ('NOW()') for the row whose 'id' matches the updated row ('NEW.id') and whose 'end_date' is still NULL. The 'NEW' keyword refers to the updated row from the 'stg.products' table. Matching on 'NEW.id' assumes the 'id' itself is never changed by an update.
The 'INSERT INTO' statement then inserts the updated row into the historical table as a new version. Because 'NOW()' returns the transaction start time, the 'end_date' of the closed version and the 'start_date' of the new version are identical.
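The combined effect of the three functions can be checked with a short walk-through. This is a sketch under stated assumptions: the tables exist, each function is attached to 'stg.products' by a row-level trigger for its event, and the 'name' and 'price' columns are hypothetical.

```sql
-- Assumes the history table and all three triggers are in place.
-- "name" and "price" are hypothetical columns used for illustration.
-- Run each statement in its own transaction (autocommit), so that
-- NOW() yields a different timestamp for every step.
INSERT INTO stg.products (id, name, price) VALUES (1, 'Widget', 9.99);

UPDATE stg.products SET price = 12.50 WHERE id = 1;

DELETE FROM stg.products WHERE id = 1;

-- Inspect the resulting version chain.
SELECT start_date, end_date, deleted, id, price
FROM stg.products_hist
WHERE id = 1
ORDER BY start_date;
```

The query should return two versions: the original row, closed at the time of the update with 'deleted' still FALSE, and the updated row, closed at the time of the delete with 'deleted' set to TRUE. If all three statements ran inside one transaction, every timestamp would be the same.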

General Recommendations for Creating Trigger Functions for Historical Data Management

Careful Column Ordering: As mentioned in the previous article, when designing your historical table, make sure the 'start_date', 'end_date', and 'deleted' columns are positioned at the beginning of the table. This ensures easier handling of trigger functions when additional columns are added later.
Maintain Consistency: Ensure that the logic in your trigger functions is consistent with the data model and business requirements.
Consider Performance: Trigger functions are executed with every relevant table operation, so efficiency is essential. Optimize your queries and consider using partial indexes to speed up historical data retrieval.
Backups and Archiving: Historical data can grow significantly, so implement regular backups and consider archiving older historical records to maintain a manageable database size.
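For the performance recommendation, a partial index restricted to active records is one option. The sketch below is an assumption rather than part of the original setup, and the index name is hypothetical; it targets the 'WHERE id = ... AND end_date IS NULL' lookup that the delete and update functions run on every change.

```sql
-- Hypothetical index name; assumes stg.products_hist has the id and
-- end_date columns described in the article.
-- Only rows with end_date IS NULL (the active versions) are indexed,
-- so the index stays small even as closed history accumulates.
CREATE INDEX IF NOT EXISTS idx_products_hist_active
    ON stg.products_hist (id)
    WHERE end_date IS NULL;
```

Declaring the index as UNIQUE would additionally enforce that each 'id' has at most one active version, provided the existing data already satisfies that rule.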

Trigger functions in PostgreSQL are powerful tools that enable us to create historical records and maintain a trail of data changes. By defining appropriate trigger functions, we can effectively track and manage historical data in the absence of native support.

In the next article, we'll dive into the exciting world of ETL processes and how they seamlessly integrate with historified tables. By the end of the series, you will have a comprehensive understanding of historical data management and be equipped with the knowledge to implement it effectively in your own projects.

Stay curious and stay committed to optimizing your data management practices! We look forward to seeing you in the final chapter of our Historical Data Tracking series.

#HistoricalDataTracking #PostgreSQL #DataManagement #DataHistorification #DatabaseTips #BestPractices #HistoricalData
