登录查看更多内容

Snowflake Shorts: Identifying Patterns and Trends from Queries

Divyansh Saxena

?? Snowflake Advanced Certified Architect ?? Snowflake Data SuperHero 2023-25 ?? | Snowflake Jaipur User Group Leader | Snowflake SnowPro SME | ex-IBM | 12X Multi-Hyperscalar Cloud ?? Certified | 7K Network @ LinkedIn

发布日期: 2023年9月6日

If you are a data enthusiast, I am sure that you must have been in a phase where you spent hours identifying queries with similar execution plans to understand the optimization scope. Snowflake now has a feature in PuPr, allowing us to identify patterns and trends from queries. Let's discuss more about it in today's snowflake shorts.

Introduction of New Columns

query_hash and query_parameterized_hash are new output columns in the ACCOUNT_USAGE views and in the output of INFORMATION_SCHEMA table functions. These columns contain a hash of the query text. You can use this hash to analyze repeated queries.

These columns are available in the following views and in the output of the following table functions:

QUERY_HISTORY
TASK_HISTORY
QUERY_ACCELERATION_ELIGIBLE

As of today, they are part of the behavior bundle release 2023_06. You can enable it with the below query:

SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2023_06');

Understanding QUERY_HASH

The query_hash column contains a hash value that is computed, based on the canonicalized text of the SQL statement. Repeated queries that have exactly the same query text have the same query_hash values.

For instance, the below queries will have the same QUERY_HASH

SELECT * FROM TEST_DB.SF_SCH_203.CUSTOMER WHERE     NAME = 'DIVYANSH';

SELECT * from test_db.sf_sch_203.customer where name= 'DIVYANSH';

Queries will have the same hash if their text differs only in:

Case insensitive identifier, session variable, and stage name
White Space
Comments

Understanding QUERY_PARAMETERIZED_HASH

query_parameterized_hash contains a hash value that is computed based on the parameterized query, which means the version of the query after the literals are parameterized. These literals must be used in the query predicate and must be used with one of the following comparison operators:

= (equal to)
!= (not equal to)
>= (greater than or equal to)
<= (less than or equal to)

For below 2 queries, the query_parameterized_hash will be the same:

领英推荐

Data Analysts, Stop Ignoring Pandas Series

Benjamin Bennett Alexander 1 周前

20 Stupid Claims About Big Data

Bernard Marr 9 年前

Basic Data Structure Types You Must Know

StrataScratch 5 个月前

SELECT * FROM TEST_DB.SF_SCH_203.CUSTOMER WHERE  NAME = 'DIVYANSH';

SELECT * FROM TEST_DB.SF_SCH_203.CUSTOMER WHERE     NAME = 'PAUL';

Queries will have the same parameterized hash if their text differs only in:

Case insensitive identifier, session variable, and stage name
White Space
Comments

Important Things to Note

Over time, the logic used by Snowflake to generate the query hash can change. Changes to this logic can result in different hashes produced for the same query.

The views and table function output that include the query_hash and query_parameterized_hash columns also include the following columns that specify the version of the logic used to produce the hashes:

query_hash_version
query_parameterized_hash_version

If these columns contain different version numbers for different periods of time, you can use these version columns to identify the different hashes for the same query.

About Me:

Hi there! I am Divyansh Saxena

I am an experienced Data Engineer with a proven track record of success in Snowflake Data Cloud technology. Highly skilled in designing, implementing, and maintaining data pipelines, ETL workflows, and data warehousing solutions. Possessing advanced knowledge of Snowflake’s features and functionality, I am a Snowflake Data superhero & and Snowflake Snowpro Core SME. With a major career in Snowflake Data Cloud, I have a deep understanding of cloud-native data architecture and can leverage it to deliver high-performing, scalable, and secure data solutions.

Follow me on Medium for regular updates on Snowflake Best Practices and other trending topics:

Sneha K ?

Data Engineer @ IBM | Microsoft Certified, Cloud Computing, Data Management

1 年

This inspires me to get one snowpro certificate ?? .

1 次回应

要查看或添加评论，请登录

Divyansh Saxena的更多文章

[2025] Improve Your Pandas Workloads Using Snowflake Snowpark Pandas API

2025年3月28日

[2025] Improve Your Pandas Workloads Using Snowflake Snowpark Pandas API

Those who are familiar with the Pandas Dataframes, I believe you are aware of Pandas not been optimized for handling…

1 条评论
Empowering Data, Building Futures: Snowflake Data Superhero @Partner Connect 2025

2025年3月21日

Empowering Data, Building Futures: Snowflake Data Superhero @Partner Connect 2025

I had a pleasure of meeting Yang Yang and Ash Willis during Snowflake partner connect 2025 in Bangalore. It was a great…

2 条评论
[2025] Confused Between COPY, Snowpipe, Dynamic Tables? Let’s Understand Ingestion Mechanisms in Snowflake

2025年3月16日

[2025] Confused Between COPY, Snowpipe, Dynamic Tables? Let’s Understand Ingestion Mechanisms in Snowflake

Bringing data into Snowflake is an easy job. You all must be aware of the services that snowflake offers like COPY INTO…

2 条评论
Snowflake Arctic, Snowflake Data Cleanroom, Upcoming Jaipur Virtual Meetup, and More...

2024年4月30日

Snowflake Arctic, Snowflake Data Cleanroom, Upcoming Jaipur Virtual Meetup, and More...

April is ending today, and with that I am back again with my latest monthly newsletter series on Discover Snowflake…
New Snowflake User Groups Now in India | April 13, 2024- Jaipur Meet-Up

2024年4月7日

New Snowflake User Groups Now in India | April 13, 2024- Jaipur Meet-Up

My fellow LinkedIn Connections, It's been a couple of weeks since I published my last LinkedIn article. It's never too…
500+ Enrollments! Avail Your Snowflake Snowpark for Python Course for FREE

2023年11月5日

500+ Enrollments! Avail Your Snowflake Snowpark for Python Course for FREE

Use the below link to enroll. Hurry, the Promo code Expires Today! I am happy to see that I have received an…
Elevate Your Snowflake Experience with Flurry Insights Native Apps

2023年11月4日

Elevate Your Snowflake Experience with Flurry Insights Native Apps

In the dynamic landscape of cloud computing, businesses are constantly seeking innovative solutions to optimize their…
Master Snowflake Snowpark API with My Latest Python Course!

2023年10月22日

Master Snowflake Snowpark API with My Latest Python Course!

Did you know that the put_stream in snowpark FileOperation uses the file/data already in memory using a BytesIO object…

1 条评论
Keep Abreast with Monthly Snowflake Rewind

2023年9月20日

Keep Abreast with Monthly Snowflake Rewind

For all the Data Enthusiasts on Linkedin, I know you all have been overwhelmed and excited by new features that…

1 条评论
Snowflake Shorts: Execute SQL Scripts From Snowflake Stage

2023年9月7日

Snowflake Shorts: Execute SQL Scripts From Snowflake Stage

Ever wonder about maintaining your own code repository inside Snowflake and deploying objects through Snowflake scripts…

1 条评论

See all articles

Snowflake Shorts: Identifying Patterns and Trends from Queries

Divyansh Saxena

?? Snowflake Advanced Certified Architect ?? Snowflake Data SuperHero 2023-25 ?? | Snowflake Jaipur User Group Leader | Snowflake SnowPro SME | ex-IBM | 12X Multi-Hyperscalar Cloud ?? Certified | 7K Network @ LinkedIn

Introduction of New Columns

Understanding QUERY_HASH

Understanding QUERY_PARAMETERIZED_HASH

领英推荐

Important Things to Note

About Me:

Divyansh Saxena的更多文章

社区洞察

其他会员也浏览了

An Overview of the materialized view of Snowflake

Exactly How SQL is Used on the Job by Data Scientists - A Case Study

Mastering Advanced SQL Queries for Data Professionals

OLAP vs OLTP: The Unsung Heroes of the Data World

Microsoft Data Platform News 2024 - Week 01

From Hero Mentality to Reproducibility: DataOps for Humans

DATABASE NORMALIZATION

BOT's #1: Adding dynamic images to your Google Data Studio report

The Hitchhiker's Guide to Data Lineage - Part I

how we organize data defines how we can search for it

Introduction of New Columns

Understanding QUERY_HASH

Understanding QUERY_PARAMETERIZED_HASH

领英推荐

Important Things to Note

About Me:

Divyansh Saxena的更多文章

[2025] Improve Your Pandas Workloads Using Snowflake Snowpark Pandas API

Empowering Data, Building Futures: Snowflake Data Superhero @Partner Connect 2025

[2025] Confused Between COPY, Snowpipe, Dynamic Tables? Let’s Understand Ingestion Mechanisms in Snowflake

Snowflake Arctic, Snowflake Data Cleanroom, Upcoming Jaipur Virtual Meetup, and More...

New Snowflake User Groups Now in India | April 13, 2024- Jaipur Meet-Up

500+ Enrollments! Avail Your Snowflake Snowpark for Python Course for FREE

Elevate Your Snowflake Experience with Flurry Insights Native Apps

Master Snowflake Snowpark API with My Latest Python Course!

Keep Abreast with Monthly Snowflake Rewind

Snowflake Shorts: Execute SQL Scripts From Snowflake Stage

社区洞察

其他会员也浏览了

An Overview of the materialized view of Snowflake

Exactly How SQL is Used on the Job by Data Scientists - A Case Study

Mastering Advanced SQL Queries for Data Professionals

OLAP vs OLTP: The Unsung Heroes of the Data World

Microsoft Data Platform News 2024 - Week 01

From Hero Mentality to Reproducibility: DataOps for Humans

DATABASE NORMALIZATION

BOT's #1: Adding dynamic images to your Google Data Studio report

The Hitchhiker's Guide to Data Lineage - Part I

how we organize data defines how we can search for it