How to decide when a query is too slow and needs optimization

This is a cross-post from Twitter. I post a new PostgreSQL "howto" article there every day. Join me in this journey – subscribe here or on X, provide feedback, share!

"Slow" is a relative concept. In some cases, we might be happy with query latency 1 minute (or no?), while in other scenarios, even 1 ms might seem to be too slow.

Deciding when to apply optimization techniques is important for efficiency – as Donald Knuth famously stated in "Computer Programming as an Art":

The real problem is that programmers have spent far too much time worrying about efficiency in the wrong places and at the wrong times; premature optimization is the root of all evil (or at least most of it) in programming.

Below we assume that we work with OLTP or hybrid workloads and need to decide if a certain query is too slow and requires optimization.

How to conclude that a query is too slow

  1. Do you have an OLTP case, an analytical one, or a hybrid? For OLTP cases, requirements are stricter and dictated by human perception (see: What is a slow SQL query?), while for analytical needs we can usually wait a minute or two – unless the query is also user-facing. If it is, 1 minute is probably too slow; in that case, consider using column-store database systems (the Postgres ecosystem has a new offering here: check out Hydra). For OLTP, the majority of user-facing queries should be below 100ms – ideally below 10ms – so that the complete requests users make to your backends do not exceed 100-200ms (each request can issue several SQL queries, depending on the case). Non-user-facing queries, such as those coming from background jobs, pg_dump, and so on, can last longer – assuming that the next principles are met. (The first sketch after this list shows one way to find queries violating these thresholds.)
  2. In the case of OLTP, the second question is: is this query read-only, or does it change data (be it DDL or data-modifying DML – INSERT/UPDATE/DELETE)? A data-changing query in OLTP shouldn't be allowed to run longer than a second or two, unless we are 100% sure it won't block other queries for long. For massive writes, consider splitting them into batches so that each batch doesn't last longer than 1-2 seconds (a batching sketch follows this list). For DDL, be careful with lock acquisition and lock chains (read these posts: Common DB schema change mistakes and Useful queries to analyze PostgreSQL lock trees (a.k.a. lock queues)).
  3. If you're dealing with a read-only query, make sure it's also not running for too long – long-running transactions make Postgres hold old dead tuples for a long time (the "xmin horizon" does not advance), so autovacuum cannot delete tuples that became dead after the start of our transaction. Aim to avoid transactions that last longer than one or a few hours; if you absolutely need such long transactions, prefer running them during low-activity hours, when the XID counter is advancing slowly, and do not run them often. (A query to spot such transactions follows this list.)
  4. Finally, even if a query is relatively fast – for instance, 10ms – it might still be considered too slow if its frequency is high. For example, if a 10ms query runs 1,000 times per second (you can check this via pg_stat_statements.calls), Postgres needs to spend 10 seconds of execution time every second to process this group of queries. In this case, if lowering the frequency is hard, the query should be considered slow, and an optimization attempt should be made to reduce resource consumption (the goal here is to reduce pg_stat_statements.total_exec_time – see the previous #PostgresMarathon posts about pgss, and the first sketch below).
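
To illustrate items 1 and 4, here is a minimal sketch of how to find optimization candidates using pg_stat_statements (assuming the extension is installed; column names are for Postgres 13+, where total_time was split into total_exec_time):

    -- top 20 normalized queries by cumulative execution time;
    -- calls and mean_exec_time help separate "slow per call"
    -- from "fast but too frequent" queries
    select
      queryid,
      calls,
      round(mean_exec_time::numeric, 2) as mean_exec_time_ms,
      round(total_exec_time::numeric, 2) as total_exec_time_ms,
      left(query, 60) as query_sample
    from pg_stat_statements
    order by total_exec_time desc
    limit 20;

A query with mean_exec_time_ms above ~100 is a candidate under item 1 (if user-facing); a query with huge calls and total_exec_time is a candidate under item 4, even if its per-call latency looks fine.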
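
For item 2, a hypothetical sketch of batched deletion (the table events and the retention condition are made up for illustration). Run each statement outside an explicit transaction block, so every batch is its own short transaction:

    -- delete in batches of ~5,000 rows; repeat from application code
    -- (or a shell loop) until the command reports "DELETE 0"
    with batch as (
      select ctid
      from events
      where created_at < now() - interval '90 days'
      limit 5000
    )
    delete from events
    where ctid in (select ctid from batch);

Tune the batch size so each run stays within the 1-2 second budget.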
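
For item 3, one way to spot transactions that hold the xmin horizon back is pg_stat_activity (the 1-hour threshold here is arbitrary – adjust it to your needs):

    -- sessions with transactions open for more than 1 hour;
    -- backend_xmin shows what autovacuum cannot clean up past
    select
      pid,
      now() - xact_start as xact_duration,
      backend_xmin,
      state,
      left(query, 60) as last_query
    from pg_stat_activity
    where xact_start < now() - interval '1 hour'
    order by xact_start;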

Summary

  • All queries that last longer than 100-200 ms should be considered slow if they are user-facing. Good queries are those below 10 ms.
  • Background processing queries are ok to last longer. If they modify data and might block user-facing queries, then they should not be allowed to last longer than 1-2 s.
  • Be careful with DDL – make sure it doesn't cause massive writes (if it does, split the work into batches as well), and use a low lock_timeout with retries to avoid blocking chains (see the sketch after this list).
  • Do not allow long-running transactions (>1-2h): make sure the xmin horizon is progressing so that autovacuum can remove dead tuples promptly.
  • Optimize even fast (<100ms) queries if the corresponding pg_stat_statements.calls and pg_stat_statements.total_exec_time are high.
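
For DDL under load, a minimal sketch of the low lock_timeout + retries approach (the table and column names here are hypothetical):

    -- fail fast instead of waiting in the lock queue and blocking others
    set lock_timeout = '100ms';

    alter table events add column processed int8;
    -- on error "canceling statement due to lock timeout",
    -- retry from application code or a shell loop, ideally with backoff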
