登录查看更多内容

Read more to design better

Arpit Bhayani

发布日期: 2024年6月16日

Thank you so much for reading this edition of the newsletter ?? If you found it interesting, you will also love my courses

By the way,

Being hands-on is the best way for you to learn. Practice interesting programming challenges like building your own BitTorrent client, Redis, DNS server, and even SQLite from scratch on CodeCrafters.

?? Video I posted this week

This week I posted How Rockset achieves zero data latency and workload isolation at scale

Rockset has been a great database for me to dissect, and one of the most interesting things that I learned about it is how it achieves zero data latency and workload isolation at scale.

After talking about their architecture, storage layer, and query execution in the last three videos, I published the 4th one about their most amazing feature - Compute Compute Separation - and its internal details.

Instead of just talking about what it does, I have covered the intuition behind the approach and evolution of architecture. It will give you a ton of insights into how distributed databases are designed, built, and scaled.

?? Paper I read this week

This week I spent reading Query Attribute Recommendation at Amazon Search

This week I read a research paper by Amazon that solves a Search problem very similar to the one that I solved for Unacademy.

领英推荐

Deploying Serverless Application with AWS Lambda and…

Neal K. Davis 1 年前

Companies may inflate the title to retain you

Arpit Bhayani 12 个月前

Coding Challenge #17 - Memcached Server

John Crickett 1 年前

High-quality input to a search engine results in good quality results, but people type really short queries like "iPhone 13". Although this information seems complete to us, it is not sufficient for search engines. So there is a need to add Query Understanding.

The core idea is to build a model that expands the search query and generates and extracts relevant search query attributes which are then passed to search engines for better ranking, advertising, and recommendation.

For example: iPhone 13 -> brand:apple, os:ios, model:iphone, etc...

The information retrieval domain is fascinating and something I focussed on during my master's and during my first two years at Unacademy. The paper is pretty short and crisp and something you can even prototype, hence putting it out as a recommendation.

You can download this and other papers I recommend from my papershelf.

How PostgreSQL store large rows?

While exploring PostgreSQL internals, I stumbled upon an interesting internal detail about how it stores long rows.

PostgreSQL stores data in B-Trees and requires one row not to exceed the page size (about 8KB). So, when we insert a row longer than 8KB, it first tries to compress the overflowing data using its built-in compression algorithms. if it works, then great!

If it still does not fit, then TOAST comes into the picture. The core idea is to segment the data into chunks and store them in a dedicated TOAST table, while the original table holds the reference (checksum and virtual address) to ensure efficient retrieval.

The TOAST table for a particular table is named pg_toast_; you can find them with a simple query. This table stores the toast chunks (compressed if required), each identified by a chunk ID and sequence number for ordered retrieval.

We as users of PostgreSQL need not worry about storage as the database transparently manages it. Still, TOASTed data can incur slight performance overhead due to the additional layer of indirection and lookup.

We cannot completely avoid the TOAST, but we can minimize the need for it by designing the schema well, some best practices are

choose data types suited for the expected data size
for long columns, pick types (like bytea) that offer built-in compression
Track TOAST activity using pg_toast_table to identify bottlenecks
TOAST comes in multiple flavors, pick the one that suits your needs

For example, the PLAIN strategy offers a good balance for frequently accessed, compressible data; while EXTENDED works well for infrequently accessed data.

You can find this post on my LinkedIn and Twitter; do leave a like.

?? Interesting articles I read this week

I read a few engineering blogs almost every single day, and here are the three articles I would recommend you to read.

Thank you so much for reading this edition of the newsletter ?? If you found it interesting, you will also love my courses

I keep sharing no fluff stuff across my socials, so, if you resonate do give me a follow on Twitter, LinkedIn, YouTube, and GitHub.

Arpit's Newsletter

121,450 位关注者

William Caro Bautista

9 个月

Que importante y valioso tema el abordado en este artículo por nuestro autor, muy interesante y como se argumenta determinante para la persona y profesional que está en permanente compromiso de avanzar, de crecer, de mejorar, de enriquecer su talento y conocimiento. Esto se facilita enormemente con férreos hábitos y disciplina de lectura, consulta e investigación en temáticas específicas y de conjunto los cuales se presentan en los documentos de dise?o de una forma que es práctica para su aplicación. Por ello de lo acertado, valioso y practico para reflexionar, adicionando la decisión de iniciar con esta practica para avanzar y crecer.

CHESTER SWANSON SR.

Realtor Associate @ Next Trend Realty LLC | HAR REALTOR, IRS Tax Preparer

9 个月

Well said!.

2 次回应

Raghav B.

Pursuing PGDM

9 个月

Arpit Bhayani Informative.

1 次回应

Arpit Bhayani

9 个月

System Design for SDE-1s: https://arpitbhayani.me/sys-design System Design for SDE-2s and above: https://arpitbhayani.me/course Redis Internals: https://arpitbhayani.me/redis-internals My knowledge base: https://arpitbhayani.me/knowledge-base Bookshelf: https://arpitbhayani.me/bookshelf Research Papers: https://arpitbhayani.me/papershelf

2 次回应

查看更多评论

要查看或添加评论，请登录

Arpit Bhayani的更多文章

The Ideal End To An Ideal Career

2025年3月23日

The Ideal End To An Ideal Career

This edition of the newsletter contains one quick write-up that will help you grow faster in your career a video I…

6 条评论
How to Find and Ride the Next Tech Wave

2025年3月16日

How to Find and Ride the Next Tech Wave

This edition of the newsletter contains one quick write-up that will help you grow faster in your career a video I…

6 条评论
Engineer or Manager? How to Decide Your Path

2025年3月9日

Engineer or Manager? How to Decide Your Path

This edition of the newsletter contains one quick write-up that will help you grow faster in your career a video I…

7 条评论
One Career Bet Worth Taking

2025年3月2日

One Career Bet Worth Taking

This edition of the newsletter contains one quick write-up that will help you grow faster in your career a video I…

5 条评论
Leave your job with grace and gratitude

2025年2月23日

Leave your job with grace and gratitude

This edition of the newsletter contains one quick write-up that will help you grow faster in your career a video I…

7 条评论
Turn Boring Projects into Opportunities

2025年2月16日

Turn Boring Projects into Opportunities

This edition of the newsletter contains one quick write-up that will help you grow faster in your career a video I…

1 条评论
When is the right time to switch?

2025年2月10日

When is the right time to switch?

This edition of the newsletter contains one quick write-up that will help you grow faster in your career a video I…

8 条评论
Ramping up faster in your new job

2025年2月2日

Ramping up faster in your new job

This edition of the newsletter contains one quick write-up that will help you grow faster in your career a video I…

4 条评论
Back Your Disagreement with Data

2025年1月26日

Back Your Disagreement with Data

This edition of the newsletter contains one quick write-up that will help you grow faster in your career a video I…

2 条评论
Doubt yourself every day

2025年1月19日

Doubt yourself every day

This edition of the newsletter contains one quick write-up that will help you grow faster in your career a video I…

9 条评论

See all articles

Read more to design better

Arpit Bhayani