Hash Table Internals - Part 11 - Implementing Hash Maps
A map implemented with a Hash Table is called a Hash Map; just like a regular map, it supports operations such as put, get, and iterate.
Key Implementation Details
Hash Maps are required to store application keys of any type, but the Hash Table understands only integers. We first pass each application key through a hash function to map it to a 32-bit integer, and then the usual Hash Table operation takes over.
Given that computing the hash key from the application key is a frequent operation, we keep the hash key handy by storing it alongside the application key. This saves us repeated computations.
While looking up a key in the Hash Map, we first reach the slot and then check whether the key present there is really the key we are looking for, as we cannot rely on the equality of the hash keys alone. The Hash Map, hence, accepts a key comparator function.
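As a sketch of these two ingredients, here is one possible hash function and comparator pair for string keys. FNV-1a is an assumed choice here, not something the Hash Map prescribes; the function names are illustrative:

```c
#include <stdint.h>
#include <string.h>

/* Map an application key (here, a C string) to a 32-bit hash key
 * using FNV-1a -- one possible hash function among many. */
static uint32_t hash_string(const void *key) {
    const unsigned char *s = key;
    uint32_t h = 2166136261u;          /* FNV offset basis */
    while (*s) {
        h ^= *s++;
        h *= 16777619u;                /* FNV prime */
    }
    return h;
}

/* The key comparator the Hash Map accepts: returns 0 when the two
 * application keys are equal, mirroring strcmp's convention. */
static int compare_string_keys(const void *a, const void *b) {
    return strcmp(a, b);
}
```

The Hash Map stores both function pointers at construction time, so the table's core logic never needs to know the concrete key type.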
With Chained Hash Tables
Each node of the linked list in the chained hash table has the following structure
struct node {
    int32 hash_key;
    void *key;
    void *value;
    struct node *next;
};
void *key and void *value allow us to hold a key and a value of any type, while int32 hash_key enables us to hold the pre-computed hash.
At the Hash Table level, we hold the array of buckets, the count of keys stored, the hash function, and the key comparator function.
Instead of a contains function, we have a lookup function that returns the value for the matching key, and NULL if the key does not exist. Thus, the lookup function doubles as a contains function.
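A minimal sketch of such a lookup over one chain, reusing the node structure above (with `uint32_t` standing in for `int32`; the names `chain_lookup` and `cmp_str` are assumptions for illustration):

```c
#include <stdint.h>
#include <stddef.h>
#include <string.h>

struct node {
    uint32_t hash_key;
    void *key;
    void *value;
    struct node *next;
};

/* Example comparator for string keys. */
static int cmp_str(const void *a, const void *b) { return strcmp(a, b); }

/* Walk the chain at the slot: first compare the cheap pre-computed
 * hash, and only on a hash match invoke the (possibly expensive)
 * key comparator. Returns the value, or NULL when the key is
 * absent -- so lookup doubles as contains. */
void *chain_lookup(struct node *head, uint32_t hash_key, const void *key,
                   int (*key_cmp)(const void *, const void *)) {
    for (struct node *n = head; n != NULL; n = n->next) {
        if (n->hash_key == hash_key && key_cmp(n->key, key) == 0)
            return n->value;
    }
    return NULL;  /* key does not exist */
}
```

Checking the stored hash_key before calling the comparator is exactly where the pre-computed hash pays off: most non-matching nodes are rejected with a single integer comparison.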
When duplicate keys are inserted, we can do one of the following: overwrite the existing value (the policy most map APIs follow), or reject the insertion and surface an error.
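As an illustration of the overwrite policy, here is a hedged sketch of a chain upsert. The helper name `chain_put` is an assumption, and error handling for malloc is elided:

```c
#include <stdint.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

struct node {
    uint32_t hash_key;
    void *key;
    void *value;
    struct node *next;
};

/* Example comparator for string keys. */
static int cmp_str(const void *a, const void *b) { return strcmp(a, b); }

/* Upsert into one chain: if the key already exists, overwrite its
 * value in place; otherwise prepend a new node. Returns the
 * (possibly new) head of the chain. malloc failure handling elided. */
struct node *chain_put(struct node *head, uint32_t hash_key,
                       void *key, void *value,
                       int (*key_cmp)(const void *, const void *)) {
    for (struct node *n = head; n != NULL; n = n->next) {
        if (n->hash_key == hash_key && key_cmp(n->key, key) == 0) {
            n->value = value;          /* duplicate key: overwrite */
            return head;
        }
    }
    struct node *n = malloc(sizeof *n);
    n->hash_key = hash_key;
    n->key = key;
    n->value = value;
    n->next = head;
    return n;
}
```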
With Open Addressing Hash Tables
Each slot of the hash table has the following structure
struct node {
    int32 hash_key;
    void *key;
    void *value;
    bool is_empty;
    bool is_deleted;
};
void *key and void *value allow us to hold a key and a value of any type, while int32 hash_key enables us to hold the pre-computed hash, saving a lot of runtime computations. is_empty tells us whether the slot is empty, while is_deleted marks a soft-deleted slot.
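A lookup sketch with linear probing shows how the two flags drive the probe. The structure is renamed `struct slot` here for clarity, `uint32_t` stands in for `int32`, and `oa_lookup`/`cmp_str` are assumed names:

```c
#include <stdint.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

struct slot {
    uint32_t hash_key;
    void *key;
    void *value;
    bool is_empty;
    bool is_deleted;
};

/* Example comparator for string keys. */
static int cmp_str(const void *a, const void *b) { return strcmp(a, b); }

/* Linear-probing lookup: the probe walks past soft-deleted slots,
 * because the key we want may live beyond them, and stops only at a
 * truly empty slot, which proves the key was never inserted. */
void *oa_lookup(struct slot *slots, size_t cap, uint32_t hash_key,
                const void *key,
                int (*key_cmp)(const void *, const void *)) {
    for (size_t i = 0; i < cap; i++) {
        struct slot *s = &slots[(hash_key + i) % cap];
        if (s->is_empty)
            return NULL;               /* never-used slot: not present */
        if (s->is_deleted)
            continue;                  /* tombstone: keep probing */
        if (s->hash_key == hash_key && key_cmp(s->key, key) == 0)
            return s->value;
    }
    return NULL;                       /* probed the whole table */
}
```

Note that deleting a key only flips is_deleted; the probe sequence for colliding keys stays intact, which is the whole point of soft deletion.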
At the Hash Table level, we hold the array of slots, the count of used slots (including the soft-deleted ones), the hash function, and the key comparator function.
Here the load factor is computed as (number of used slots, including soft-deleted ones) / (size of the array), because the soft-deleted keys also affect the performance of the hash table.
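The computation itself is a one-liner; counting tombstones as used slots is the point of the formula (function and parameter names are illustrative):

```c
#include <stddef.h>

/* Load factor for open addressing: soft-deleted (tombstoned) slots
 * still lengthen probe sequences, so they count as used. */
double load_factor(size_t occupied, size_t soft_deleted, size_t capacity) {
    return (double)(occupied + soft_deleted) / (double)capacity;
}
```

When this value crosses the resize threshold, rehashing into a fresh array also discards the tombstones, restoring short probe sequences.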
During insert, lookup, and delete, when we find a matching hash we explicitly compare the application keys, as multiple keys can hash to the same location and we cannot rely on hash key equivalence alone.
Here's the video of me explaining this in-depth; do check it out.
Thank you so much for reading! If you found this helpful, do spread the word about it on social media; it would mean the world to me.
If you liked this short essay, you might also like my courses on
I teach an interactive course on System Design where you'll learn how to intuitively design scalable systems. The course will help you
I have compressed my ~10 years of work experience into this course, and aim to accelerate your engineering growth 100x. To date, the course is trusted by 800+ engineers from 11 different countries and here you can find what they say about the course.
Together, we will dissect and build some amazing systems and understand the intricate details. You can find the week-by-week curriculum and topics, testimonials, and other information at https://arpitbhayani.me/masterclass.
More about me: arpitbhayani.me
Newsletter: arpitbhayani.me/newsletter
Subscribe to #AsliEngineering for such in-depth engineering concepts: https://www.youtube.com/c/ArpitBhayani
System Design course: arpitbhayani.me/masterclass
Microservices: https://courses.arpitbhayani.me/designing-microservices
All GitHub Outages: https://courses.arpitbhayani.me/github-outage-dissections/