登录查看更多内容

Introduction to Probabilistic Data Structures

Vivekanand Kirubanandan

Founder @ SDE Skills - Building a community to upskill software engineers | Investor | Engineering Leader

发布日期: 2023年10月29日

I have often written about the SDE Skills meetup. I may be biased, but I think it is one of the best tech interview prep groups around. The secret to its success is the amazing set of volunteers and the engaging participants.

Yesterday, we had our first in-person System Design Meetup that we have had in a while. This is the first of many more to come.

Here is the handout Nagarajulu Aerakoni shared: https://jmpto.us/23-10-28_Handout

Here is a quick run down of what happened (based on the notes I took):

Data structures - Pt 1: Bloom Filters and more...

Hash functions are used to map an infinite input space to a finite output space. Hash functions are usually one-way functions and suffer from collisions, where multiple inputs may map to the same output.

Bloom filters are used to answer the question of membership. Does this element exist in the bloom filter? The response is a Definite No, or a Maybe Yes. Due to the nature of how it is structured, it cannot provide a deterministic affirmative answer. It is fast, efficient and this comes at the cost of false positives, where it may provide a “it may exist” even if it doesn’t exist in the filter.

Count-Min Sketch is used to count the number of appearances of an item. There are quite a bit of similarities in the design and application of bloom filters and count-min-sketch. This can provide an answer to how many times have we seen this element. The response may be overstated by a bit.

Quick sidebar on applying these during tech interviews

In a tech interview, how you talk about solutions is super important. While these cool algorithms, Bloom filters and Count-Min Sketch, can help save space when dealing with lots of data they do suffer from incorrect responses. That said:

Don't start with these fancy algorithms: Bloom filters or Count-Min Sketch are rarely a part of any system design interview question’s initial design. Keep your solutions simple and stay away from these “fancy” algorithms. These algorithms are like special tools.

Bring them up at the right time. Usually, when we are discussing the storage, time-complexity of your design, if it is appropriate, you may want to bring up these algorithms to explain how you can significantly improve the design by leveraging these. This is very similar to how you think about adding a load balancer, or a caching layer. Applying them prematurely will make you sound very text-bookish.

领英推荐

What are the Best Strategies for Tackling Algorithm…

Sandeep Jain 7 个月前

Implementing Machine Learning Algorithms in Java

ParallelStaff 7 个月前

AI Will Actually Create More Jobs in Tech

Daivik Goel 3 个月前

And lastly, don’t bring these up if you can’t explain these algorithms well.

Data structures part 2: Counters

On to Linear counters, and LogLog style counters.

Often times, you will need to estimate the cardinality (number of unique elements).

Linear Counters uses a hash function to estimate the cardinality, suffers from overcounting due to collision.
A variety of LogLog style counters that estimates the cardinality based on the number of trailing zeros in the hash values of the elements in the set.

What Next?

We had 20+ participants, the room had capacity for a few more. If you have some time to spare join us in upcoming weeks. Engage with us in many channels that we have.

WhatsApp Notification group: https://jmpto.us/WhatsApp
Discord group with over 10k members: https://jmpto.us/discord
Meetup group with over 4k members: https://jmpto.us/meetup

#systemdesign #algorithms #meetup #sdeskills

要查看或添加评论，请登录

Vivekanand Kirubanandan的更多文章

Do you know how to navigate the “phases” of a System Design Interview?

2024年2月22日

Do you know how to navigate the “phases” of a System Design Interview?

TLDR; there are 5 parts. Requirements gathering, high level design, low level design, system verification and…

1 条评论
I am Engineering Manager who is Passively Looking, But No Bites - What Gives?

2024年2月19日

I am Engineering Manager who is Passively Looking, But No Bites - What Gives?

Recently during the SDE Skills meetup, I caught up with an Engineering Manager who was looking for some advice on how…

1 条评论
Cold start prep for a Engineering Manager Interviews

2023年11月10日

Cold start prep for a Engineering Manager Interviews

I heard this question about half-a-dozen times in the past two months - I am an engineering manager who hasn’t…
Transform your Thoughts into Words: Leveraging AI to Elevate Content Creation

2023年8月13日

Transform your Thoughts into Words: Leveraging AI to Elevate Content Creation

Are you're facing writer's block? Do you find verbalizing your thoughts is easier? Consider leveraging this approach to…

2 条评论
Backtracking: An Effective Approach to Solve Complex Interview Questions

2023年7月25日

Backtracking: An Effective Approach to Solve Complex Interview Questions

In the world of technical interviews, problem-solving abilities are paramount. Interviewers often present candidates…
Announcing the launch of Conversions API Gateway Multiple Accounts

2023年2月23日

Announcing the launch of Conversions API Gateway Multiple Accounts

Over the past couple of years, I have been working on an advertiser owned cloud-hosted ad tech product that helps…

2 条评论
SDE Skills - Stepping into 2020

2019年12月29日

SDE Skills - Stepping into 2020

And that is a wrap! As we are stepping into 2020, I thought this is a good time to pause and reflect on 2019. SDE…

7 条评论
Our Little Community Initiative - 300+ Mock Interviews for GHC aspirants

2019年11月5日

Our Little Community Initiative - 300+ Mock Interviews for GHC aspirants

-Jointly written by Khushali & Vivek Hello, I am Khushali Desai, an Engineer at Walmart Labs. I co-lead the “Indian…

3 条评论
Hiring Senior Software Engineers

2019年8月16日

Hiring Senior Software Engineers

Just recently, I was thinking about how hard is it to hire Senior Software Engineers. I dug around a bit into the…

4 条评论
Practicing Behavioral Interviewing

2019年8月11日

Practicing Behavioral Interviewing

Why this session? Most interviews have a segment that focuses on your soft-skills. In addition to discovering your core…

1 条评论

See all articles

Introduction to Probabilistic Data Structures

Vivekanand Kirubanandan

Founder @ SDE Skills - Building a community to upskill software engineers | Investor | Engineering Leader

Data structures - Pt 1: Bloom Filters and more...

Quick sidebar on applying these during tech interviews

领英推荐

Data structures part 2: Counters

What Next?

Vivekanand Kirubanandan的更多文章

社区洞察

其他会员也浏览了

AI Will Actually Create More Jobs in Tech

Programming Languages for AI

Data Science with Cybersecurity

Marvelous MLOPs #42: My Story of Becoming an MLOps Engineer

Scala #EmployeeSpotlight Series: Tahir Imran

Unlocking the Power of Synthetic Data - How Python Faker Package Might be Changing the Game for Data Scientists

Do Software Engineers Actually Need to Know Algorithms?

If the CEO asks: How does DeepSeek-R1 work - a technology perspective

Immediate required: AI/ML Engineer with Python c2c jobs Austin, TX

Introducing PyPIM: A Python Interpreter

Data structures - Pt 1: Bloom Filters and more...

Quick sidebar on applying these during tech interviews

领英推荐

Data structures part 2: Counters

What Next?

Vivekanand Kirubanandan的更多文章

Do you know how to navigate the “phases” of a System Design Interview?

I am Engineering Manager who is Passively Looking, But No Bites - What Gives?

Cold start prep for a Engineering Manager Interviews

Transform your Thoughts into Words: Leveraging AI to Elevate Content Creation

Backtracking: An Effective Approach to Solve Complex Interview Questions

Announcing the launch of Conversions API Gateway Multiple Accounts

SDE Skills - Stepping into 2020

Our Little Community Initiative - 300+ Mock Interviews for GHC aspirants

Hiring Senior Software Engineers

Practicing Behavioral Interviewing

社区洞察

其他会员也浏览了

AI Will Actually Create More Jobs in Tech

Programming Languages for AI

Data Science with Cybersecurity

Marvelous MLOPs #42: My Story of Becoming an MLOps Engineer

Scala #EmployeeSpotlight Series: Tahir Imran

Unlocking the Power of Synthetic Data - How Python Faker Package Might be Changing the Game for Data Scientists

Do Software Engineers Actually Need to Know Algorithms?

If the CEO asks: How does DeepSeek-R1 work - a technology perspective

Immediate required: AI/ML Engineer with Python c2c jobs Austin, TX

Introducing PyPIM: A Python Interpreter