登录查看更多内容

Interview Cheat Sheet: When to use which Data Structure?

Prateek Tiwari

Senior Data Engineer || Python, SQL, Spark, Pyspark, AWS/Azure|| Big Data & Cloud Solutions || ETL Pipeline & Cloud Optimization || Writer || Ex- Infoscion

发布日期: 2024年3月8日

Sometimes, we get confuse when to use which data structures using interviews it can be overwhelming to think about which data structure you need to use. We will briefly discuss the use cases for the various categories of data structures.

A data structure is a collection of data values, the relationships among them, and the functions or operations that can be applied to the data.

Lists are generally used when you need to just store data in some ordered manner. Runtime isn’t too important when you’re using a list. The two main types of lists are Linked Lists and Array Lists.

LinkedList

get() = O(N)
add() = Θ(1)
remove() = Θ(N)
space = O(N)

Array List

get() = O(1)
add() = Θ(N)
remove() = Θ(N)
space = O(N)

Sets

are generally used when you have unordered data. In sets, you are not allowed to have any duplicates and you can only store keys! Essentially, sets are just maps with some dummy value. Data Structures that use sets are used when you just need to store elements.

Maps

are data structures that store a key and a value. If you insert a key value pair into a map where that key already exists, you replace that key’s value with the value of the key value pair you are inserting. You declare Maps in the following manner:

Map<Key type, Value type>
In maps, you have an immutable key and values that can be any data type. Frequently, maps can be used
to see how frequently a key turns up.

Trees

These can be used whenever the keys in questions are implementing comparable. Trees can be used when you are attempting to find some sort of order. Specific types of self balancing trees are 2–3 Trees and Left Leaning Red Black Trees. You would choose to use trees over hashing when the key is hard to hash. This can occur when a key is very long, and would take a lot of time to perform arithmetic on it or analyze it.

Binary Search Tree

search() = Average: O(logN) Worst Case: O(N)
insert() = Average: O(logN) Worst Case: O(N)
delete() = Average: O(logN) Worst Case: O(N)
space = O(N)

Balanced Search Trees (2–3 Trees and Left Leaning Red Black Trees)

get() = Θ(log N)
insert() = Θ(log N)
remove() = Θ(log N)
space = O(N)

Hashing data structures

are used when you need Θ(1) runtime for all operations assuming that you have a good hash code. You would also want to use hashing data sets when your data is unordered.

领英推荐

Coffee & Data - First Edition - January 2025

Tommaso Lucentini 2 个月前

Commonly Asked Data Structure Interview Questions

StrataScratch 6 个月前

Excel Interview Questions For Data Analyst: Must-Know…

Ze Learning Labb 6 个月前

Stacks

These are used when you want to look at the most recent thing that has been done. Stacks are known as first in last out data types (FILO). This can be applied to search history, delivery items etc. Stacks are often used in depth first search in trees and also graphs, which we will go over in the next chapter.

Stack

push() = Θ(1)
pop() = Θ(1)
peek() = Θ(1)
space() = O(N)

Queues

These are used when we need to view items by the order in which they were done. Queues are known as first in last out data types (FIFO). In the real world, this is used when we have to keep track of waitlists. Additionally Queues are used for breadth first search in trees as well as graphs.

Queue

add() = Θ(1)
remove() = Θ(1)
peek() = Θ(1)

Heaps or Priority Queues

These can be used whenever the keys we are examining implement Comparable. Anytime you need the most or least of something, use a Heap. These are not for overall order, unless it’s taking out one item at a time.

Heap

search() = Θ(1)
insert() = Average: O(1) Worst Case: O(log N)
delete() = O(log(N))
peek() = O(1)
space = O(N)

Tries

are used when you want to perform searches and insertions using prefixes. There are 2 types of tries, normal tries and ternary search tries. Normal tries have a very fast speed but in exchange they take up a lot of space. Ternary search tries on the other hand are relatively slow but take up much less space.

Tries

M is the length of the string you are attempting to insert/search for, N is amount of keys, L is
the length of the longest key, and R is the size of the alphabet.
search() = Θ(M)
insert() = Θ(M)
space = O(N ?L?R)

2. Ternary Search Tries

N is the amount of keys and L is the length of the longest key
search() = Θ(N)
insert() = Θ(N)
space = O(N ?L)

Final Thoughts:

A cheat sheet for the time complexities of the data structure operations and it can be overwhelming to think about which data structure you need to use.

I hope you found this article useful as a simple introduction to data structures. I would love to hear your thoughts.

With that in mind, I am going to end this article if you like it, please like and subscribe.

LastBrainCell

898 位关注者

要查看或添加评论，请登录

Prateek Tiwari的更多文章

11 Must-Know SQL String Functions in Python for Data Analysts and Engineer's

2024年5月27日

11 Must-Know SQL String Functions in Python for Data Analysts and Engineer's

Whether you're working with big data or just cleaning up a dataset, mastering string functions can greatly enhance your…
What is Slowly Changing Dimensions in Data Engineering: A Comprehensive Guide

2024年5月26日

What is Slowly Changing Dimensions in Data Engineering: A Comprehensive Guide

In the ever-evolving landscape of data engineering, managing change is both an art and a science. One of the critical…
Mastering Data Engineering: 5 Best Practices, Essential Tools, and Top Resources

2024年5月17日

Mastering Data Engineering: 5 Best Practices, Essential Tools, and Top Resources

In today's data-driven world, the role of data engineering has become pivotal. Businesses are constantly seeking ways…

1 条评论
Stop Using SELECT DISTINCT : Boost Your SQL Performance

2024年5月5日

Stop Using SELECT DISTINCT : Boost Your SQL Performance

In the realm of SQL queries, SELECT DISTINCT has long been the go-to method for retrieving unique values. However…
SQL Query Performance

2024年4月23日

SQL Query Performance

In the realm of database management, optimizing SQL query performance is both an art and a science. As databases grow…
Advanced SQL: Power of Conditional Aggregation

2024年4月5日

Advanced SQL: Power of Conditional Aggregation

Conditional aggregation is a game-changer in the world of SQL analysis. It empowers you to go beyond basic…
What is Apache Spark ?

2024年4月2日

What is Apache Spark ?

Introduction: In today’s data-driven world, where organizations grapple with ever-expanding volumes of data, Apache…

1 条评论
Mastering SQL Window Functions for Powerful Data Analysis : ROW_NUMBER, RANK, and DENSE_RANK

2024年3月21日

Mastering SQL Window Functions for Powerful Data Analysis : ROW_NUMBER, RANK, and DENSE_RANK

SQL window functions are a game-changer for data analysts and scientists. They enable you to perform calculations and…
15 Must-Know SQL Functions for Data Analyst

2024年3月18日

15 Must-Know SQL Functions for Data Analyst

String data is a fundamental building block in many databases. As a data scientist or analyst, efficiently wrangling…
?? 10 Advanced SQL Queries Every Data Analyst Should Master ??

2024年3月14日

?? 10 Advanced SQL Queries Every Data Analyst Should Master ??

In the world of data analytics and data science, proficiency in SQL (Structured Query Language) is crucial for…

See all articles

Interview Cheat Sheet: When to use which Data Structure?

Prateek Tiwari

Senior Data Engineer || Python, SQL, Spark, Pyspark, AWS/Azure|| Big Data & Cloud Solutions || ETL Pipeline & Cloud Optimization || Writer || Ex- Infoscion

Sometimes, we get confuse when to use which data structures using interviews it can be overwhelming to think about which data structure you need to use. We will briefly discuss the use cases for the various categories of data structures.

LinkedList

Array List

Sets

Maps

Trees

Binary Search Tree

Balanced Search Trees (2–3 Trees and Left Leaning Red Black Trees)

Hashing data structures

领英推荐

Stacks

Stack

Queues

Queue

Heaps or Priority Queues

Heap

Tries

Tries

2. Ternary Search Tries

Final Thoughts:

With that in mind, I am going to end this article if you like it, please like and subscribe.

LastBrainCell

898 位关注者

Prateek Tiwari的更多文章

社区洞察

其他会员也浏览了

How To Identify A Good/Bad Data Scientist In A Job Interview?

Data Analyst Interview Questions For Experienced: Most-Asked Technical Interview Questions For Experienced Data Analyst | Experienced Data Analyst Int

Data Collection Tools Comparison: Finding the Right Fit for Your Needs

Cracking Data Analyst Behavioral Interviews

The Art of Data Analysis: Parnikaa's Approach to Solving B2B Production Challenges

Qualitative Data

Episode 5: Redefining the Hiring Process: AI-Powered Recruitment Using Natural Selection and Genetic Algorithms

Essential Interview Preparation for Data Analysts and Data Scientists

Acing your take home data science interview exercises!

Unifying Insights: The Power of a Comprehensive Qualitative Research Repository

Sometimes, we get confuse when to use which data structures using interviews it can be overwhelming to think about which data structure you need to use. We will briefly discuss the use cases for the various categories of data structures.

LinkedList

Array List

Sets

Maps

Trees

Binary Search Tree

Balanced Search Trees (2–3 Trees and Left Leaning Red Black Trees)

Hashing data structures

领英推荐

Stacks

Stack

Queues

Queue

Heaps or Priority Queues

Heap

Tries

Tries

2. Ternary Search Tries

Final Thoughts:

With that in mind, I am going to end this article if you like it, please like and subscribe.

LastBrainCell

898 位关注者

Prateek Tiwari的更多文章

11 Must-Know SQL String Functions in Python for Data Analysts and Engineer's

What is Slowly Changing Dimensions in Data Engineering: A Comprehensive Guide

Mastering Data Engineering: 5 Best Practices, Essential Tools, and Top Resources

Stop Using SELECT DISTINCT : Boost Your SQL Performance

SQL Query Performance

Advanced SQL: Power of Conditional Aggregation

What is Apache Spark ?

Mastering SQL Window Functions for Powerful Data Analysis : ROW_NUMBER, RANK, and DENSE_RANK

15 Must-Know SQL Functions for Data Analyst

?? 10 Advanced SQL Queries Every Data Analyst Should Master ??

社区洞察

其他会员也浏览了

How To Identify A Good/Bad Data Scientist In A Job Interview?

Data Analyst Interview Questions For Experienced: Most-Asked Technical Interview Questions For Experienced Data Analyst | Experienced Data Analyst Int

Data Collection Tools Comparison: Finding the Right Fit for Your Needs

Cracking Data Analyst Behavioral Interviews

The Art of Data Analysis: Parnikaa's Approach to Solving B2B Production Challenges

Qualitative Data

Episode 5: Redefining the Hiring Process: AI-Powered Recruitment Using Natural Selection and Genetic Algorithms

Essential Interview Preparation for Data Analysts and Data Scientists

Acing your take home data science interview exercises!

Unifying Insights: The Power of a Comprehensive Qualitative Research Repository