登录查看更多内容

(part 4) (5 mins reading)(MongoDB) Primary modeling patterns and Relationships

Ezat Elzalouy

Software Engineer @ Mismar | Javascript | Typescript | SQL - NOSQL | 5 years exp

发布日期: 2023年5月5日

From this article onwards, we will use this updated?mind map?as a measure of our progress in all future articles. I have made improvements to the mind map, and we will now be using it consistently going forward.

Relationships

In any database you will build, you find yourself selecting a relationship to build your related data like One-to-Many, One-to-One, or Many-to-Many. So in mongoDb To model relationships, there are three main approaches: Embedding, Referencing, and a Hypred approach that combines both and we talked about the difference between them in the previous articles. if you don't know the difference just click here.

It's not a Rule for matching between the approach and the relationship but it's better in most of the Database domains we see in our systems.

so Embedding is best suited for one-to-one and one-to-many relationships, where the related data is small and doesn't change frequently.

But Referencing involves storing data references to related data in separate documents, which are then quired separately. This approach is best suited for many-to-many relationships. Where the related data is large and frequently changing.

you can read this article to know more about where to use Embedding or Referencing, Where this article builds upon the previous articles.

One-to-One

Let's say we have these two domains

user data with Address data.

// user documen
{
   _id: "joe",
   name: "Joe Bookreader"
}

// address document
{
   patron_id: "joe", // reference to patron document
   street: "123 Fake Street",
   city: "Faketon",
   state: "MA",
   zip: "12345"
}

Movies with multi sub-documents like this

领英推荐

Maximize Your POI Data Insights with Foursquare Places…

Foursquare 2 年前

Production-Grade LLM Applications that React to Your…

Rohan Paul 9 个月前

A Revolution in Analytical Technology

Tom Davenport 7 年前

{
? "_id": 1,
? "title": "The Arrival of a Train",
? "year": 1896,
? "runtime": 1,
? "released": ISODate("01-25-1896"),
? "poster": "https://ia.media-imdb.com/images/M/MV5BMjEyNDk5MDYzOV5BMl5BanBnXkFtZTgwNjIxMTEwMzE@._V1_SX300.jpg",
? "plot": "A group of people are standing in a straight line along the platform of a railway station, waiting for a train, which is seen coming at some distance. When the train stops at the platform, ...",
? "fullplot": "A group of people are standing in a straight line along the platform of a railway station, waiting for a train, which is seen coming at some distance. When the train stops at the platform, the line dissolves. The doors of the railway-cars open, and people on the platform help passengers to get off.",
? "lastupdated": ISODate("2015-08-15T10:06:53"),
? "type": "movie",
? "directors": [ "Auguste Lumière", "Louis Lumière" ],
? "imdb": {
? ? "rating": 7.3,
? ? "votes": 5043,
? ? "id": 12
? },
? "countries": [ "France" ],
? "genres": [ "Documentary", "Short" ],
? "tomatoes": {
? ? "viewer": {
? ? ? "rating": 3.7,
? ? ? "numReviews": 59
? ? },
? ? "lastUpdated": ISODate("2020-01-09T00:02:53")
? }
}

What is the problem?

In the first Data model, while our application needs to retrieve all the data of the user at one query, we could use the One-to-one pattern with the Embedded document pattern. While the address data is frequently retrieved with user information. Then Referencing approach here can cause un-useful multiple queries to resolve the reference. On the other hand, using the Embedding pattern will make the reading queries faster and will achieve the application demands.

But second data model, The application demands are different and we need to show only an overview of the movie. The Embedded pattern here can cause a reading issue. This unnecessary data can cause extra load on your server and slow down read operations and let's imagine you have millions of Movies documents.?So you can use SubSet to store the most accessed data in a separate collection.

The subset pattern implementation depends on separating the collection into two collections with a One-to-One relationship, the first one for the frequently accessed data and the second one for unnecessary data:

Movie most Frequently accessed data

{
  "_id": 1,
  "title": "The Arrival of a Train",
  "year": 1896,
  "runtime": 1,
  "released": ISODate("1896-01-25"),
  "type": "movie",
  "directors": [ "Auguste Lumière", "Louis Lumière" ],
  "countries": [ "France" ],
  "genres": [ "Documentary", "Short" ],
}

Movie Details

{
  "_id": 156,
  "movie_id": 1, // reference to the movie collection
  "poster": "https://ia.media-imdb.com/images/M/MV5BMjEyNDk5MDYzOV5BMl5BanBnXkFtZTgwNjIxMTEwMzE@._V1_SX300.jpg",
  "plot": "A group of people are standing in a straight line along the platform of a railway station, waiting for a train, which is seen coming at some distance. When the train stops at the platform, ...",
  "fullplot": "A group of people are standing in a straight line along the platform of a railway station, waiting for a train, which is seen coming at some distance. When the train stops at the platform, the line dissolves. The doors of the railway-cars open, and people on the platform help passengers to get off.",
  "lastupdated": ISODate("2015-08-15T10:06:53"),
  "imdb": {
    "rating": 7.3,
    "votes": 5043,
    "id": 12
  },
  "tomatoes": {
    "viewer": {
      "rating": 3.7,
      "numReviews": 59
    },
    "lastUpdated": ISODate("2020-01-29T00:02:53")
  }

Using smaller documents containing more frequently-accessed data reduces the overall size of the working set. These smaller documents result in improved reading performance and make more memory available for the application.

However, it is important to understand your application and the way it loads data. If you split your data into multiple collections improperly, your application will often need to make multiple trips to the database and rely on?JOIN?operations to retrieve all of the data that it needs.

要查看或添加评论，请登录

Ezat Elzalouy的更多文章

(part 5)(5 mins Reading) Primary modeling patterns and relationships.

2023年6月4日

(part 5)(5 mins Reading) Primary modeling patterns and relationships.

In the previous article, we were provided with a clear explanation of the fundamental concepts utilized here. However…
(part 3) (5 min read) Which MongoDB Data Modeling concepts is essential to know?

2023年4月28日

(part 3) (5 min read) Which MongoDB Data Modeling concepts is essential to know?

In our previous article, we discussed four crucial concepts that one should consider when constructing a database…
(part 2) (5 min read) Which MongoDB Data Modeling concepts is essential to know?

2023年4月26日

(part 2) (5 min read) Which MongoDB Data Modeling concepts is essential to know?

I have developed a new mind map chart that will aid us in maintaining our focus on the core ideas that we are…
Which MongoDB concepts and queries are essential to know? (part 1)?

2023年4月14日

Which MongoDB concepts and queries are essential to know? (part 1)?

This is my inaugural article on this platform. I hope that it proves to be beneficial for all of you.

2 条评论

(part 4) (5 mins reading)(MongoDB) Primary modeling patterns and Relationships

Ezat Elzalouy

Software Engineer @ Mismar | Javascript | Typescript | SQL - NOSQL | 5 years exp

Relationships

One-to-One

领英推荐

What is the problem?

However, it is important to understand your application and the way it loads data. If you split your data into multiple collections improperly, your application will often need to make multiple trips to the database and rely on?JOIN?operations to retrieve all of the data that it needs.

Ezat Elzalouy的更多文章

社区洞察

其他会员也浏览了

Data, meet Graph: Kubrick Partners with Neo4j

End-to-end RAG application with source retriveal on Databricks Platform

Understanding Databases like Graph, Vector, and Relational Databases with Real-World Examples

Diving into the Deep End of RDF: OWL, SHACL, and SPARQL, vs TerminusDB data products

Databricks in 2024 vs. 2025: Growth, Challenges, and What’s Next

Shallow vs. Deep Pagination in GraphQL:

Generating 1 Billion Rows of Complex Synthetic Data ??

Unlocking The Power Of Semantic Search with Weaviate

Elasticsearch Was Great, But Graph RAG is the Future

Turning business language into data insights within your enterprise

Relationships

One-to-One

领英推荐

What is the problem?

However, it is important to understand your application and the way it loads data. If you split your data into multiple collections improperly, your application will often need to make multiple trips to the database and rely on?JOIN?operations to retrieve all of the data that it needs.

Ezat Elzalouy的更多文章

(part 5)(5 mins Reading) Primary modeling patterns and relationships.

(part 3) (5 min read) Which MongoDB Data Modeling concepts is essential to know?

(part 2) (5 min read) Which MongoDB Data Modeling concepts is essential to know?

Which MongoDB concepts and queries are essential to know? (part 1)?

社区洞察

其他会员也浏览了

Data, meet Graph: Kubrick Partners with Neo4j

End-to-end RAG application with source retriveal on Databricks Platform

Understanding Databases like Graph, Vector, and Relational Databases with Real-World Examples

Diving into the Deep End of RDF: OWL, SHACL, and SPARQL, vs TerminusDB data products

Databricks in 2024 vs. 2025: Growth, Challenges, and What’s Next

Shallow vs. Deep Pagination in GraphQL:

Generating 1 Billion Rows of Complex Synthetic Data ??

Unlocking The Power Of Semantic Search with Weaviate

Elasticsearch Was Great, But Graph RAG is the Future

Turning business language into data insights within your enterprise