Data modelling

Data modelling


What is data modeling

Data modeling is the process of creating a simple diagram of a complex software system, using text and symbols to represent the way data will flow. The diagram can be used to ensure efficient use of data as a blueprint for the construction of new software or for reengineering a legacy application.

Typically, a data model can be thought of as a?flowchart?that illustrates the relationships among pieces of data. It enables?stakeholders?to identify errors and make changes before programming code is written. Alternatively, models can be created as part of?reverse-engineering?efforts that extract models from existing systems, as seen with?NoSQL?data.

Data modeling is an important skill for data scientists and others involved with?data analysis. Traditionally, data models were built during the analysis and design phases of a project to ensure that the requirements for a new application are understood. A data model can become the basis for building a more detailed data schema. Data models can also be used later in the?data lifecycle?to rationalize data designs that were originally created by programmers on an ad hoc basis.

Learn about the six steps in the data modeling process.
Types of data modeling

Data modelers often use multiple models to view the same data and ensure that all processes, entities, relationships and data flows have been identified. They initiate new data modeling projects by gathering requirements from business stakeholders.

THIS ARTICLE IS PART OF

What is data architecture? A data management blueprint
Which also includes:
5 principles of a well-designed data architecture
Data modeling vs. data architecture: What's the difference?
Data architecture vs. information architecture: How they differ
DOWNLOAD
1

Download this entire guide for FREE now!

There are three main types of data models:

Conceptual data model.?This approach is a high-level description of a database design that shows how data interrelates and what kind of data can be stored in the database. The intended audience for conceptual data models is the business side of an organization. The conceptual data model defines the?data structure?that the business requires. Once the conceptual data model is created, it can be refined and transferred into a logical data model.
Logical data model.?These models are used to create the database structure and describe the data from a technical perspective. The technical side of an organization uses logical data models as detailed representations of database designs. This data model serves as the basis for the?creation of a physical data model.
Physical data model.?This data model is specific to the application and database to be implemented. It is used to create the tables and?fields?that store database data. A physical data model describes a database design for a specific database management system (DBMS). Both the technical and business sides of an organization use this type of model.
Find out how the three approaches to data modeling work together to form the overall process.
Data modeling examples

Data modeling emerged in the 1960s as DBMSes became more popular. It enabled organizations to bring consistency,?repeatability?and disciplined development to data processing. Application users and programmers used the data model as a reference when communicating with database designers.

Some examples of data modeling approaches include the following.

Hierarchical data modeling

Hierarchical data models organize data in a treelike, one-to-many arrangement. This model originally replaced file systems in many popular use cases. IBM's?Information Management System?is an example of the hierarchical approach, which was widely used in businesses, especially banking. Although hierarchical data models were mostly superseded -- beginning in the 1980s -- by relational data models, the hierarchical method is used today in Extensible Markup Language and?geographic information systems.

Network data modeling

Network data models developed as a way to provide data designers with a broad conceptual view of their systems. For example, the Conference on Data Systems Languages, formed in the late 1950s, guided the development of a standard programming language that could be used across various types of computers.

Relational data modeling

The relational data model was proposed as an alternative to the hierarchical data model, which required a detailed understanding of the physical data storage employed. The relational data model does not require developers to define data paths.

Relational data modeling was first described in a 1970 technical paper by IBM researcher E.F. Codd. Codd's relational model set the stage for industry use of?relational databases, which use tables to connect data segments, as compared to the hierarchical model where data is implicitly joined together. Relational data modeling was coupled with?Structured Query Language, which gained a foothold in enterprise computing as an efficient means to process data.

Entity relationship modeling

Relational data modeling took another step forward as the use of entity relationship (ER) models became popular. ER models use diagrams to graphically depict the elements in a database and facilitate the understanding of underlying models.

With relational modeling, data types are determined and rarely changed over time. Entities, or objects, consist of?attributes. For example, an employee entity attribute could include last name, first name, years employed and so on. Relationships are visually mapped, providing a way to communicate data design objectives to participants in data development and maintenance. Over time, data architects adopted modeling tools, such as Idera's ER/Studio, Erwin Data Modeler and SAP PowerDesigner, for designing systems.

Object-oriented modeling

As?object-oriented programming?advanced in the 1990s, object-oriented modeling gained traction as another way to design systems. Object-oriented approaches are similar to ER methods, but they differ because they focus on object abstractions of real-world entities.

Objects are grouped in class hierarchies, and the objects can inherit attributes and methods from parent classes. This inheritance trait provides some advantages compared with ER modeling; it ensures data integrity and supports complex data relationships. At the same time, data models emerged for data warehousing needs. Notable examples are?snowflake schema?and?star schema?dimensional models.

NEO4J

Graph data models will be familiar to programmers who have worked with ER diagrams and object models, as well as to grade school students who have parsed sentences.
Graph data modeling

An offshoot of hierarchical and network data modeling is the property graph model, which, together with?graph databases, is increasingly used for describing complex relationships within data sets. It is popular in social media,?recommender?and fraud detection applications.

Using the graph data model, designers describe their system as a connected graph of nodes and relationships. Graph data models can be used for text analysis and to create models that uncover relationships among data points within documents.

Learn more about the data modeling techniques and the?challenges businesses face with the data modeling process?from author and data modeling instructor Steve Hoberman.

This was last updated in?June 2021
Continue Reading About data modeling
Data model design tips to help standardize business data
How to choose the right database to fit your data model(s)
The death of the data model?
How to unlock the true value of data
Cleveland Clinic's COVID-19 strategy driven by data modeling
Related Terms
cellular automaton (CA)
A cellular automaton (CA) is a collection of cells arranged in a grid of specified shape, such that each cell changes state as a ...?See?complete?definition
schema
In computer programming, a schema (pronounced SKEE-mah) is the organization or structure for a database, while in artificial ...?See?complete?definition
What is data architecture? A data management blueprint
Data architecture is a discipline that documents an organization's data assets, maps how data flows through its systems and ...?See?complete?definition
Dig Deeper on Data modeling tools and techniques
ArangoDB advances graph database fortunes with new funding
By:?Sean?Kerner
multimodel database
By:?Jack?Vaughan
database (DB)
By:?Ben?Lutkevich
7 data modeling techniques and concepts for business
By:?Rick?Sherman

?        

要查看或添加评论,请登录

Darshika Srivastava的更多文章

  • LGD Model

    LGD Model

    Loss Given Default (LGD) models play a crucial role in credit risk measurement. These models estimate the potential…

  • CCAR ROLE

    CCAR ROLE

    What is the Opportunity? The CCAR and Capital Adequacy role will be responsible for supporting the company’s capital…

  • End User

    End User

    What Is End User? In product development, an end user (sometimes end-user)[a] is a person who ultimately uses or is…

  • METADATA

    METADATA

    WHAT IS METADATA? Often referred to as data that describes other data, metadata is structured reference data that helps…

  • SSL

    SSL

    What is SSL? SSL, or Secure Sockets Layer, is an encryption-based Internet security protocol. It was first developed by…

  • BLOATWARE

    BLOATWARE

    What is bloatware? How to identify and remove it Unwanted pre-installed software -- also known as bloatware -- has long…

  • Data Democratization

    Data Democratization

    What is Data Democratization? Unlocking the Power of Data Cultures For Businesses Data is a vital asset in today's…

  • Rooting

    Rooting

    What is Rooting? Rooting is the process by which users of Android devices can attain privileged control (known as root…

  • Data Strategy

    Data Strategy

    What is a Data Strategy? A data strategy is a long-term plan that defines the technology, processes, people, and rules…

  • Product

    Product

    What is the Definition of Product? Ask a few people that question, and their specific answers will vary, but they’ll…

社区洞察

其他会员也浏览了