Data Warehouse

Data Warehouse

A?Data Warehousing?(DW) is process for collecting and managing data from varied sources to provide meaningful business insights. A Data warehouse is typically used to connect and analyze business data from heterogeneous sources. The data warehouse is the core of the BI system which is built for data analysis and reporting.

The decision support database (Data Warehouse) is maintained separately from the organization’s operational database. However, the data warehouse is not a product but an environment. It is an architectural construct of an information system which provides users with current and historical decision support information which is difficult to access or present in the traditional operational data store.

Data warehouse system is also known by the following name:

  • Decision Support System (DSS)
  • Executive Information System
  • Management Information System
  • Business Intelligence Solution
  • Analytic Application
  • Data Warehouse

How Datawarehouse works?

A Data Warehouse works as a central repository where information arrives from one or more data sources. Data flows into a data warehouse from the transactional system and other relational databases.

Data may be:

  1. Structured
  2. Semi-structured
  3. Unstructured data

The data is processed, transformed, and ingested so that users can access the processed data in the Data Warehouse through Business Intelligence tools, SQL clients, and spreadsheets. A data warehouse merges information coming from different sources into one comprehensive database.

By merging all of this information in one place, an organization can analyze its customers more holistically. This helps to ensure that it has considered all the information available. Data warehousing makes data mining possible. Data mining is looking for patterns in the data that may lead to higher sales and profits.

Types of Data Warehouse

Three main types of Data Warehouses (DWH) are:

1. Enterprise Data Warehouse (EDW):

Enterprise Data Warehouse (EDW) is a centralized warehouse. It provides decision support service across the enterprise. It offers a unified approach for organizing and representing data. It also provide the ability to classify data according to the subject and give access according to those divisions.

2. Operational Data Store:

Operational Data Store, which is also called ODS, are nothing but data store required when neither Data warehouse nor OLTP systems support organizations reporting needs. In ODS, Data warehouse is refreshed in real time. Hence, it is widely preferred for routine activities like storing records of the Employees.

3. Data Mart:

A data mart is a subset of the data warehouse. It specially designed for a particular line of business, such as sales, finance, sales or finance. In an independent data mart, data can collect directly from sources.

Components of Data warehouse

Four components of Data Warehouses are:

Load manager:?Load manager is also called the front component. It performs with all the operations associated with the extraction and load of data into the warehouse. These operations include transformations to prepare the data for entering into the Data warehouse.

Warehouse Manager:?Warehouse manager performs operations associated with the management of the data in the warehouse. It performs operations like analysis of data to ensure consistency, creation of indexes and views, generation of denormalization and aggregations, transformation and merging of source data and archiving and baking-up data.

Query Manager:?Query manager?is also known as backend component. It performs all the operation operations related to the management of user queries. The operations of this Data warehouse components are direct queries to the appropriate tables for scheduling the execution of queries.

End-user access tools:

This is categorized into five different groups like 1. Data Reporting 2. Query Tools 3. Application development tools 4. EIS tools, 5. OLAP tools and data mining tools.


要查看或添加评论,请登录

NISHI KUMARI的更多文章

  • Amazon SageMaker

    Amazon SageMaker

    Amazon SageMaker is a fully managed machine learning (ML) service provided by Amazon Web Services (AWS). It enables…

  • What is SharePoint?

    What is SharePoint?

    SharePoint is a web-based collaborative platform developed by Microsoft, launched in 2001. It is primarily used for web…

  • What is Data Pipeline?

    What is Data Pipeline?

    A data pipeline is a series of processes and tools designed to collect, process, and deliver data from various sources…

  • What is Azure Logic Apps?

    What is Azure Logic Apps?

    Azure Logic Apps, from Microsoft Azure, is a cloud-based Platform-as-a-Service (PaaS) that is used to automate tasks…

  • What is Power Automate

    What is Power Automate

    Microsoft Power Automate is a comprehensive cloud-based automation platform designed to streamline and optimize…

  • Campaign Optimization Techniques

    Campaign Optimization Techniques

    Campaign optimization is a crucial aspect of any marketing strategy, whether it be for a small business or a…

  • What is Account Management?

    What is Account Management?

    Account management is a post-sales role that focuses on nurturing client relationships. Account managers have two…

  • What is Product Analytics?

    What is Product Analytics?

    Product analytics is the process of collecting and studying data on how people use your product. It tracks user…

  • Econometrics

    Econometrics

    Econometrics is the use of statistical and mathematical models to develop theories or test existing hypotheses in…

  • What is CRUD?

    What is CRUD?

    CRUD refers to the four basic operations a software application should be able to perform – Create, Read, Update, and…

社区洞察

其他会员也浏览了