课程: End-to-End Real-World Data Engineering Project with Databricks

免费学习该课程!

今天就开通帐号,24,600 门业界名师课程任您挑!

Data lakehouse: High-level solution

Data lakehouse: High-level solution

- [Instructor] In order to get into our project, we need to make sure that we have at least a high-level understanding of data lake house. A data lake house combines the best features of data lakes and data warehouses. It offers the flexibility and scalability of data lake. But side by side, it also offers the capability of data management and asset transactions of a data warehouse. So in short, you keep your data into the data lake, but you can do all the operations, which you can do in the data warehouses. That's make it very suitable for a variety of data loads and help you to do either as SQL way or an ML way. Our solution for global retail have a three layer architecture. The bronze layer that is for a raw ingestion. Silver layer for clean and conform data. And the gold layer for the business level aggregates. Let's take a little deep dive into each layer. The bronze layer is going to be our raw layer. Here, we are going to keep the data as it is as we received. We are not going…

内容