Workshop - 08/03/24

Building Modern Data Platform for Analytics

I want to thank my friend Ed Pollack for asking me to speak at SQL Saturday Albany. This full-day pre-conference workshop is a bargain for professionals who want to learn about the Modern Data Platform (MDP). You get breakfast, lunch, and my half decade of experience creating Modern Data Platforms in Azure for clients who want to do analytics.


Here are the details that will be covered that day. Please sign up for the workshop using Eventbrite. Hope to see you this summer!


Many companies are placing their corporate information into data lakes in the cloud. Since cloud storage is cheap, the amount of data stored in the lake can easily exceed the amount of data seen in a typical relational database. Regardless of the types of files in the data lake, there is always a need to transform the raw data files into refined data files for analytics, machine learning, and/or AI.


The Delta Lakehouse design uses a medallion (bronze, silver, and gold) architecture for data quality. We can abstract the read and write actions in Spark to create dynamic notebooks that process data files. Data pipelines can be used to bring remote data into the lake as well as orchestrate data processing. A metadata-driven design allows the inputs to the dynamic notebooks to be stored in a central place.
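The metadata-driven pattern above can be sketched in plain Python. In a real Spark notebook the read and write functions would wrap `spark.read` and `DataFrame.write`; here they are injected as callables so the dispatch logic stands alone. The names `EntityConfig`, `process_entity`, and the sample `"sales"` entry are hypothetical, not part of the workshop materials.

```python
# A minimal sketch of a metadata-driven dynamic notebook.
# The read/write functions are injected, standing in for
# spark.read / DataFrame.write in a real Spark environment.

from dataclasses import dataclass
from typing import Callable

@dataclass
class EntityConfig:
    source_path: str      # bronze (raw) location
    target_path: str      # silver (refined) location
    file_format: str      # e.g. "csv" or "parquet"
    transform: Callable[[list], list]

# Central metadata store: one entry per entity, kept in one place
# so the same generic notebook can process any registered source.
METADATA = {
    "sales": EntityConfig(
        source_path="bronze/sales",
        target_path="silver/sales",
        file_format="csv",
        # refinement rule: drop rows with non-positive amounts
        transform=lambda rows: [r for r in rows if r.get("amount", 0) > 0],
    ),
}

def process_entity(name: str, read_fn, write_fn) -> int:
    """Generic bronze -> silver step driven entirely by metadata."""
    cfg = METADATA[name]
    rows = read_fn(cfg.source_path, cfg.file_format)
    refined = cfg.transform(rows)
    write_fn(refined, cfg.target_path)
    return len(refined)
```

The point of the design is that adding a new source means adding a metadata row, not writing a new notebook.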


The most important part of a modern data platform is security. Microsoft Entra, formerly known as Azure Active Directory, can be used to secure the files in storage. This security layer is used in both the Apache Spark and Serverless SQL pools. Designers use a variety of tools for reporting. The Serverless SQL Pool turns data lake files into read-only database tables. While the demos in this course are Azure specific, the concepts can be used with any cloud service.


Lessons:

  1. Infrastructure deployment (storage, key vault, Databricks, Synapse)
  2. Create a service principal for services
  3. Create medallion zones + assign rights
  4. Introduction to Data Factory pipelines
  5. How to create a hybrid design
  6. Working with different sources (databases, file shares, REST APIs)
  7. Hard-coded vs metadata-driven design
  8. Full vs incremental load patterns
  9. Configuring clusters + storage for security
  10. Writing data engineering notebooks
  11. Orchestrating pipelines with Data Factory
  12. Creating a presentation layer with Synapse Serverless Pools
  13. Connecting to Synapse with Power BI

John Miner

Data Architect at Insight

8 months ago

There are fewer than 24 days left before the pre-conference sessions at SQL Saturday Albany. Don't miss out on training from IT professionals who have been using the technology for years. The cost includes training and food. Hope to see you in my class!

