Exploring Fabric: putting Microsoft's new analytics platform to the test
Can we really do end-to-end analytics in Microsoft Fabric?

It's been a couple of days since Microsoft let the world know about Fabric, its new unified, end-to-end data analytics platform. It was quite the announcement! Fabric's proposition is bold. Shiny visuals and slick presentations have made data practitioners excited and curious, myself included. But a shiny new analytics platform is like a shiny new car: you test-drive it before you buy. In this post I share my first hands-on experience.

Unification?

Before diving into the tool, let's first understand the problem Microsoft aims to solve. Core to Fabric's value proposition is delivering a unified, end-to-end experience: no need to use any other analytics tool, because all the functionality is available within a single platform. That sounds lovely, as today's reality is very different. The number of specialized tools is enormous, and data engineering often feels more like systems engineering. Just have a look at the MAD (ML/AI/Data) Landscape for 2023 and you'll understand. The added value of a data team would be so much bigger if it could focus on building data products rather than integrating different tools.

The number of specialized tools is enormous, and data engineering often feels more like systems engineering.
Fabric is all about unification

Free trial :)

Fabric is in public preview, and Microsoft offers a free trial. Great! It's easy enough to sign up and activate your trial.

You get a 60-day trial and a fixed amount of capacity to explore Fabric

Exploring end-to-end capabilities

More than anything, I'm curious whether Fabric can deliver on its promise of unification. End-to-end analytics requires many different ingredients, and I explored some of them:

  • data ingestion,
  • visualization (reporting),
  • MLOps.

I will now walk through the steps I took and share relevant insights.

Data ingestion

I first created a lakehouse using Fabric's UI.

Creating a lakehouse in Fabric

Next step: getting data into the lakehouse. I chose to work with the popular diabetes sample dataset.

Getting data in with a new data pipeline
Choosing the diabetes sample data
A Data Factory pipeline gets created automatically
After running the pipeline the diabetes data was in my lakehouse

Well, that was simple enough. All UI work, not a single line of code written.
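
For comparison, the same ingestion could also be done with a few lines of code in a Fabric notebook. The snippet below is not what the pipeline generated for me, just a minimal PySpark sketch: it assumes the notebook is attached to the lakehouse and that the diabetes CSV has been uploaded to a hypothetical path under the Files area.

# Minimal sketch of code-based ingestion (file path is hypothetical).
# In a Fabric notebook, "spark" is a ready-to-use SparkSession.
df = (
    spark.read
    .option("header", True)        # first row holds column names
    .option("inferSchema", True)   # let Spark infer numeric types
    .csv("Files/raw/diabetes.csv") # hypothetical location in the Files area
)

# Save as a Delta table so it appears under Tables/ in the lakehouse
df.write.mode("overwrite").format("delta").saveAsTable("diabetes")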

Visualization (reporting)

Getting data in was as simple as ABC. Now for the visualization. With Power BI taking a prominent place in Fabric (I heard it's the Power BI team that leads the Fabric efforts, maybe that has to do with it), I expected this to be a first-class experience. I was right. For every lakehouse you make, a Power BI "dataset" gets created automatically. After a few clicks you're in the Power BI interface, ready to build your report.

Creating a Power BI report using data in the lakehouse

MLOps

So far so good: Fabric made ingestion and visualization super easy. What about Machine Learning? After all, I didn't load the diabetes data just to create some simple charts. I found some "Data Science" functionality in the UI.

Seems like Fabric has ML functionality as well; I clicked "Model (Preview)"
Then "Start with a template"
A notebook with boilerplate code pops up

Aha, there's some actual code! This marks the end of my UI-only experience. I'm not surprised that a more advanced use case like Machine Learning isn't fully captured by a user interface (yet). I'm also happy that Fabric still lets me do what I like most: writing code. Another delight: Fabric's live Spark pools really work...no waiting for a pool to start!

I had to change the code a little to load the diabetes data from the lakehouse into Spark, run an experiment, and train and register a model. Microsoft uses the open-source MLflow framework to manage the ML lifecycle within Fabric. The results can be inspected interactively.

Fabric integrates MLflow into its platform
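
To give an idea of what the notebook ends up doing, here is a minimal sketch of the training code. It is not the exact template code: I assume the diabetes table from the ingestion step, a target column named "Y", a plain scikit-learn regressor, and arbitrary experiment and model names.

import mlflow
import mlflow.sklearn
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# Pull the Delta table written during ingestion into pandas for scikit-learn
data = spark.read.table("diabetes").toPandas()

# "Y" is assumed to be the numeric target column in the sample data
X = data.drop(columns=["Y"])
y = data["Y"]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

mlflow.set_experiment("diabetes-experiment")  # experiment name is arbitrary

with mlflow.start_run():
    model = Ridge(alpha=1.0)
    model.fit(X_train, y_train)

    # Track a simple quality metric for the run
    mse = mean_squared_error(y_test, model.predict(X_test))
    mlflow.log_metric("mse", mse)

    # Log the model and register it in one call (model name is arbitrary)
    mlflow.sklearn.log_model(model, "model", registered_model_name="diabetes-model")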

The last step was to load my newly created model and use it to make predictions against the diabetes data.

Loading and applying the MLflow model
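
Again a minimal sketch rather than the exact code I ran: it loads one version of the registered model with plain MLflow and scores the diabetes table. The model name, version number, and target column carry over from the assumptions in the training sketch above.

import mlflow.sklearn

# Load a specific version of the registered model (version 1 is hypothetical)
loaded_model = mlflow.sklearn.load_model("models:/diabetes-model/1")

# Score the same diabetes data that was used for training
data = spark.read.table("diabetes").toPandas()
predictions = loaded_model.predict(data.drop(columns=["Y"]))
print(predictions[:5])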

Conclusion

This was my first interaction with Fabric and it was pretty good. I was able to easily ingest data, visualize it, train an ML model on it, and use that model to make predictions...all within a single tool! I can't say the experience was completely flawless; there are definitely some glitches here and there. But Fabric is still in preview, so I call that acceptable. I don't think general availability is near, but I'm excited to keep track of the developments!

#microsoftfabric #dataengineering #analytics #mlflow #powerbi

Roel Peters

Co-founder of Gatekeeper | Easy hard skill assessments

Have you encountered and/or tried any enterprise features? E.g. cooperative features, branching and version control, staging environments, ...?
