Snowflake VS Azure Synapse | 7 reasons why you should choose Snowflake OR Synapse on Azure
Radu Vunvulea
VP of Cloud | Cloud Strategy and Transformation | Microsoft Regional Director | Microsoft MVP
Microsoft supports two large data warehouse and data analytics solutions inside?(1) Azure - Azure Synapse + Data Lake?and (2)?Snowflake. Both of them are two mature products that are similar from many points of view.?
More than one time, I was in a situation when customers were asking: 'What approach should I have? Should I go with an Azure Synapse centric approach or Snowflake on top of Azure?"?
There are a lot of factors that can influence the decision like business, team skills, data strategy, compliance, data model or expected features. The next section of the article covers the differences between Snowflake on Azure and Azure Synapse centric from different dimensions. Based on the public features publicly available on the market at the end of 2021.
Both solutions provide:
LAST EDIT: After I finished the article, I realized that it is long, and most of you would like to see the conclusion. Let's start with the ending, and if you want to drill down, you can read the rest of the article (smile).
Final Thoughts
Both approaches offer similar experience from the data warehouse capabilities. There are small flavours of each of them, like Secondary Index available only on Snowflake or Stored Procedures supported only on Azure Synapse. This is not a stopper in most cases because there are workarounds for them.?
Below you can find a direct comparison between the two approaches.?
Scale: 1 (lowest) -5 (highest)
Snowflake is the big winner if you look for a solution provided as SaaS, where cost economics can be kept simple, with reduced effort on the ops and management side.
The most significant impact is at Governance, Security, Automation, Scalability and Integration, where?Azure Synapse Centric solution is 3 steps ahead. You can do it also using Snowflake, but the Azure Synapse approach provides native support for data governance, key vaults, WAF, integration with native Azure Monitor, policy management and Azure DevOps and GitHub. Lack of integration with a CI/CD and Source Control system is a big minus for large systems.
Dimensions
The security dimension is covered in detail in the second part of the article. I dedicated an entire section because of the complexity and the high number of different perspectives.
Indexing
Integration
Data Sharing
Queries
Cost
Scale
Data Governance
Security Dimension
There are 11 security criteria that I took into account. Covering security at different layers and tools and mechanisms to do the governance and monitoring. Azure Synapse centric solution is well integrated with the Microsoft Azure ecosystem and provides better E2E security. At the data layer, both offer mature security and governance features.?
Data Security
Application Security
领英推荐
Endpoint Security
Access Control
Network Security
Perimeter Security
Physical Security
Operations Management
Security Audit and Tracking?
Monitoring and Recommendations
Policy Management
Core Features
An Azure Synapse centric solution supports data governance with Pureview and native integration with Azure Services for ELT, ETL and Spark. CI/CD pipelines and Source Control are supported by GitHub and Azure DevOps integration.
Snowflake has significant advantages when pause and resume are required, and secondary index or in-memory capability is essential. The most significant advantage of Snowflake is the SaaS offer, where there are no maintenance and operational cost of the platform itself.?
Pricing
The devil is in the details is true when we talk about pricing estimations. Nevertheless, the order of magnitude is the same. The running cost of a large solution running on top of Azure Synapse or Snowflake is similar. Running costs cannot be considered a significant differential factor.??
Computation
Storage
Tiers
Saving plans
Overall cost