If data platforms were cars, Snowflake would be...
Nick Akincilar
Analytics, AI & Cloud Data Architect | Solutions Whisperer | Tech Writer
I know you are probably thinking what in the world cars have anything to do with Snowflake data platform... Let me save you some time & say.... NOTHING!!!
However, I am a car guy and my favorite way to explain technology to business is using analogies that most people can relate to & understand. As a result, I tend to use cars a lot as a prop for my technology explanations & comparisons and garnish it with many other everyday items & gadgets to make my point across. So if you are wondering what is so special about Snowflake where the largest companies in the world are shifting their data platforms and running their entire businesses on it, here is why by way of using car analogies:
If data platforms were like cars, here is what it would look like in terms of how they would compare.
Your tired old on-prem data warehouses & Hadoop data lakes would be like 1950-1970 era cars. They would be huge, heavy & with big engines making small horsepower. Compared to today's cars, they would be slow, not handle well, won't provide much protection during an accident, and require a lot of ongoing maintenance and tuning to get that carburetor engine running properly. If you wanted your car to be more agile & faster, it would require a lot of time & expensive modifications and in the end, you would end up with a one-off custom kit car with you being the only person who knows how to run it properly. Most of the time, the best way to get a car from that time period to be handle better, be faster and more reliable would be to replace it & buy a better car (upgrade).
Your first-gen cloud-hosted data warehouse & lakes would be like cars from 1980-2000. They would have mostly fuel-injected engines with basic sensors & computers to automatically manage & tweak the engine output(some automation) to get better power from a smaller engine, be more reliable with less maintenance to keep it running well. You would also be a lot safer in an accident. You could easily slap on a turbo or supercharger kit to make it go much faster. Change shocks to make it handle better. Overall better performance, agility & safety from a much smaller and reliable package with much less work to make them go faster & turn better. The closer the production date is to the year 2000, the better and easier it gets. Again there is only so much you can do to make that car go faster within its limits. (meaning scaling was easier but still limited and usually a one-way ticket which was always UP or OUT and stayed that way till next big change) Once you hit that limit, your options are limited before blowing up the engine. You either replace the engine with a bigger one or replace the car yet again for a better car.
Current-gen cloud PaaS or SaaS platforms are much closer to today's modern traditional internal combustion engine cars. They are much more reliable, faster & safer. Their engines(compute power) would have a cylinder activation tech where you can switch between 4, 6, 8, 12, or more cylinders(nodes) to change the power output without having to modify the engine itself but this is usually a business disruptive process meaning they would have to pull over and shut the car down briefly to make this change. However, most owners would just stick with the engine size they got unless they absolutely need to change it because nobody wants to pull over on the highway and wait 10-15 mins while you got a bunch of impatient passengers(business users) during a trip. They also would be much more technologically advanced with ton of different features & gadgets where some can be very useful (auto windshield wipers) and others would require a Ph.D. to operate it and hardly ever gets used (like the launch control on BMW M cars or gesture & voice control features). Essentially sophisticated, high-tech & faster set of cars but not necessarily easier to use from the previous gen cars for most of us (I still prefer my 2000 BMW E46 to newer models, it didn't have as many features but the ones it had & the ones I cared about worked great and never used much of the newer features on later models). Also because of the complexity of tech and gadgets, if something goes wrong, figuring out & fixing these complex machines requires a highly skilled person. They are hard to find so you usually won't find these people at your local garage. They call them master technicians. They work in dealerships, get paid really well, and usually the guys who walk around in the shop with a tablet in their hand that is connected to your car with Bluetooth while sipping on their cappuccinos. They are the ones that come and talk to you to tell you it will be a $250 diagnosis charge but they were not able to replicate the problem you were experiencing. So current-gen cloud data platforms are like a faster, better & more sophisticated car with less maintenance on your part but not necessarily always easier to use and/or easy to fix/configure without a hard-to-find expensive expert.
This was all about speed, handling & maintenance. What about concurrency? This is life, stuff happens when you least expect it. You may have started as a happy single person with a fast 2 seater convertible. Then you get married and keep the car. Then you get news of a little one coming up in 9 months so you run and trade in your convertible with a 4 door sedan or a small SUV. Then the doctor tells you it is twins & give up the small SUV and upgrade to a bigger SUV (much more money). They get a little older and want their friends to go to the soccer practice with them forever, now you definitely need a minivan or a very big SUV (more money). Then they suddenly don't like their friends anymore, or friends decide to launch a fully loaded orange soda cup in the car as a joke and you ban them for life from riding in your vehicle. Now you are stuck with a big SUV that you are paying for. (This would be a typical self-service BI & Analytics workload). Or... maybe you are divorced with kids and you marry someone who also has kids (Mergers & Acquisitions), or your in-laws show up(Monday morning rush) and you suddenly end up with 2 or 3 times the number of people and may need a mini-bus. No one really wants to buy nor drive a mini-bus just in case they need the capacity a few times a month. With current traditional data platforms, this is essentially what IT has to deal with. They need to estimate the worst-case scenario in terms of capacity for the next few years and buy something big enough. And when things change and what they have is not enough, they have to buy a bigger system. If they planned it right but divorce happens where you lose headcount due to unknown circumstances or maybe IT was really bad at capacity planning, now you are stuck with that bigger box that you have to pay for & not need for a while.
领英推荐
And what about the cloud providers and being locked into only one of them? What if you are a FORD person but your business partner is CHEVY guy or you are a BMW guru but you got a job in a company that only works with AUDI? Does this mean, you can't work together or may have to learn about other's car technology and parts before you can work?
Let's talk Snowflake. So what kind of car would Snowflake be and why is it so damn popular?
Snowflake would be a brand new way of transportation vs. just a car you lease. Something between a Tesla, a futuristic car with features that are yet to be invented, and a private jet. It would also be delivered as a service & not sold as an actual product.
What do I mean?
Pretty cool right? Run the same size car but pay less because you only drive few hours a day vs 24 that you pay for with a regular lease. OR spend about the same as you would with a traditional lease but end up driving a variety of cars that are much faster and bigger than what you would normally get as you only pay while the engines are running(more cost per second but shorter engine time period as faster cars get you there quicker).
Ok, now that you know what kind of car Snowflake would be, what kind of car would you rather have?
I personally rather be able to have a garage full of choices(cars & planes) that I can pick what I need at any point within seconds & pay only for the duration of use down to a second vs. having to pick one car, pre-pay for 24x7 usage, and locked into that car for few years at a time if the TCO is same or better.
If you like this article, feel free to like it and share it with your network... If not, pretend this didn't happen & you have not wasted 10 mins of your life reading this.
Lead/Architect Data Engineer | Mastering SAP BW, HANA, SAP Modules | Cloud Data Technologies Specialist in Snowflake, Data Bricks | Proficient in ETL/ELT: Fivetran, Qlik Replication, BODS, HANA SDI, SDA
3 年Totally make sense. Super Analogy!!!!
Florida Sales Manager at Sigma Computing | Enterprise BI, Spreadsheet UI
3 年Great article. I use the car analogy all the time. Traditional DW = Custom Race Car - suped-up, can go fast on the track, a lot of custom/moving parts, the driver needs to set up and maintain themselves, will need to go to the shop monthly (no insurance or high premiums), not versatile (can't fit more people, not good for a Sunday cruise, and can't handle snow : ). Also, will need to spend the weekend cleaning and washing it off. Snowflake Data Platform = Open your garage and have unlimited access to lease any sedan/SUV/sports car you'd like. Whether it's commuting to work, bad weather, a camping trip with the entire family, or F1 race - the driver decides whenever they open the garage what they want to drive that day. Only paying for the amount of seconds they are driving the cars. The fleet of cars is always performant, clean, ready for a spin, the latest models and can even use auto-pilot! I'd rather focus my time on where I'm going, not spending time making sure I'm able to get there. Put the key in and go!
Data Geek, Speaker, Blogger, Sales Engineer, Angel Advisor
3 年I don’t think Snowflake is just one car it’s a rental car stand that lets you choose ANY car on the lot or the helicopter, boat or train in the “secret” back lot. Choose the tool right for the purpose (warehouse size and SQL command). You can get the beater, family car, economy, truck or semitrailer all from the same stall...in 150ms. Sometimes it’s Uber and you don’t drive the vehicle (Snowpipe, Search Optimization, Clustering or MV Maintenance). So I think it might be tough to pin it to just one type for car. It a multi-modal transportation platform that goes fast as hell!
The car analogy is great and this article is very insightful (and funny too!)
Architecting the Future Data and AI Solutions for a New Tomorrow
3 年Must share, great analogy.