登录查看更多内容

Stop replicating your SAP data to hyperscalers cloud storage for your ML use cases. You're loosing time, money and don't offer the right insights !

Didier Heck

helping organizations to find the fastest way to turn data into valuable business outcomes and to innovate

发布日期: 2022年9月7日

What's the issue ?

Everybody agrees that hyperscaler platforms are able to manage huge quantities of unstructured/semi-structured data and providing analytics and AI capabilities on those data sets.

However, to lay the foundation for analytics as well as data science experiments on their platform,?the hyperscalers are creating the?need for businesses to extract their business data out?to cheap cloud storages because without business data there is no context to the analysis they will provide on the non structured/semi-structured data they have collected from other sources.

This forced data replication is due to the fact that analytics & building of?machine learning models by those hyperscalers can only work or work seamlessly when the data resides on the respective hyperscalers platform’s native cloud storages.

This inadvertently brings in the need for expensive ETL and data pipelines to move the data across systems (sometimes with CDC to satisfy realtime replication) but it’s not just about additional cost !

It?leads to data inconsistency issues because as data science people don’t know what they don’t know about SAP processes and SAP's highly normalized data structures, when they simply extract out SAP data, they often make significant mistakes when they recontextualize it in their data lake. If the data is re-contextualized incorrectly there is no coherent way to use that data and send correct insights back.

On top, this is taking away the time and focus of data scientists, as they are the ones who end?up tackling data sourcing issues.

The Solution

SAP Federated-ML or FedML is a library built to address this issue. The library applies the Data Federation architecture of SAP Data Warehouse Cloud and provides functions that enable businesses and data scientists to build, train and deploy machine learning models on hyperscalers, thereby eliminating the need for replicating or migrating data out from its original source.

By abstracting the data connection, data load and model training on these hyperscalers, the SAP FedML library provides end to end integration with just a few lines of code.

On top, SAP FedML will help organizations to avoid vendor lock-in and aids them with reduction of their hyperscaler storage costs, and adherence to GDPR policies,?as data migration is eliminated. It also enables instant access to cross-cloud data sources, combined with SAP Business data managed through SAP Data Warehouse Cloud’s unified semantic models.?

Interested to see how it works ?

Azure is your hyperscaler of choice, then check this blog on how to immediately take advantage of SAP FedML with Azure ML 2.0

Google is your hyperscaler of choice, then check this blog on how to immediately take advantage of SAP FedML with Google Cloud Vertex AI 2.0

领英推荐

Why Is Cloud Data Analytics Important?

Digiprima Technologies 3 个月前

Top Emerging Data Warehousing Start-Ups In 2024;…

Spherical Insights LLP 10 个月前

Crafting a Digital Enterprise with Google Cloud Data…

Inflexion Analytics 8 个月前

Amazon is your hyperscaler of choice, then check this blog on how to immediately take advantage of SAP FedML with Amazon SageMaker 2.0

Only SAP data required for your advanced analysis use case ?

SAP’s Python package?hana_ml?and R package?hana.ml.r?make it easy to trigger such an advanced analysis from any Python or R environment on the data held in SAP Data Warehouse Cloud / SAP HANA Cloud. And those packages allows you also to easily take advantage of the hundred ML algorithms provided by the SAP HANA APL, PAL libraries as well as taking advantage of the SAP HANA Graph Engine and Spatial Engine.

Just install those packages in a notebook that runs in your hyperscaler’s environment (e.g install HANA ML to Databricks notebook),?trigger processing both in-HANA and Spark from that same notebook, and bring results of data processing to local environment (via .collect() statement) as needed.

Check those few examples :

要查看或添加评论，请登录

Didier Heck的更多文章

Why is SAP Business Data Cloud so different from SAP Datasphere and SAP Analytics Cloud combined ?

2025年2月20日

Why is SAP Business Data Cloud so different from SAP Datasphere and SAP Analytics Cloud combined ?

One solution for all data and analytics requirements SAP Business Data Cloud is a Software-as-a-Service solution…

6 条评论
Always ready to learn more about the possibilities offered by SAP HANA Cloud

2022年9月19日

Always ready to learn more about the possibilities offered by SAP HANA Cloud

Find here after a series of SAP Community blogs to satisfy your need to know always more about many different aspects…
Define your SAP Data Warehousing strategy : SAP BW/4HANA - SAP HANA - SAP Data Warehouse Cloud (DWC)

2020年8月4日

Define your SAP Data Warehousing strategy : SAP BW/4HANA - SAP HANA - SAP Data Warehouse Cloud (DWC)

With the increasing amount of data and data sources, the ability to build rich semantic models, manage a vast number of…

2 条评论
SAP Data Hub Video Series - Episode 2

2018年5月7日

SAP Data Hub Video Series - Episode 2

Welcome to this new SAP Data Hub Video Series. In this second episode, we will start to illustrate the functionnality…
SAP Data Hub Video Series - Episode 1

2018年5月3日

SAP Data Hub Video Series - Episode 1

Welcome to this new SAP Data Hub Video Series. In this first episode, we will look at the issues faced by enterprises…
openSAP course on SAP HANA Dynamic Tiering

2018年3月28日

openSAP course on SAP HANA Dynamic Tiering

The next course in our SAP HANA core knowledge series, Introduction to SAP HANA Dynamic Tiering, is aimed at database…
SAP HANA Landscape Definition Guide

2017年7月14日

SAP HANA Landscape Definition Guide

To get the most out of the SAP HANA platform, it is mission critical to outline a blueprint for its deployment. One of…
In the midst of the digital era, can employee satisfaction be reconciled with efficiency?

2017年7月14日

In the midst of the digital era, can employee satisfaction be reconciled with efficiency?

Is there a human face to digital HR? There is no doubt the HR department has a few challenges to face. On September 6…
Digital twins aren’t a new concept, but their application throughout the product lifecycle is.

2017年5月5日

Digital twins aren’t a new concept, but their application throughout the product lifecycle is.

Digital twins are virtual representations of a real-world products or assets. And, to compete in the digital economy…

1 条评论
SAP Leonardo & Digitizing Business: The Big Picture

2017年5月5日

SAP Leonardo & Digitizing Business: The Big Picture

SAP has supported the digitalization of business processes for more than 45 years through our business Suite solutions…

See all articles

Stop replicating your SAP data to hyperscalers cloud storage for your ML use cases. You're loosing time, money and don't offer the right insights !

Didier Heck

helping organizations to find the fastest way to turn data into valuable business outcomes and to innovate

What's the issue ?

The Solution

Interested to see how it works ?

领英推荐

Only SAP data required for your advanced analysis use case ?

Didier Heck的更多文章

社区洞察

其他会员也浏览了

Qlik Announces Qlik Talend Cloud and Qlik Answers: Eliminating Barriers to Enterprise AI Adoption

TechX Corp Affirms its Leading Position in AWS Cloud-Based Data and Analytics

Data lake vs. data warehouse: understanding the differences and use cases

Movie Magic with AWS: Create Your Own Recommendation System

Data Migration to AWS cloud for Fintech

Unleashing the Power of ?? BigQuery and ?? SAP Analytics: A Match Made in Data Heaven

Unleashing the Power of ?? BigQuery and ?? SAP Analytics: A Match Made in Data Heaven

Snowflake vs. Databricks: A Comprehensive Comparison

Snowflake Exploring AI For Data Warehousing Capabilities

Episode 4- From Serverless to Self-Services Analytics passing by Cloud Technologies

What's the issue ?

The Solution

Interested to see how it works ?

领英推荐

Only SAP data required for your advanced analysis use case ?

Didier Heck的更多文章

Why is SAP Business Data Cloud so different from SAP Datasphere and SAP Analytics Cloud combined ?

Always ready to learn more about the possibilities offered by SAP HANA Cloud

Define your SAP Data Warehousing strategy : SAP BW/4HANA - SAP HANA - SAP Data Warehouse Cloud (DWC)

SAP Data Hub Video Series - Episode 2

SAP Data Hub Video Series - Episode 1

openSAP course on SAP HANA Dynamic Tiering

SAP HANA Landscape Definition Guide

In the midst of the digital era, can employee satisfaction be reconciled with efficiency?

Digital twins aren’t a new concept, but their application throughout the product lifecycle is.

SAP Leonardo & Digitizing Business: The Big Picture

社区洞察

其他会员也浏览了

Qlik Announces Qlik Talend Cloud and Qlik Answers: Eliminating Barriers to Enterprise AI Adoption

TechX Corp Affirms its Leading Position in AWS Cloud-Based Data and Analytics

Data lake vs. data warehouse: understanding the differences and use cases

Movie Magic with AWS: Create Your Own Recommendation System

Data Migration to AWS cloud for Fintech

Unleashing the Power of ?? BigQuery and ?? SAP Analytics: A Match Made in Data Heaven

Unleashing the Power of ?? BigQuery and ?? SAP Analytics: A Match Made in Data Heaven

Snowflake vs. Databricks: A Comprehensive Comparison

Snowflake Exploring AI For Data Warehousing Capabilities

Episode 4- From Serverless to Self-Services Analytics passing by Cloud Technologies