ç™»å½•æŸ¥çœ‹æ›´å¤šå†…å®¹

Amazon Sagemaker Feature Store

Pronam Chatterjee

Founder and CEO - BluePi

å‘å¸ƒæ—¥æœŸ: 2022å¹´7æœˆ21æ—¥

+ å…³æ³¨

Recap

Just to recap, the main reasons we need a feature store are

Consistency in features for Models (variations can impact model results)
Reuse of features across models and teams saving time and cost
Enabling feature discovery and versioning

Sagemaker

Amazon Sagemaker is a fully managed, purpose-built repository for features. Being fully managed means the entire infrastructure, setup and provisioning are managed by AWS and do not require any management.

The gateway to the feature store is Amazon Sagemaker Studio. The Studio is a fully managed Jupyter Lab environment.?When you open the studio you would find the following widget in the Launcher tab.

Feature groups

To bundle related features together,?we use feature groups. Imagine the Feature group as a table and each feature as a column.?Each row is a way to group related features. As a simple example:

The customer would be a feature group
Recency, Frequency and Monetary categories/values would be separate features
Each customer would be a separate record

Each record contains a unique RecordIdentifier to uniquely identify the record. In our example, it could be the customerId.

Feature groups could be made available online or offline or both. Online Feature groups are mainly used for real-time predictions and store only the latest version of the feature data.?The read latency for an online store is a few milliseconds.

é¢†è‹±æŽ¨è

Build a serverless API using an Azure Function that reads and writes data to an Azure Cosmos DB with little to no code

Build a serverless API using an Azure Function thatâ€¦

Dimitar Iliev ?? 2 å¹´å‰

CREATE YOUR FIRST CICD PIPELINE

Gabe Olokun 3 å¹´å‰

AWS Cognito Backup Restore Solution with Python3 Lambda

ITGix Ltd 1 å¹´å‰

From there, navigating to the feature store you get a?view to see the list of all features and feature groups.

In true AWS tradition, every action that you can perform from the UI is also available via APIs. There is a Sagemaker python library that is available that can be used for API access.

The sample code to create a feature group is below and it is quite self-explanatory:

import sagemaker

sagemaker_session = sagemaker.Session()
region = sagemaker_session.boto_region_name

products_feature_group = FeatureGroup(
name=customers_feature_group_name, sagemaker_session=sagemaker_session
)

product_data = pd.read_csv("data/product.csv")

product_data["EventTime"] = pd.Series([current_time_sec] * len(product_data), dtype="float64")
customers_feature_group.load_feature_definitions(data_frame=product_data)

customers_feature_group.create(s3_uri="<s3 path>", record_identifier_name=record_identifier_feature_name, event_time_feature_name="EventTime", role_arn=role, enable_online_store=True)

With the use of a short code snippet above, we can create features into Sagemaker. The API for retrieval is equally simple and intuitive.

In a nutshell, Sagemaker Feature Store makes it a breeze to store, discover and retrieve features for Machine Learning.

Footnotes:

Sagemaker library version must be greater than 2.0

è¦æŸ¥çœ‹æˆ–æ·»åŠ è¯„è®ºï¼Œè¯·ç™»å½•

Pronam Chatterjeeçš„æ›´å¤šæ–‡ç«

Feature Store and Why is it needed

2022å¹´7æœˆ19æ—¥

Feature Store and Why is it needed

When you build ML models you don't input raw data as it rarely is in a format that can be used by the ML Modelsâ€¦
Are You Ready For The Artificial Intelligence Supply Chain Model And Its Impact On Sales?

2020å¹´4æœˆ1æ—¥

Are You Ready For The Artificial Intelligence Supply Chain Model And Its Impact On Sales?

Modern industrial sales and supply chains are complex. The amount of a product or service that a company can sell orâ€¦
Demand Forecasting Retail-Best Practices

2020å¹´3æœˆ11æ—¥

Demand Forecasting Retail-Best Practices

Demand forecasting retail is one of the toughest jobs. One has to look into the existing market data, storeâ€¦
How ML In Supply Chain Optimization Is Improving Management And Efficiency

2020å¹´2æœˆ4æ—¥

How ML In Supply Chain Optimization Is Improving Management And Efficiency

79% of companies that see greater revenue growth in their industry have a well-optimized and high-performing supplyâ€¦

1 æ¡è¯„è®º
How Deep Learning Solves Retail Forecasting Challenges

2020å¹´1æœˆ6æ—¥

How Deep Learning Solves Retail Forecasting Challenges

A study by Harvard Business Review and Snowflake Computing points out that retailers who choose to make data-drivenâ€¦

1 æ¡è¯„è®º
How AI is making the fashion industry smarter!

2019å¹´12æœˆ11æ—¥

How AI is making the fashion industry smarter!

Sephoraâ€™s Color IQ recommends customized foundation and concealer shades after scanning the shopperâ€™s skin using AIâ€¦

3 æ¡è¯„è®º
Success in Supply Chain with AI: Proven Ways

2019å¹´9æœˆ17æ—¥

Success in Supply Chain with AI: Proven Ways

A growing number of enterprises today are turning their trust towards machine learning in AI. There are various reasonsâ€¦

1 æ¡è¯„è®º
Data-Driven Transformation Affair in Retail

2019å¹´9æœˆ9æ—¥

Data-Driven Transformation Affair in Retail

Retail is one of the strongest and fastest-growing industries worldwide. 2019 onwards it will further pick up the paceâ€¦
Life of BluePi: 4 years of growth and how!

2017å¹´1æœˆ24æ—¥

Life of BluePi: 4 years of growth and how!

Clocking a 100% growth in revenues every year is no joke! More so, if youâ€™re a self-funded, lean company, workingâ€¦

7 æ¡è¯„è®º
Migrate Enterprise Applications to AWS

2014å¹´6æœˆ2æ—¥

Migrate Enterprise Applications to AWS

In this Webinar we would look at why migrating enterprise applications to the cloud are all the rage in the industryâ€¦

See all articles

Amazon Sagemaker Feature Store

Pronam Chatterjee

Founder and CEO - BluePi

Recap

Sagemaker

Feature groups

é¢†è‹±æŽ¨è

Pronam Chatterjeeçš„æ›´å¤šæ–‡ç«

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

*msaFilesystem: Practical way of file system management

Read and Write to BigQuery with Spark and IDE from On-Premises

How to Run a Sagemaker Notebook From AWS Lambdas ?

Building a Scalable Backend with AWS Lambda, API Gateway, and DynamoDB

Deploying a Flask Application on AWS with Docker, Amazon Elastic Container Registry(ECR), and Amazon Elastic Kubernetes Service(EKS)

Schedule Amazon RDS Instance stop and start using AWS Lambda

Building a serverless app - part 1

Deploy a FastAPI application with Dapr on Kubernetes

Local AWS Glue development via Docker with Private CA Bundle

Recap

Sagemaker

Feature groups

é¢†è‹±æŽ¨è

Pronam Chatterjeeçš„æ›´å¤šæ–‡ç«

Feature Store and Why is it needed

Are You Ready For The Artificial Intelligence Supply Chain Model And Its Impact On Sales?

Demand Forecasting Retail-Best Practices

How ML In Supply Chain Optimization Is Improving Management And Efficiency

How Deep Learning Solves Retail Forecasting Challenges

How AI is making the fashion industry smarter!

Success in Supply Chain with AI: Proven Ways

Data-Driven Transformation Affair in Retail

Life of BluePi: 4 years of growth and how!

Migrate Enterprise Applications to AWS

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

*msaFilesystem: Practical way of file system management

Read and Write to BigQuery with Spark and IDE from On-Premises

How to Run a Sagemaker Notebook From AWS Lambdas ?

Building a Scalable Backend with AWS Lambda, API Gateway, and DynamoDB

Deploying a Flask Application on AWS with Docker, Amazon Elastic Container Registry(ECR), and Amazon Elastic Kubernetes Service(EKS)

Schedule Amazon RDS Instance stop and start using AWS Lambda

Building a serverless app - part 1

Deploy a FastAPI application with Dapr on Kubernetes

Local AWS Glue development via Docker with Private CA Bundle

é¢†è‹±æŽ¨è

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†