登录查看更多内容

How to Build a Streamlit App for Favorita Grocery Sales Forecasting Using Regression Model

Stella Oiro

Apprentice SoftwareDeveloper || Technical Writer || Expert SEO Writer || Clinical Officer || Entrepreneur

发布日期: 2023年4月16日

Are you interested in predicting future grocery sales for a retail corporation? If you're interested, you can check out my GitHub for more projects related to data science and machine learning. In this article, we'll walk you through how to build a Streamlit app using a regression model that was trained on the Favorita Grocery Sales dataset.

Data Description

The Favorita Grocery Sales dataset consists of transactional records of a retail corporation in Ecuador over a period of five years. The data contains information about store locations, item descriptions, on-shelf dates, promotions, and unit sales. The goal of the competition is to predict the unit sales for a set of test items and stores.

Model Training

To train our regression model, we used a combination of feature engineering and XGBoost regression. We started by cleaning the data, removing duplicates and missing values, and then engineered new features such as day of the week, month, and year. We also used one-hot encoding to convert categorical variables into binary features.

After feature engineering, we split the data into training and validation sets, trained an XGBoost regression model on the training data, and tuned the hyperparameters using grid search. Finally, we evaluated the model on the validation set and calculated the root mean squared logarithmic error (RMSLE) to measure the performance of the model.

Streamlit App Development

To develop our Streamlit app, we started by importing the necessary libraries, loading the trained model and encoder, and defining the input and output interfaces for the app. We then defined the prediction function, which takes user inputs, preprocesses them using the encoder, and feeds them into the trained model to make a prediction.

领英推荐

Big Data At Walmart: How The Mind-Blowing 40+ Petabyte…

Bernard Marr 7 年前

Big Data-Driven Decision-Making At Domino’s Pizza

Bernard Marr 8 年前

Deep Diving into Retail Data Analytics: How Can…

JK Tech 2 年前

# Import necessary libraries
import streamlit as st
import pickle
import pandas as pd
from sklearn.preprocessing import LabelEncoder
from xgboost import XGBRegressor

# Load the trained model and encoder
model = pickle.load(open("model.pkl", "rb"))
encoder = pickle.load(open("encoder.pkl", "rb"))

# Define the input and output interfaces for the Streamlit app
st.title("Favorita Grocery Sales Forecasting")
store_item_id = st.text_input("Store Item ID", "0_0")
date = st.date_input("Date")
onpromotion = st.selectbox("On Promotion", ["True", "False"])

# Define the prediction function
@st.cache()
def predict_sales(store_item_id, date, onpromotion):
    df = pd.DataFrame({"store_item_id": [store_item_id],
                       "date": [date],
                       "onpromotion": [onpromotion]})
    df["store_id"], df["item_id"] = df["store_item_id"].str.split("_", 1).str
    df["year"] = df["date"].dt.year
    df["month"] = df["date"].dt.month
    df["day"] = df["date"].dt.day
    df["weekday"] = df["date"].dt.weekday
    df["onpromotion"] = encoder.transform(df[["onpromotion"]])
    df.drop(["store_item_id", "date"], axis=1, inplace=True)
    prediction = model.predict(df)
    return prediction[0]

# Call the prediction function and display the outpu
if st.button("Predict Sales"):
    prediction = predict_sales(store_item_id, date, onpromotion)
    st.write("Predicted Unit Sales: ", prediction)t

Results

Our Streamlit app allows you to input a store item ID, date, and promotion status, and receive a prediction for the unit sales for that item and store. The app preprocesses your inputs and feeds them into the trained regression

要查看或添加评论，请登录

Stella Oiro的更多文章

How to Conquer Go Codebase Complexity as a Junior Developer: A Comprehensive Guide

2024年9月29日

How to Conquer Go Codebase Complexity as a Junior Developer: A Comprehensive Guide

Table of Contents: 1. Introduction 2.

2 条评论
The Victory of AfyaChain at the Zone01 Kisumu 48-hour Hackathon

2024年8月3日

The Victory of AfyaChain at the Zone01 Kisumu 48-hour Hackathon

In the heart of Kisumu, the Innovating for Inclusive and Sustainable Development with Blockchain hackathon promised a…

14 条评论
7 Ways to Write the Itoa Function in Go

2024年6月1日

7 Ways to Write the Itoa Function in Go

Converting an integer to its ASCII string representation is a common task in programming. In Go, the strconv package…

1 条评论
Why Recursion in Go is a Game-Changer: A Deep Dive with Practical Examples

2024年5月30日

Why Recursion in Go is a Game-Changer: A Deep Dive with Practical Examples

Recursion is a fundamental concept in computer science, and Go provides a powerful platform to explore and implement…
Concurrency for Beginners: Goroutines and Channels in Go

2024年5月19日

Concurrency for Beginners: Goroutines and Channels in Go

Imagine a busy kitchen. A chef (your program) can only do one thing at a time (sequential code).
Mastering Error Handling in GO

2024年5月18日

Mastering Error Handling in GO

Ever feel bogged down by error checks in your Golang code? Go's explicit error handling, while promoting code clarity…
How to Develop a Sepsis Prediction App Using FastAPI

2023年6月11日

How to Develop a Sepsis Prediction App Using FastAPI

Are you interested in building a powerful sepsis prediction application that leverages machine learning and real-time…
Sentiment Analysis with DistilBert: A Complete Guide

2023年5月16日

Sentiment Analysis with DistilBert: A Complete Guide

Sentiment analysis is the process of determining the emotional tone of a piece of text. It is a valuable tool for…

3 条评论
Telco Customer Churn Prediction Using Gradio App: A Step-by-Step Guide

2023年4月16日

Telco Customer Churn Prediction Using Gradio App: A Step-by-Step Guide

Customer churn is a major challenge for many businesses, especially in the telecommunication industry. In order to…
Predicting Customer Churn: An Analysis of Key Indicators and Retention Strategies

2023年3月15日

Predicting Customer Churn: An Analysis of Key Indicators and Retention Strategies

Customer churn is a critical problem for businesses as it can lead to a loss of revenue and customer loyalty. In this…

See all articles

How to Build a Streamlit App for Favorita Grocery Sales Forecasting Using Regression Model

Stella Oiro

Apprentice SoftwareDeveloper || Technical Writer || Expert SEO Writer || Clinical Officer || Entrepreneur

Data Description

Model Training

Streamlit App Development

领英推荐

Stella Oiro的更多文章

社区洞察

其他会员也浏览了

Data Analytics: How Retail is Unlocking the Power of Data

4 ways Big Data is impacting the world of e-commerce

What role does data science play in the success of an e-commerce platform's finest functionality?

Retail Analytics Market

Analyzing Grocery Shopping Patterns with Instacart Data

Impact of Big Data Analytics in Retail Industry (Simplified)

Data Is The New Gold: Why Data Is A Brands Best Friend

Applications of Data Science in the E-commerce industry

Five Benefits Of Big Data Analytics And How Companies Can Get Started

Data Analysts in the E-Commerce Industry

Data Description

Model Training

Streamlit App Development

领英推荐

Stella Oiro的更多文章

How to Conquer Go Codebase Complexity as a Junior Developer: A Comprehensive Guide

The Victory of AfyaChain at the Zone01 Kisumu 48-hour Hackathon

7 Ways to Write the Itoa Function in Go

Why Recursion in Go is a Game-Changer: A Deep Dive with Practical Examples

Concurrency for Beginners: Goroutines and Channels in Go

Mastering Error Handling in GO

How to Develop a Sepsis Prediction App Using FastAPI

Sentiment Analysis with DistilBert: A Complete Guide

Telco Customer Churn Prediction Using Gradio App: A Step-by-Step Guide

Predicting Customer Churn: An Analysis of Key Indicators and Retention Strategies

社区洞察

其他会员也浏览了

Data Analytics: How Retail is Unlocking the Power of Data

4 ways Big Data is impacting the world of e-commerce

What role does data science play in the success of an e-commerce platform's finest functionality?

Retail Analytics Market

Analyzing Grocery Shopping Patterns with Instacart Data

Impact of Big Data Analytics in Retail Industry (Simplified)

Data Is The New Gold: Why Data Is A Brands Best Friend

Applications of Data Science in the E-commerce industry

Five Benefits Of Big Data Analytics And How Companies Can Get Started

Data Analysts in the E-Commerce Industry