DEPLOYING ML CLASSIFIER INTO WEB APP USING FASTAPI

Gideon Dadzie

Data Analyst/Engineer | Backend Engineer | Electrical/Electronic Engineer | Writer

Published: August 20, 2023

INTRODUCTION

Deploying an ML model behind an API is generally a secure and reliable way for users to interact with the model: users with authenticated access to the API send a request and receive a response (usually in JSON or XML format) from the analytic server on which the model runs. There are several API frameworks for this kind of task; however, FastAPI has gained popularity since its introduction in 2018. As the name suggests, FastAPI is fast (i.e. high performance), is well documented, and is designed for building web APIs with Python. This article walks readers through a project in which a Docker image was created for deploying an ML classifier (built on scikit-learn) using FastAPI.

PROJECT STRUCTURE

The project proceeded in the following main stages:

1. Building the ML Classifier: This phase of the project was carried out in Google Colab. Consistent with the CRISP-DM approach, this phase included:

a) Data Understanding: The patient dataset was acquired from Johns Hopkins University. It had 11 fields: the first 10 held patient information and the last (the target variable) specified whether the patient was diagnosed with sepsis.

i) Data Overview: An exploratory analysis was performed to identify potential issues in the dataset. Fortunately, the dataset had little to no issues, so little time was spent on cleaning; only the column names were changed.

[Figure: Data overview and descriptive statistics of the variables]

ii) Formulating and Testing Hypothesis

Null Hypothesis: There is no significant difference in the likelihood of young and old patients developing Sepsis.

Alternate Hypothesis: The likelihood of young patients developing Sepsis differs significantly from that of old patients.
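A hypothesis of this shape can be checked with a two-proportion z-test. The sketch below is an assumption, not the test used in the project: the article does not name the test, the age cutoff, or the counts, so the function name `two_proportion_z_test`, the 5% significance level, and all numbers are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z_test(pos_a: int, n_a: int, pos_b: int, n_b: int) -> tuple[float, float]:
    """Return (z statistic, two-sided p-value) for H0: rate_a == rate_b."""
    rate_a = pos_a / n_a
    rate_b = pos_b / n_b
    # Pooled rate under the null hypothesis of no difference.
    pooled = (pos_a + pos_b) / (n_a + n_b)
    std_err = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (rate_a - rate_b) / std_err
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


# Hypothetical counts: sepsis cases among "young" and "old" patients.
z_stat, p_val = two_proportion_z_test(pos_a=40, n_a=200, pos_b=80, n_b=200)
reject_null = p_val < 0.05  # assumed 5% significance level
```

If `reject_null` is true, the data are inconsistent with the null hypothesis at the chosen level, which would favour the alternate hypothesis.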

iii) Univariate and Multivariate Analysis: Using visualizations, the distributions of the individual variables (to detect outliers and "tailedness") and the relationships between the variables were explored.

b) Data Preparation: Here, the target variable (Sepsis) was encoded with scikit-learn's LabelEncoder() and the independent variables were scaled with StandardScaler().
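A minimal sketch of this preparation step follows. The toy arrays and the "Positive"/"Negative" label values are assumptions for illustration; the real column names and values are in the project repository.

```python
import numpy as np
from sklearn.preprocessing import LabelEncoder, StandardScaler

# Hypothetical toy data: two numeric features and a text target.
X = np.array([[50.0, 120.0], [30.0, 80.0], [70.0, 160.0], [50.0, 120.0]])
y = np.array(["Positive", "Negative", "Positive", "Negative"])

# Encode the target labels as integers (classes are sorted alphabetically).
label_encoder = LabelEncoder()
y_encoded = label_encoder.fit_transform(y)

# Standardize each feature to zero mean and unit variance.
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)
```

In practice the scaler is usually fitted on the training split only and then reused to transform the evaluation data, so that no information leaks from the evaluation set.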

c) Modelling: Here, six different models were built, trained and evaluated. After evaluation, the four (4) best models underwent hyperparameter tuning, and the best-performing estimator was chosen as the ML classifier. The main stages in the modelling were:

i) Data Splitting: The dataset available for training was split into a training set and an evaluation set.
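The split can be sketched with scikit-learn's `train_test_split`. The 80/20 ratio, the stratification, and the synthetic data are assumptions; the article does not state the settings that were used.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical data: 100 patients, 10 features, imbalanced binary target.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))
y = np.array([1] * 30 + [0] * 70)

# Hold out 20% for evaluation; stratify keeps the class ratio in both parts.
X_train, X_eval, y_train, y_eval = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)
```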

ii) Balancing the target classes: There was an imbalance between the classes of the target variable. As part of standard practice to reduce bias towards one class during training, the dataset was balanced using the SMOTE technique (over-sampling).
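To show the idea behind SMOTE, here is a simplified NumPy sketch: each synthetic row is placed at a random point on the line segment between a minority sample and one of its nearest minority neighbours. This is an illustration only, not the library implementation the project may have used; the function name `smote_oversample`, its parameters, and the data are hypothetical.

```python
import numpy as np


def smote_oversample(X_min: np.ndarray, n_new: int, k: int = 5, seed: int = 0) -> np.ndarray:
    """Create n_new synthetic minority samples by SMOTE-style interpolation."""
    n = len(X_min)
    if n < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = np.random.default_rng(seed)
    k = min(k, n - 1)  # cannot have more neighbours than other samples
    # Pairwise Euclidean distances between minority samples.
    dists = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)  # exclude each point from its own neighbours
    neighbours = np.argsort(dists, axis=1)[:, :k]

    synthetic = np.empty((n_new, X_min.shape[1]))
    for i in range(n_new):
        base = rng.integers(n)                  # random minority sample
        nb = neighbours[base, rng.integers(k)]  # one of its k nearest neighbours
        gap = rng.random()                      # position along the segment, in [0, 1)
        synthetic[i] = X_min[base] + gap * (X_min[nb] - X_min[base])
    return synthetic


# Hypothetical training data: 70 majority rows (class 0), 30 minority rows (class 1).
rng = np.random.default_rng(1)
X_train = rng.normal(size=(100, 4))
y_train = np.array([0] * 70 + [1] * 30)

X_min = X_train[y_train == 1]
X_new = smote_oversample(X_min, n_new=40)  # 70 - 30 = 40 rows needed to balance
X_balanced = np.vstack([X_train, X_new])
y_balanced = np.concatenate([y_train, np.ones(len(X_new), dtype=int)])
```

Over-sampling is normally applied to the training split only, so that the evaluation set keeps the original class distribution.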

iii) Training: Six models were trained on the training set and evaluated on the evaluation set, and an evaluation score was recorded for each model.
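A train-and-compare loop of this kind might look like the sketch below. The article does not list the six models or the metric, so the three candidates, the F1 score, and the synthetic data are all assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data; the project used the patient dataset instead.
X, y = make_classification(n_samples=300, n_features=10, random_state=0)
X_train, X_eval, y_train, y_eval = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Hypothetical candidates; the article does not list the six models it used.
models = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "decision_tree": DecisionTreeClassifier(random_state=0),
    "random_forest": RandomForestClassifier(n_estimators=50, random_state=0),
}

scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = f1_score(y_eval, model.predict(X_eval))

# Rank the candidates from best to worst F1 score.
ranking = sorted(scores, key=scores.get, reverse=True)
```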

iv) Hyperparameter Tuning

The best-performing models were further tuned to select the best-performing estimator.
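One common way to tune a model is a cross-validated grid search. The sketch below assumes `GridSearchCV` with a random forest; the estimator, the parameter grid, the scoring metric, and the data are hypothetical, since the article does not say which search strategy was used.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Synthetic stand-in for the (balanced) training data.
X_train, y_train = make_classification(n_samples=200, n_features=10, random_state=0)

# Hypothetical search space.
param_grid = {"n_estimators": [25, 50], "max_depth": [3, None]}

search = GridSearchCV(
    estimator=RandomForestClassifier(random_state=0),
    param_grid=param_grid,
    scoring="f1",
    cv=3,
)
search.fit(X_train, y_train)

best_model = search.best_estimator_  # refit on all of X_train with the best parameters
best_params = search.best_params_
```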

v) Exporting components

All relevant components, such as requirements.txt, the model preprocessing functions and the best model, were exported for use in the application.
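Exporting and reloading the components can be sketched with `pickle`. The file name `ml_components.pkl`, the dictionary keys, and the tiny stand-in model are assumptions; the project's actual export format may differ.

```python
import pickle
import tempfile
from pathlib import Path

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

# Tiny stand-in objects; in the project these would be the fitted scaler and tuned model.
X = np.array([[0.0, 1.0], [1.0, 0.0], [2.0, 3.0], [3.0, 2.0]])
y = np.array([0, 0, 1, 1])
scaler = StandardScaler().fit(X)
model = LogisticRegression().fit(scaler.transform(X), y)

# Bundle everything the app needs into one file (file name is hypothetical).
export_path = Path(tempfile.mkdtemp()) / "ml_components.pkl"
with open(export_path, "wb") as f:
    pickle.dump({"scaler": scaler, "model": model}, f)

# Later, inside the web app: load the bundle and predict.
with open(export_path, "rb") as f:
    components = pickle.load(f)
prediction = components["model"].predict(components["scaler"].transform(X))
```

Pickled objects should be loaded only from trusted sources and with matching library versions, which is one reason to export requirements.txt alongside the model.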

2. Web Application using FastAPI

i) Setting up the environment: Here, a virtual environment was created and all dependencies for the project were installed.

iii) Deploying App

3. Container: Finally, a Dockerfile was used to build a Docker image, from which the container is created.

CONCLUSION

FastAPI provides a fast, modern framework for deploying ML applications. This project has been insightful and another step deeper into the world of data and solutions. All files are available on my GitHub: https://github.com/MrDadzie/Sepsis_Classification_Project.git.

There is more to come...

REFERENCES

Azubi Africa. https://www.azubiafrica.org/
FASTAPI Documentation. https://fastapi.tiangolo.com/
Machine Learning with Scikit-learn. https://engineering.rappi.com/serve-your-first-model-with-scikit-learn-flask-docker-df95efbbd35e
