Credit Card Fraud Detection Using SMOTE Technique


Outlier detection is an interesting application of machine learning. The goal is to identify the data records that accurately profile abnormal behavior of a system. In real-life examples, however, such special data, like fraud and spam, makes up a very small percentage of the overall data population, which poses challenges for developing machine learning models.

In this experiment, we will examine Kaggle's Credit Card Fraud Detection dataset and develop predictive models to detect fraudulent transactions, which account for only 0.172% of all transactions. To deal with the unbalanced dataset, we will first balance the classes of our training data with a resampling technique (SMOTE), and then build a Logistic Regression model by optimizing the average precision score.

We will build and train our model on Google Colab, a free Jupyter notebook environment that runs in the Google cloud and provides free GPU access. For more information, check the official Colab page.

Data Preparation

The dataset contains transactions made with credit cards in September 2013 by European cardholders over a two-day period. There are 492 frauds out of 284,807 transactions in total. The dataset is highly unbalanced: the positive class (frauds) accounts for only 0.172% of all transactions.
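
As a minimal sketch, assuming the Kaggle CSV has been downloaded as creditcard.csv (the file name in the Kaggle download), loading the data and checking the imbalance could look like this:

import pandas as pd

# Load the Kaggle dataset; 'Class' is 1 for fraud, 0 for normal
df = pd.read_csv('creditcard.csv')

counts = df['Class'].value_counts()
print(counts)
print('Fraud ratio: {:.3%}'.format(counts[1] / len(df)))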

Let's visualize the skewness (1 - fraud, 0 - normal):

[Figure: bar chart of transaction counts per class, showing the extreme imbalance]
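
A quick way to produce such a chart, sketched with matplotlib on the df loaded above:

import matplotlib.pyplot as plt

df['Class'].value_counts().plot(kind='bar')
plt.xticks(rotation=0)
plt.xlabel('Class (1 - fraud, 0 - normal)')
plt.ylabel('Number of transactions')
plt.title('Class distribution')
plt.show()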

Evaluation Metrics

Before training our model, we must be clear about how to measure its performance, i.e. what to optimize. Typically, an accuracy score (the fraction of correct predictions) is used to measure predictive model performance. However, it doesn't work well for a highly unbalanced dataset. On our dataset, if one always predicts a given transaction as normal, i.e. non-fraud, the accuracy score will be 0.998 - nearly perfect, yet no fraud will ever be reported! It's a useless predictive model with "perfect" accuracy.
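
To make this concrete, here is a sketch using scikit-learn's DummyClassifier as the always-predict-normal baseline (X and y taken from the df loaded earlier):

from sklearn.dummy import DummyClassifier
from sklearn.metrics import accuracy_score

X = df.drop(columns='Class')
y = df['Class']

# Always predict the most frequent class, i.e. normal
baseline = DummyClassifier(strategy='most_frequent').fit(X, y)
print(accuracy_score(y, baseline.predict(X)))  # ~0.998, yet zero frauds caught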

Let's review two common metrics for model evaluation.

Precision = TP/(TP+FP): measures how accurate the positive predictions are

Recall = TP/(TP+FN): measures how well all actual positives are found

where

TP: True Positives (actually fraud and predicted as fraud)

FP: False Positives (actually normal and predicted as fraud)

FN: False Negatives (actually fraud and predicted as normal)
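
As a tiny numeric sketch with hypothetical labels, using scikit-learn's metric functions:

from sklearn.metrics import precision_score, recall_score

# Hypothetical labels: 4 actual frauds, 6 normal transactions
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 1, 0, 0, 0, 0, 0]  # TP=2, FN=2, FP=1

print(precision_score(y_true, y_pred))  # 2/(2+1) = 0.67
print(recall_score(y_true, y_pred))     # 2/(2+2) = 0.50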

Because letting fraudulent transactions pass through is quite costly to the business and the cardholder, False Negatives (actual frauds predicted as normal) should be minimized. Therefore, a higher Recall is desired.

Often, as a tradeoff, increasing Recall tends to lower Precision. If our model flags too many normal transactions as fraudulent (to increase recall), it becomes very annoying to regular credit card users and drives people away from the service. That's a nightmare no business wants to see. Precision and Recall must be balanced.

A precision-recall curve shows the tradeoff between precision and recall at different prediction thresholds. It can be summarized by an Average Precision score: the mean of the precisions achieved at each threshold, weighted by the increase in recall from the previous threshold. We use Average Precision as a balanced measure of precision and recall to evaluate our model.
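
Both quantities are available directly in scikit-learn; a self-contained sketch with made-up scores:

import numpy as np
from sklearn.metrics import average_precision_score, precision_recall_curve

# Hypothetical ground truth and predicted fraud probabilities
y_true = np.array([0, 0, 1, 1, 0, 1])
y_scores = np.array([0.1, 0.4, 0.35, 0.8, 0.2, 0.7])

precision, recall, thresholds = precision_recall_curve(y_true, y_scores)
print('Average precision: {:.3f}'.format(average_precision_score(y_true, y_scores)))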

Model Training Without Re-sampling

We hold out 20% of the data for testing and use the rest for training. The training data is split into 5 folds for cross-validation. A Logistic Regression model is built for this binary classification task. We run a random search to find the optimal hyperparameters, i.e. the penalty type (L1 or L2) and the regularization strength C.
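
A minimal sketch of this setup with scikit-learn (the parameter ranges are illustrative assumptions, not the exact search space used here; X and y as loaded earlier):

from sklearn.model_selection import train_test_split, RandomizedSearchCV
from sklearn.linear_model import LogisticRegression

# Hold out 20% for testing, stratified to preserve the fraud ratio
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42)

# Random search over penalty type and regularization strength C,
# optimizing average precision with 5-fold cross-validation
param_dist = {'penalty': ['l1', 'l2'], 'C': [0.01, 0.1, 1, 10, 100]}
search = RandomizedSearchCV(
    LogisticRegression(solver='liblinear'),  # liblinear supports both L1 and L2
    param_dist, n_iter=10, scoring='average_precision', cv=5, random_state=42)
search.fit(X_train, y_train)
print('Best parameters for model:', search.best_params_)
print('Best precision-recall score from training:', search.best_score_)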

Took 2.83 minutes to find optimal parameters 
Best parameters for model: {'penalty': 'l2', 'C': 0.1} 
Best precision-recall score from training: 0.7642589240077233
Confusion Matrix 
[[56855     9]  
[   44    54]] 
Classification Report
               precision    recall  f1-score   support
            0       1.00      1.00      1.00     56864
            1       0.86      0.55      0.67        98
     accuracy                           1.00     56962
    macro avg       0.93      0.78      0.84     56962
 weighted avg       1.00      1.00      1.00     56962 

Due to the very limited number of fraud records in the training dataset, our model catches only 55% of frauds, while precision is high because normal transactions dominate the dataset.

Model Training With SMOTE (Over-sampling)

We balance the classes of our training data using SMOTE (Synthetic Minority Over-sampling Technique). SMOTE is an oversampling algorithm that increases the number of minority-class examples by producing synthetic ones, interpolated between existing minority examples and their nearest neighbors. After applying SMOTE, the number of fraud instances in our training dataset equals the number of normal transactions.
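
A sketch using the imbalanced-learn package; note that only the training split is resampled, never the test set:

import numpy as np
from imblearn.over_sampling import SMOTE

# Generate synthetic fraud examples until both classes are the same size
sm = SMOTE(random_state=42)
X_train_res, y_train_res = sm.fit_resample(X_train, y_train)
print(np.bincount(y_train_res))  # equal counts for class 0 and class 1

The same random search as before can then be run on X_train_res and y_train_res.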

Took 6.78 minutes to find optimal parameters 
Best parameters for model: {'penalty': 'l2', 'C': 10} 
Best precision-recall score from training: 0.988851588589584 
Confusion Matrix 
[[55557  1307]  
[    8    90]] 
Classification Report
               precision    recall  f1-score   support
            0       1.00      0.98      0.99     56864
            1       0.06      0.92      0.12        98
     accuracy                           0.98     56962
    macro avg       0.53      0.95      0.55     56962
 weighted avg       1.00      0.98      0.99     56962 

With over-sampling, 92% of frauds are captured, at the cost of a roughly 2% increase in normal transactions reported as fraud (FP). Essentially it's a decision the business has to make: missing many frauds or falsely stopping normal transactions - which is more costly to the business?

Below is the precision-recall curve for our predictions. It has an average precision score of 0.75, which is not bad. One could adjust the prediction threshold to achieve a balance of precision and recall that suits the business.

[Figure: precision-recall curve for the SMOTE model, average precision 0.75]
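
A sketch of threshold adjustment on the fitted model from the random search above (0.9 is an arbitrary illustrative value, not a recommendation):

from sklearn.metrics import classification_report

# Predicted probability of the positive (fraud) class
y_scores = search.best_estimator_.predict_proba(X_test)[:, 1]

# Raising the threshold above the default 0.5 trades recall for precision
threshold = 0.9
y_pred = (y_scores >= threshold).astype(int)
print(classification_report(y_test, y_pred))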

The Python code can be found on my GitHub. The original article, with more technical details, can be found on my blog.

Happy Machine Learning!
