登录查看更多内容

K-Nearest Neighbor Machine Learning algorithm

Dr. Ganapathi Pulipaka

发布日期: 2016年3月10日

The German credit dataset can be downloaded from UC Irvine, Machine learning community to indicate the predicted outcome if the loan applicant defaulted or not. Applying the logistic regression with three variables duration, amount, and installment, K-means classification, and K-Nearest Neighbor machine learning algorithm.

# Logistic regression

# Load the file from the hard disk after setting the work directory

germandata - read.csv("Creditdata.csv")

# Print dataset to see the pattern of the data

germandata

# The variable response is leveraged to evaluate the probability of the default outcome of the credit loan

germandata$Response - factor(germandata$Response)

# The subset of the data has been created to leverage the variables duration, amount, installment, and response

germandata - germandata[,c("duration","amount","installment","Response")]

# Print the dataset to see the data for these variables

germandata

#Perform the summary function on the dataset to see the data

summary(germandata)

#Sample output for 10 rows:

> germandata

duration amount installment Response

1 6 1169 A143 1

2 48 5951 A143 2

3 12 2096 A143 1

4 42 7882 A143 1

5 24 4870 A143 2

6 36 9055 A143 1

7 24 2835 A143 1

8 36 6948 A143 1

9 12 3059 A143 1

10 30 5234 A143 2

11 12 1295 A143 2

Dr. Ganapathi Pulipaka的更多文章

Can US Launch Next Generation AI Weapon Program

2023年8月17日

Can US Launch Next Generation AI Weapon Program

The next generation fighter jet program in America is truly impressive. With the advancements in global technology, the…

1 条评论
10 Most Influential Artificial Intelligence Executives in 2019 On The Globe by @analyticsinme - Analytics InSight Magazine

2019年5月22日

10 Most Influential Artificial Intelligence Executives in 2019 On The Globe by @analyticsinme - Analytics InSight Magazine

Dr. Ganapathi Pulipaka is a Chief Data Scientist for AI strategy, architecture, application development of Machine…

1 条评论
The Future Of Humanity: Artificial Intelligence by Buzzfeed Magazine.

2019年5月16日

The Future Of Humanity: Artificial Intelligence by Buzzfeed Magazine.

Take note of these two words: Artificial Intelligence. They will not hear about anything else with more emphasis on the…
Data Superheroes among US: The Whole Next Level of Human Brain by Brooke Whistance via @TheOdyssey

2019年5月14日

Data Superheroes among US: The Whole Next Level of Human Brain by Brooke Whistance via @TheOdyssey

Every individual possesses a specific talent and ability and sometimes more than one skill and different abilities can…
A New Book: The Future of Data Science and Parallel Computing

2018年8月13日

A New Book: The Future of Data Science and Parallel Computing

A New book Released https://www.amazon.

1 条评论
Building a Neural Net to Visualize High-Dimensional Data in TensorFlow

2018年6月19日

Building a Neural Net to Visualize High-Dimensional Data in TensorFlow

Word embeddings and high-dimensional data are ubiquitous in many facets of deep learning research such as natural…
Installation Guide for TensorFlow on macOS High Sierra 10.13.4 for your DeepLearning w/ Java, C, and Go

2018年6月19日

Installation Guide for TensorFlow on macOS High Sierra 10.13.4 for your DeepLearning w/ Java, C, and Go

This installation particularly focuses on macOS High Sierra version 10.13.

1 条评论
Ranked as Top Business Intelligence and Analytics Influencer for 2018 by Onalytica

2018年6月18日

Ranked as Top Business Intelligence and Analytics Influencer for 2018 by Onalytica

https://www.onalytica.
Tera-Peta-Exa-Zetta-Yotta: The Road to Technological Singularity - Interview with MirrorReview

2018年6月15日

Tera-Peta-Exa-Zetta-Yotta: The Road to Technological Singularity - Interview with MirrorReview

Modern technology has unlocked the data fabric of analytics with the potential of machine intelligence in day-to-day…

3 条评论
A Data Science Guide and Predictions for Future by GP Pulipaka published by Onalytica

2018年6月14日

A Data Science Guide and Predictions for Future by GP Pulipaka published by Onalytica

Key Topics: Machine Learning, Deep Learning, Data Science, IoT, SAP, Cloud Computing, Distributed Computing, Networks…

See all articles

K-Nearest Neighbor Machine Learning algorithm

Dr. Ganapathi Pulipaka

Dr. Ganapathi Pulipaka的更多文章

社区洞察

其他会员也浏览了

Look-ahead bias

Why is it called Support Vector Machine(SVM)?

Machine Learning Unveils House Price Predictions!

Machine Learning in R

Zero-Knowledge Proof Explained

What is hiding behind the term “mean Average Precision”?

Data Distribution in Machine Learning

What Are We Measuring When We Evaluate Large Vision-Language Models?

Understanding statistical definition of Bias and Variance

A Bit on "Missing Values" and "Imputation" in Machine Learning

Dr. Ganapathi Pulipaka的更多文章

Can US Launch Next Generation AI Weapon Program

10 Most Influential Artificial Intelligence Executives in 2019 On The Globe by @analyticsinme - Analytics InSight Magazine

The Future Of Humanity: Artificial Intelligence by Buzzfeed Magazine.

Data Superheroes among US: The Whole Next Level of Human Brain by Brooke Whistance via @TheOdyssey

A New Book: The Future of Data Science and Parallel Computing

Building a Neural Net to Visualize High-Dimensional Data in TensorFlow

Installation Guide for TensorFlow on macOS High Sierra 10.13.4 for your DeepLearning w/ Java, C, and Go

Ranked as Top Business Intelligence and Analytics Influencer for 2018 by Onalytica

Tera-Peta-Exa-Zetta-Yotta: The Road to Technological Singularity - Interview with MirrorReview

A Data Science Guide and Predictions for Future by GP Pulipaka published by Onalytica

社区洞察

其他会员也浏览了

Look-ahead bias

Why is it called Support Vector Machine(SVM)?

Machine Learning Unveils House Price Predictions!

Machine Learning in R

Zero-Knowledge Proof Explained

What is hiding behind the term “mean Average Precision”?

Data Distribution in Machine Learning

What Are We Measuring When We Evaluate Large Vision-Language Models?

Understanding statistical definition of Bias and Variance

A Bit on "Missing Values" and "Imputation" in Machine Learning