
Machine learning for anomaly detection

Naveen Joshi

AI, Robotics & Smart Cities Expert | 600K+ Followers

Published: May 7, 2017

In data mining, anomaly detection refers to the identification of items or events that do not conform to an expected pattern or to the other items in a dataset. Such anomalous items often point to underlying problems such as structural defects, errors, or fraud. Using machine learning for anomaly detection helps speed up detection.

Intrusions are activities that can damage information systems, and intrusion detection has been gaining broad attention. Anomaly detection can be key to addressing intrusions because deviations from normal behavior indicate the presence of intentional or unintentional attacks, defects, faults, and so on. Machine learning algorithms, which learn from data and make predictions based on that data, give companies a simple yet effective approach for detecting and classifying these anomalies. Machine learning for anomaly detection includes techniques that offer a promising alternative for detecting and classifying anomalies based on an initially large set of features.

Take a look at two machine learning approaches that can enable effective anomaly detection:

Supervised Machine Learning for Anomaly Detection

This method requires a labeled training set that contains both normal and anomalous samples for constructing the predictive model. Theoretically, supervised methods are believed to provide a better detection rate than unsupervised methods. The most common supervised algorithms are supervised neural networks, parameterization of the training model, support vector machines, k-nearest neighbors, Bayesian networks, and decision trees. K-nearest neighbors (k-NN) is one of the most conventional nonparametric techniques used in supervised learning for anomaly detection. It computes the distances between an unlabeled input point and the labeled training points, then assigns the unlabeled point to the most common class among its k nearest neighbors. The Bayesian network is another popular model that can encode probabilistic relationships among variables of interest. This technique is generally used for anomaly detection in combination with statistical schemes. Bayesian networks offer several advantages, including the capability of encoding interdependencies between variables and of predicting events, along with the ability to incorporate both prior knowledge and data.
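The k-NN idea can be illustrated with a minimal sketch in plain Python. The function name `knn_predict`, the toy feature values, and the choice of Euclidean distance with k = 3 are assumptions made for illustration; they are not taken from any particular library or dataset.

```python
import math
from collections import Counter

def knn_predict(train_points, train_labels, query, k=3):
    """Label `query` by majority vote among its k nearest training points."""
    # Euclidean distance from the query to every labeled training point.
    distances = [
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    ]
    # Keep the k closest neighbors and count how often each label occurs.
    nearest = sorted(distances, key=lambda pair: pair[0])[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical toy data: two numeric features per sample, already labeled.
train_points = [(1.0, 1.1), (1.2, 0.9), (0.9, 1.0), (1.1, 1.2),
                (8.0, 9.0), (9.0, 8.5)]
train_labels = ["normal", "normal", "normal", "normal",
                "anomaly", "anomaly"]

print(knn_predict(train_points, train_labels, (1.05, 1.0)))  # near the normal group
print(knn_predict(train_points, train_labels, (8.5, 8.8)))   # near the anomalous group
```

In practice the features would usually be scaled first, since raw distances are dominated by whichever feature has the largest range.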

Unsupervised Machine Learning for Anomaly Detection

These techniques do not require labeled training data. They are based on two basic assumptions. First, they presume that most network connections are normal traffic and only a small percentage is abnormal. Second, they expect malicious traffic to be statistically different from normal traffic. Based on these two assumptions, groups of similar instances that appear frequently are assumed to be normal traffic, and groups that appear infrequently are considered malicious. The most common unsupervised algorithms are self-organizing maps (SOM), K-means, C-means, the expectation-maximization (EM) meta-algorithm, adaptive resonance theory (ART), and the one-class support vector machine. One popular technique is the SOM, whose main objective is to reduce the dimensionality of the data for visualization.
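The two assumptions can be sketched with K-means, one of the algorithms listed above: cluster the unlabeled data, then treat clusters that hold only a small share of the points as suspicious. This is a rough illustration, not a production detector. The helper names `kmeans_labels` and `flag_rare_clusters`, the farthest-point initialization, the 25% threshold, and the toy points are all assumptions chosen to keep the example small and deterministic.

```python
import math
from collections import Counter

def kmeans_labels(points, k, iterations=10):
    """Plain Lloyd's k-means; returns one cluster index per point."""
    # Farthest-point initialization (deterministic, chosen for this sketch).
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        )
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda i: math.dist(p, centroids[i]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for i in range(k):
            members = [p for p, lab in zip(points, labels) if lab == i]
            if members:
                centroids[i] = tuple(
                    sum(coord) / len(members) for coord in zip(*members)
                )
    return labels

def flag_rare_clusters(labels, max_share=0.25):
    """Mark every point whose cluster holds less than `max_share` of the data."""
    sizes = Counter(labels)
    rare = {c for c, n in sizes.items() if n / len(labels) < max_share}
    return [lab in rare for lab in labels]

# Hypothetical unlabeled data: eight similar points and two outlying ones.
points = [(1.0, 1.0), (1.1, 0.9), (0.9, 1.2), (1.2, 1.1), (1.0, 0.8),
          (0.8, 1.0), (1.1, 1.1), (0.9, 0.9), (9.0, 9.1), (9.2, 8.9)]

labels = kmeans_labels(points, k=2)
flags = flag_rare_clusters(labels)
print([p for p, is_anomaly in zip(points, flags) if is_anomaly])
```

The threshold encodes the first assumption (anomalies are rare), while the clustering itself relies on the second (anomalies look statistically different). If either assumption fails, for example during a flood of malicious traffic, this approach will mislabel data.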

Machine learning techniques are now receiving considerable attention among anomaly detection researchers as a way to address the weaknesses of knowledge-based detection techniques.

Anomaly detection can help catch fraud and uncover unusual activity in large, complex Big Data sets. This can prove useful in areas such as banking security, natural sciences, medicine, and marketing, which are prone to malicious activities. With machine learning, an organization can intensify its search and increase the effectiveness of its digital business initiatives.

Inderpreet Kaur

7 years ago

Very informative!!

Muqtader MBA

Business Analyst -IT

7 years ago

Thanks post...quite informative.

Prof. Dr. Simmy (HON) Kataria

Prof. Dr. Simmy Kataria.

7 years ago

Detection invervent caveat filling optimum biosciences with Neuro sciences and nano feel with n to the power of n zenned to zed fed

Prof. Dr. Simmy (HON) Kataria

Prof. Dr. Simmy Kataria.

7 years ago

Sukh Chen vish kalra


