登录查看更多内容

Stopping Bad Bots

Adam Cassar

We protect webapps from threats, reduce infrastructure costs, and speed up performance.

发布日期: 2020年12月3日

Today's sophisticated bad bots often circumvent traditional security countermeasures. They disrupt and damage websites, mobile applications, and APIs. Malicious bot tactics include scraping user and pricing data, creating fake accounts, conducting advertising click fraud, exhausting online inventories and taking websites offline completely with automated DDoS attacks.

About one-quarter of all website traffic in 2019 originated from bad bots, an increase of 18 percent over 2018. Advanced persistent bots (APBs) made up seventy-five percent of that bad bot traffic as they attempted to evade detection by cycling through random IP addresses, using anonymous proxies, and changing their identities (user agent). The top industries in 2019 hardest hit by bad bots included financial services, education, e-commerce and government as well as media and airlines.

“Bot attack campaigns have become big business for threat actors, and major organizations are now fighting to support legitimate users and prospects while keeping attackers out of online applications and services,” says Paula Musich, Research Director, Enterprise Management Associates.

Bots have evolved over the years, from simple scripts into sophisticated networks of distributed agents that can mimic human interactions with machine learning techniques. They can avoid detection by network security technologies that have not kept pace with these new, sophisticated and automated agents.

Mitigating the damage from bad bots and staying ahead of evolving threats necessitates that organizations deploy an array of sophisticated security countermeasures to not only detect bad bots but also render them harmless from an economic perspective.

Bot Countermeasure Best Practices:

The following bad bot countermeasures best practices range from network security to machine learning and behavioural analysis that help to reduce the economic harm that malicious bots inflict on businesses and end-users.

Web Application Firewalls

Web Application Firewalls (WAF) are a common yet essential first line of defence that filter out harmful Layer 7 web application (HTTP) traffic using rules or policies that protect organizations against Distributed Denial of Service (DDoS) bot attacks. WAFs also protect against cross-site forgery, cross-site-scripting (XSS), file inclusion and SQL injection attacks. A WAF protects websites and can be deployed as an appliance, server plug?in, or filter and customized by application type or use case. WAF rules are also flexible and can be updated or changed based on the type of attack.

IP Tracking and Reputation

Sophisticated bots can be detected using network forensics by inspecting received and requested web traffic and assessing whether the requests are from actual users versus bad bots. Requests can be analysed from data sources including Tor/proxy IPs, IP reputation, IP geo-location information, ISP information and IP owners. Additional sources for real-time and near-time malicious IP threat data needed to block attacks can come from network data, CERTs, MITRE and cooperative competitors.

Client/Device Fingerprinting

Fingerprinting attempts to identify devices ranging from PCs, Internet of Things (IOT) or mobile devices and servers using data attributes that create real-time risk profiles to stop bot attacks. A bot detection fingerprinting engine will use web page access data and unique fingerprints for each device to protect against evasion techniques including dynamic IP addresses, anonymising web proxies and residential proxy networks.

Machine Learning

Artificial Intelligence (AI) and machine learning algorithms are increasingly being used to analyse and make recommendations regarding malicious bot mitigation using data from sources such as user activity history, behavioural patterns and meta-data. The benefits of using machine learning methodologies to detect bad bots are the use of custom tailored algorithms that can be deployed to target bots. These algorithms iteratively process user data and identities for discerning emerging bot attack patterns from very large amounts of real-time information.

Tarpitting

Tarpitting is a bot countermeasure that delays and slows down incoming malicious traffic from suspect connections. The technique is used to increase bot attack financial and resource costs in an attempt to discourage malicious actors. Bad bot tar pits can delay bot requests and responses. Innovative tarpitting techniques include requiring bad bots to solve computationally complex math challenges to access resources or web sites thereby slowing down or stopping bot activity.

User Behaviour Analysis

User interaction behaviour attributes and identifying characteristics on a web page or mobile app is different from the behaviour of an automated malicious bot. Factors such as number of pages visited per session, time spent on each web page or within a mobile app and repeat visit frequency all help to differentiate authentic users versus bad bots. Defeating bad bots using Behaviour Analysis involves creating a user model for individual sites using historical visitor data and checking for anomalies that may indicate bad bot activity.

Intent-based Deep Behavior Analysis (IDBA)

As opposed to Behaviour Analysis, Intent-based Deep Behaviour Analysis (IDBA) is a next-generation technique that conducts behavioural analysis at the user intent level versus the commonly used interaction-based behaviour analysis. IDBA consists of intent encoding, intent analysis, and adaptive learning. It also employs machine learning techniques to detect bad bots emulating on-site human behaviour interactions. Bad bot mitigation techniques include the limiting of login page attempts, web authentication pages, product searches and API call authentication pages.

Rate Limiting

Rate Limiting mitigates bad bots and DDoS attacks by restricting the amount of incoming traffic received for specific applications and API endpoints using pre-defined bandwidth or request limitation policies. Web applications, GET versus POST requests, APIs that receive queries, and login credentials all can be blocked if clients, IP or IP and user-agent pairs violate Rate Limiting rules. Intellectual property scraping can also be protected by Rate Limiting policies that restrict repeated image or digital downloads.

Javascript Injection

Using JavaScript Injection techniques can help mitigate bad bot attacks in several ways. Scripts can be placed into web applications that “fingerprint” a user’s browser to distinguish humans versus bad bots emulating “human-like” mouse movements, keystrokes or clicks. Fingerprinting detection may also involve user agent identification, HTML5 canvas and audio fingerprinting and protocol level fingerprinting with TLS and HTTP2. JavaScript combined with browser Cookies can also be used to identify anomalous behaviour from unwanted traffic or bad bots trending over time.

ANYCast DDoS Mitigation

Anycast is an IP addressing method that efficiently routes incoming traffic requests to the nearest location or “node.” Using ANYCast for selective routing enables network load resilience against DDoS attacks by routing traffic across geographically disperse servers and data centres. This prevents network resources from becoming overwhelmed with malicious or irrelevant traffic.

Alternative Content Serving

Serving Alternate and Cached Content when a bad bot is detected provides organizations with the ability to mislead bots but not block them altogether. For instance, e-commerce sites may fool price scraping bots by serving alternative web pages that look like legitimate pages but with higher prices. Serving Cached Content when a bot is detected also minimizes load servers and without affecting site performance.

Challenges

Requests from suspected bots can be redirected to Challenges or puzzles such as a CAPTCHA, also known as a Completely Automated Public Turing test that helps to identify a bad bot versus a human. Online puzzles, such as letter matching are easy for humans to solve but difficult for automated bots. Modern CAPTCHA puzzles require users to identify objects from real-world images such as traffic signals, cars, bikes, crossings or bridges.

Conclusion

Bad bots hijack user accounts, create fake accounts, scrape websites for prices, stock availability, site data and personal information. Bad bots can flood websites with traffic automated distributed denial of service attacks and attack public facing APIs using constantly changing techniques. Bad bots hide behind dynamic IP addresses. They also change their attack signatures, mimic human behaviours, and take over vast networks of hosts and IoT devices creating zombie machines that distribute malware across the internet. Deploying an array of countermeasures ranging from Web Application Firewalls to sophisticated Machine Learning algorithms is an organization's critical primary line of defence against bad bots.

Need help from bots? Contact Us

This article first appeared on the Peakhour.IO blog at https://www.peakhour.io/blog/bad-bot-countermeasures/

要查看或添加评论，请登录

Adam Cassar的更多文章

A Leap in Web with RFC 9460

2023年11月16日

A Leap in Web with RFC 9460

Embracing RFC 9460 The internet is transforming with the introduction of RFC 9460. This groundbreaking development…

1 条评论
Cache-Status Header

2023年11月15日

Cache-Status Header

Cache-Status Header: A New Tool for CDN Caching Analysis CDN caching complexity is well-known. Multiple layers, such as…
Issues That Affect Website Performance

2020年12月11日

Issues That Affect Website Performance

Our last three website performance articles covered the why and how's of testing website performance, and introduced…
Attack of the Bots

2020年12月3日

Attack of the Bots

Bots are software applications that automate repetitive tasks without any human interaction, and have fast become an…

1 条评论

Stopping Bad Bots

Adam Cassar

We protect webapps from threats, reduce infrastructure costs, and speed up performance.

Bot Countermeasure Best Practices:

Web Application Firewalls

IP Tracking and Reputation

Client/Device Fingerprinting

Machine Learning

Tarpitting

User Behaviour Analysis

Intent-based Deep Behavior Analysis (IDBA)

Rate Limiting

Javascript Injection

ANYCast DDoS Mitigation

Alternative Content Serving

Challenges

Conclusion

Adam Cassar的更多文章

社区洞察

其他会员也浏览了

Vulnerable APIs and Bot Attacks Costing Businesses Up to $186 Billion Annually

Microsoft breach update, CISA flags JetBrains, ChatGPT creds sale

The Ntirety Weekly Threat Intelligence Report: August 5, 2024

Cybercriminals Use Go Resty and Node Fetch in 13 Million Password Spraying Attempts

Web Application Security Vulnerabilities

DeepSeek App Exposes Sensitive User Data Without Encryption ??

How To Detect Proxies: Comprehensive Guide

TracWrap: Midnight Bear Claws Its Way Into Internal Systems, Magnet Goblin Keeps Organizations on Edge, and More

Understanding OTP and CAPTCHA Bypass Techniques

Bot Countermeasure Best Practices:

Web Application Firewalls

IP Tracking and Reputation

Client/Device Fingerprinting

Machine Learning

Tarpitting

User Behaviour Analysis

Intent-based Deep Behavior Analysis (IDBA)

Rate Limiting

Javascript Injection

ANYCast DDoS Mitigation

Alternative Content Serving

Challenges

Conclusion

Adam Cassar的更多文章

A Leap in Web with RFC 9460

Cache-Status Header

Issues That Affect Website Performance

Attack of the Bots

社区洞察

其他会员也浏览了

Vulnerable APIs and Bot Attacks Costing Businesses Up to $186 Billion Annually

Microsoft breach update, CISA flags JetBrains, ChatGPT creds sale

The Ntirety Weekly Threat Intelligence Report: August 5, 2024

Cybercriminals Use Go Resty and Node Fetch in 13 Million Password Spraying Attempts

Web Application Security Vulnerabilities

DeepSeek App Exposes Sensitive User Data Without Encryption ??

How To Detect Proxies: Comprehensive Guide

TracWrap: Midnight Bear Claws Its Way Into Internal Systems, Magnet Goblin Keeps Organizations on Edge, and More

Understanding OTP and CAPTCHA Bypass Techniques