Security Papers Review #5 Automating Cyber IR with Reinforcement Learning

Well, the full name of this article was too long for a headline: Bridging Automated to Autonomous Cyber Defense: Foundational Analysis of Tabular Q-Learning by Andy Applebaum and a team of Apple folks. It was presented at the same AISec'22 workshop as the article covered in my previous post.

Reinforcement learning (RL) looks like the next big thing in AI, mostly due to its successes in games (Go, chess, and Atari). However, applying RL to real-life problems has been a challenge. Here's my simplistic explanation of why. RL can learn the best strategy, which sounds like exactly what we need in many situations in life. However, RL works well when there are many possible situations (states) but only a small number of possible actions in each state. On top of that, you need to be able to run huge numbers of scenarios to train the agent. This fits nicely with anything you can reliably simulate but is much harder to find in the real world.

RL is a complex topic, and I'm not an expert in it. But I want to learn where RL can be applied to the things I encounter daily, and this article helped me with that. Two things I liked about the paper: 1) I learned about Microsoft's CyberBattleSim - a simulated network environment where attackers and defenders can be automated to train RL agents, and 2) its use of a simple RL technique, tabular Q-learning, which made the paper useful and accessible to non-experts like myself.

The paper approaches an important practical problem - finding an optimal incident response (IR) strategy. Think of analysts who need to make quick decisions under a fast stream of events from the monitored network, and under the pressure of screaming unhappy customers whose machines have become unavailable. The paper evaluates tabular Q-learning as a possible tool to automate the analysts' role.

Slide by Andy Applebaum


In short, RL deals with an agent that can take an action in each state (e.g., isolate a compromised machine). The action drives the system into a new state and yields some reward (e.g., the system becomes safer). Q-learning tries to maximize the cumulative reward by learning the value of each state-action pair as it goes. To choose the next action, it combines past experience (taking the action that brought the highest reward before) with occasionally trying something new. This is actually a good strategy in life as well :-).
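To make this concrete, here is a minimal sketch of tabular Q-learning with epsilon-greedy action selection. This is my own illustration in Python, not the paper's code, and the hyperparameter values are arbitrary:

```python
import random
from collections import defaultdict

# Q maps (state, action) pairs to their estimated long-term reward.
Q = defaultdict(float)

alpha = 0.1    # learning rate: how strongly new experience overrides old estimates
gamma = 0.9    # discount factor: how much future rewards matter
epsilon = 0.1  # exploration rate: how often to try something new

def choose_action(state, actions):
    """Epsilon-greedy: mostly exploit past experience, sometimes explore."""
    if random.random() < epsilon:
        return random.choice(actions)
    return max(actions, key=lambda a: Q[(state, a)])

def update(state, action, reward, next_state, actions):
    """The classic tabular Q-learning update rule."""
    best_next = max(Q[(next_state, a)] for a in actions)
    Q[(state, action)] += alpha * (reward + gamma * best_next - Q[(state, action)])
```

The whole "model" is just the Q table, which is exactly why the technique only scales to small action spaces and why it is so easy to inspect and reason about.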

Slide by Andy Applebaum


In the simulated network, a game is played between attackers and defenders. The attackers can exploit vulnerabilities and move across the network. The defenders can choose between reimaging a machine (making it unavailable), resetting the entire network, revoking the user's credentials, or just waiting. As in real life, the observations of both sides mix true events with some portion of noise and errors. The defending agent is rewarded for keeping the network both safe and available.
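To show how this maps onto code, here is a rough sketch of one defender episode. The action names follow the paper's description above, but the environment interface (reset/step) and the agent object are hypothetical, gym-style placeholders I made up for illustration, not CyberBattleSim's actual API:

```python
from enum import Enum

class DefenderAction(Enum):
    WAIT = 0
    REIMAGE_MACHINE = 1      # removes the compromise but makes the machine unavailable
    REVOKE_CREDENTIALS = 2
    RESET_NETWORK = 3

def run_episode(env, agent, max_steps=100):
    """One defender episode: act on (possibly noisy) observations, collect reward."""
    obs = env.reset()
    actions = list(DefenderAction)
    total_reward = 0.0
    for _ in range(max_steps):
        state = agent.encode(obs)                   # observations mix true events with noise
        action = agent.choose_action(state, actions)
        next_obs, reward, done = env.step(action)   # reward: network stays safe and available
        agent.update(state, action, reward, agent.encode(next_obs), actions)
        obs, total_reward = next_obs, total_reward + reward
        if done:
            break
    return total_reward
```

The agent here is assumed to expose the same choose_action/update logic as the Q-learning sketch above, plus some encoding of raw observations into a discrete state.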

Of course, there is a lot more in the paper. The authors improved upon the simulator, the system state representation, and the loss function (making the agent loss-averse).
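The paper's exact loss-aversion mechanism is not reproduced here, but one simple way to express the idea (purely my own illustration, not necessarily the authors' formulation) is to weight negative outcomes more heavily than positive ones before updating the agent:

```python
def loss_averse(reward, penalty_weight=2.0):
    # Scale losses up so the agent fears negative outcomes more than it
    # values equally sized gains (illustrative only).
    return reward if reward >= 0 else penalty_weight * reward
```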


So, did Q-learning work well? Yes and no. Yes, because, on average, its strategy outperformed the simple baseline approaches. No, because in many specific cases, the baselines were still the top performers. As the paper puts it: "while the learners might offer higher average rewards, no one learner always outperforms the baselines." At the end of the day, much more research is needed to find the optimal learner, but this paper helps us by providing an environment and a benchmark to start with. I liked it and hope you will too.

See my previous article here.

P.S. I may take a short break in posting due to a family celebration.

If you enjoyed this article, please repost it - this will help me know it was useful.
