Finding Bugs in Kernel. Part 2: Fuzzing the Actual Kernel

In the previous post, we had a crash course on syzkaller, one of the most renowned Linux kernel fuzzers. We explored how to set it up in a non-trivial configuration (on macOS, no less—but the general steps also apply to Linux, of course), compile the kernel with a vulnerable driver, and configure everything for syzkaller to crash it. Now it's time to target real-world code. So, what code?

syzkaller is constantly grinding on Google's servers, relentlessly transforming kilowatts of power into test cases and crashes. Unfortunately, I don’t have an unlimited supply of electricity or a server budget, for that matter. So, I need to be more deliberate in choosing my target.

Thankfully, Google hosts syzbot, a syzkaller dashboard that displays the current fuzzing status, findings, and, most importantly, coverage for Linux subsystems.


We can use the latter to our advantage to find targets with less-than-ideal coverage. What qualifies as "less than ideal"? I don't know, but here goes nothing:

  • More than 0%: If something isn’t covered at all, perhaps there’s a good reason for it—like the subsystem doesn’t directly consume user input or has hardware requirements that are hard to meet in a fuzzing environment.
  • Less than, say, 15–20%: so the subsystem hasn't been thoroughly explored yet, either.

Candidates

With that in mind, I started browsing the syzbot coverage heatmap. The heatmap is a massive web page—128 MB of pure HTML. Parsing it to filter subsystems by coverage would have been a good idea, but it was already too late at night for good ideas. So, I went with the good old "read-it-to-the-end" approach.


Here are some of my notes:

  • jffs2 (~50%), fs/jffs2 (0%) - A suspicious combination. Perhaps there’s something worth exploring here?
  • bcachefs, fs/bcachefs - 10%
  • net/atm - 1%
  • net/dsa - 2%
  • net/smc - 19%
  • net/sunrpc - 13%
  • nfs, fs/nfs - 37k blocks, 1% - Why so low?
  • net/tipc - 49% - High coverage, but still interesting due to a recent UAF found in it.
  • orangefs - 2%
  • Lots of other *fs subsystems are well under 10%.
  • fs/ubifs - 17k blocks, 0%!
  • scsi? - 60k blocks, 7% - Why so low? Is it hardware-dependent?
  • usb - 8%, usb-storage - 10%
  • virt/drivers - 14%
  • wireless/drivers - 1% - This will surely require some hacks.

An interesting idea is to fuzz hardware-dependent functions like MMC, PCI, NFC, pvrusb2, qat, RAID, RDMA, etc.

In the end, I decided to try two network subsystems: TIPC (despite its high coverage, I was motivated by the recent findings from sam4k) and SMC.

Also, networking is much easier to fuzz: data flows nicely in and out through sockets, and everything can be configured with a few ioctl and setsockopt calls. Compare that to filesystems, where you have to craft binary images and mount and unmount them, or to other subsystems that may depend on specific hardware.

Attack Surface

Now it’s time to read some kernel code to figure out the attack surface. I’ll focus exclusively on TIPC from here on, though SMC is practically the same deal. The TIPC code can be found under net/tipc/.

TIPC's Netlink legacy compatibility layer is initialized in the tipc_netlink_compat_start function. This function registers a Generic Netlink family, tipc_genl_compat_family, with the operations structure tipc_genl_compat_ops:


From the code, we can see that there’s essentially one operation callback: tipc_nl_compat_recv, which receives Netlink messages. The main dispatcher for these messages is the tipc_nl_compat_handle function, which uses a switch statement to handle TIPC_CMD_* commands.

That was the legacy Netlink support, but what about the newer version? We can observe a similar code flow in net/tipc/netlink.c. The tipc_netlink_start function registers the tipc_genl_family with an operations array, tipc_genl_v2_ops.

In this newer implementation, all operations are directly exposed rather than being hidden behind a single function:

The control flow is as follows:

In net/tipc/socket.c, we have protocol operations for the different TIPC socket types (message, packet, and stream). TIPC uses the same set of callbacks for all of them.


These Netlink commands, along with the usual socket operations, constitute our attack surface.

As you remember, TIPC is already covered by syzkaller. Of course, some of the commands supported by the subsystem are already defined in the fuzzer’s files: syzkaller/sys/linux/socket_tipc.txt and syzkaller/sys/linux/socket_tipc_netlink.txt.

I hoped to find discrepancies between the actual attack surface and the syscalls already defined in syzkaller. I did find one, but unfortunately, it was just a single syscall: setsockopt for the TIPC_NODELAY option.

Here’s how I described it:

setsockopt$TIPC_NODELAY(fd sock_tipc, level const[SOL_TIPC], opt const[TIPC_NODELAY], val ptr[in, int32], len bytesize[val])        

Below are the syscalls I ended up enabling in my syzkaller configuration:

"enable_syscalls": [
        "socket$tipc",
        "socketpair$tipc",
        "bind$tipc",
        "connect$tipc",
        "accept4$tipc",
        "getsockname$tipc",
        "getpeername$tipc",
        "sendmsg$tipc",
        "ioctl$SIOCGETLINKNAME",
        "ioctl$SIOCGETNODEID",
        "setsockopt$TIPC_IMPORTANCE",
        "setsockopt$TIPC_SRC_DROPPABLE",
        "setsockopt$TIPC_DEST_DROPPABLE",
        "setsockopt$TIPC_CONN_TIMEOUT",
        "setsockopt$TIPC_MCAST_BROADCAST",
        "setsockopt$TIPC_MCAST_REPLICAST",
        "setsockopt$TIPC_GROUP_LEAVE",
        "setsockopt$TIPC_GROUP_JOIN",
        "getsockopt$TIPC_IMPORTANCE",
        "getsockopt$TIPC_SRC_DROPPABLE",
        "getsockopt$TIPC_DEST_DROPPABLE",
        "getsockopt$TIPC_CONN_TIMEOUT",
        "getsockopt$TIPC_NODE_RECVQ_DEPTH",
        "getsockopt$TIPC_SOCK_RECVQ_DEPTH",
        "getsockopt$TIPC_GROUP_JOIN",
        "sendmsg$TIPC_CMD_SET_LINK_TOL",
        "sendmsg$TIPC_CMD_SET_LINK_PRI",
        "sendmsg$TIPC_CMD_SET_LINK_WINDOW",
        "sendmsg$TIPC_CMD_ENABLE_BEARER",
        "sendmsg$TIPC_CMD_GET_BEARER_NAMES",
        "sendmsg$TIPC_CMD_GET_MEDIA_NAMES",
        "sendmsg$TIPC_CMD_SHOW_PORTS",
        "sendmsg$TIPC_CMD_GET_REMOTE_MNG",
        "sendmsg$TIPC_CMD_GET_MAX_PORTS",
        "sendmsg$TIPC_CMD_GET_NETID",
        "sendmsg$TIPC_CMD_GET_NODES",
        "sendmsg$TIPC_CMD_GET_LINKS",
        "sendmsg$TIPC_CMD_SET_NODE_ADDR",
        "sendmsg$TIPC_CMD_SHOW_NAME_TABLE",
        "sendmsg$TIPC_CMD_SHOW_LINK_STATS",
        "sendmsg$TIPC_CMD_GET_MEDIA_NAMES",
        "sendmsg$TIPC_CMD_DISABLE_BEARER",
        "sendmsg$TIPC_CMD_RESET_LINK_STATS",
        "sendmsg$TIPC_CMD_SET_NETID",
        "socket$nl_generic",
        "syz_genetlink_get_family_id$tipc",
        "listen$tipc",
        "recvmsg$tipc",
        "shutdown$tipc",
        "close$tipc",
        "ppoll$tipc",
        "getsockopt$TIPC_SOCK_RECVQ_USED",
        "syz_genetlink_get_family_id$tipc2",
        "sendmsg$TIPC_NL_BEARER_DISABLE",
        "sendmsg$TIPC_NL_BEARER_ENABLE",
        "sendmsg$TIPC_NL_BEARER_GET",
        "sendmsg$TIPC_NL_BEARER_ADD",
        "sendmsg$TIPC_NL_BEARER_SET",
        "sendmsg$TIPC_NL_SOCK_GET",
        "sendmsg$TIPC_NL_PUBL_GET",
        "sendmsg$TIPC_NL_LINK_GET",
        "sendmsg$TIPC_NL_LINK_SET",
        "sendmsg$TIPC_NL_LINK_RESET_STATS",
        "sendmsg$TIPC_NL_MEDIA_GET",
        "sendmsg$TIPC_NL_MEDIA_SET",
        "sendmsg$TIPC_NL_NODE_GET",
        "sendmsg$TIPC_NL_NET_GET",
        "sendmsg$TIPC_NL_NET_SET",
        "sendmsg$TIPC_NL_NAME_TABLE_GET",
        "sendmsg$TIPC_NL_MON_SET",
        "sendmsg$TIPC_NL_MON_GET",
        "sendmsg$TIPC_NL_MON_PEER_GET",
        "sendmsg$TIPC_NL_PEER_REMOVE",
        "sendmsg$TIPC_NL_UDP_GET_REMOTEIP",
        "sendmsg$TIPC_NL_KEY_SET",
        "sendmsg$TIPC_NL_KEY_FLUSH"
      ]        

Performance Improvements

The default way of using syzkaller is with QEMU: syzkaller spins up and manages as many QEMU instances as you need. As you know, QEMU can operate in two modes: pure emulation (TCG mode) or accelerated mode, which leverages a hypervisor (like KVM on Linux) for better performance.

But macOS doesn’t have KVM. Thankfully, another accelerator is available: HVF (Apple's Hypervisor Framework). The only thing I had to do was change the QEMU options to make them compatible with HVF.

Anyone who has worked with complex QEMU VMs knows how "easy" and "intuitive" the configuration process is. So after some trial and error, I managed to adapt the default configuration:

qemu-system-aarch64 -m 2048 -smp 2 -chardev socket,id=SOCKSYZ,server=on,wait=off,host=localhost,port=42150 -mon chardev=SOCKSYZ,mode=control -display none -serial stdio -no-reboot -name VM-2 -device virtio-rng-pci -machine virt,virtualization=on,gic-version=max -cpu max,sve=off,pauth=off -accel tcg,thread=multi -device virtio-net-pci,netdev=net0 -netdev user,id=net0,restrict=on,hostfwd=tcp:127.0.0.1:18538-:22 -hda linux_kernel/rootfs.ext3 -snapshot -kernel linux_kernel/Image -append root=/dev/vda console=ttyAMA0 console=ttyAMA0 root=/dev/vda        

to these parameters, which finally worked with HVF:

qemu-system-aarch64 -m 2048 -smp 2 -chardev socket,id=SOCKSYZ,server=on,wait=off,host=localhost,port=63361 -mon chardev=SOCKSYZ,mode=control -display none -serial stdio -no-reboot -name VM-0 -device virtio-rng-pci -machine virt -accel hvf -device virtio-net-pci,netdev=net0 -netdev user,id=net0,restrict=on,hostfwd=tcp:127.0.0.1:32746-:22 -hda linux_kernel/rootfs.ext3 -snapshot -kernel linux_kernel/Image-6.10.8 -append "debug earlyprintk=serial slub_debug=UZ console=ttyAMA0 root=/dev/vda" -cpu cortex-a57        

syzkaller allows you to specify additional QEMU options, but unfortunately, the platform-dependent options are hardcoded. Since, realistically, nobody but me is likely to be running syzkaller on macOS, I simply patched the options directly in the code and called it a day.

In the end, switching from TCG to HVF made the setup 3–5 times faster with the same configuration (number of VMs, processes, and RAM), so I was pretty happy with this optimization.

Config and build

If you’re new to building the kernel, check out my previous post. The process here is essentially the same—we just need to enable TIPC and SMC along with a few dependencies to ensure they’re built into the kernel.

$ make ARCH=arm64 defconfig
$ scripts/config -e KCOV -e KCOV_INSTRUMENT_ALL -e DEBUG_FS -e NET_9P -e NET_9P_VIRTIO -e TIPC -e KCOV_ENABLE_COMPARISONS -e KALLSYMS -e KALLSYMS_ALL -e DEBUG_INFO -e KASAN -d RANDOMIZE_BASE -d RANDOMIZE_MEMORY -e SMC -e INET -e INFINIBAND

Then, as usual:

$ make ARCH=arm64 oldconfig
$ make ARCH=arm64 -j $(nproc)

Fuzzing

It always helps to lower your expectations when fuzzing the kernel. So, as usual, I just started the campaign, made sure it was running smoothly, and went back to sleep. And yes, let me emphasize that under no circumstances did I check on the fuzzer in the middle of the night.

The next morning, I was greeted with this:


As you can see, those "KASAN: slab-use-after-free" messages look pretty promising. Nice!

Well, not so fast. Automatic reproduction failed, so I started by digging into the crash logs. The call trace already looked suspicious because none of the listed functions were explicitly part of TIPC.


After reading several articles and docs on reproducing syzkaller crashes, I managed to get my head around it. Unfortunately, the bugs just refused to trigger.

You’ll notice there are plenty of bug reports on the syzbot page marked as inconclusive. I guess I can classify my crashes the same way.


Oh well, time to move on to the next target. But that’s a story for another time.

Thanks for reading, and let’s stay in touch!

If you want to hear more from me, consider subscribing to my Telegram or WhatsApp channels.
