
From Code to Kernel: Why is my "Hello World" so Big?

Mohit Mishra

Engineering @ Amadeus || JGEC 2023 || KWoC 2021 || GWOC 2021

Published: November 18, 2024


Table of Contents

About This Series
What am I doing?
Introduction
Our Starting Point: The Simplest C Program
Introduction to the ELF Format
Executable Files: Not Just Your Code
Examining the Sections
Essential Code Sections
Dynamic Linking Infrastructure
Runtime Support Sections
Understanding the Size Contributors
Can We Make It Smaller?
Why Keep All This "Overhead"?
Conclusion
Further Reading

About This Series

This is the first chapter in a thirteen-chapter series on what happens when you run a program.

You can get more details on this mini-book here:

Github: https://github.com/mohitmishra786/underTheHoodOfExecutables/

Website: https://mohitmishra786.github.io/underTheHoodOfExecutables/

What am I doing?

Currently Writing Book "The Operating System Architect's Manual": https://github.com/mohitmishra786/myJourneyOfBuildingOS
My GitHub: https://github.com/mohitmishra786
Personal Tech Blogs: https://mohitmishra786.github.io/chessman
Medium: https://medium.com/@mohitmishra786687

Introduction

When beginning their journey with C programming on Linux, developers often start with the quintessential "Hello, World!" program. It's a rite of passage, a first step into the world of programming. However, this simple program holds a fascinating mystery that we'll unravel in this post: Why does such a tiny program compile into a surprisingly large executable?

Our Starting Point: The Simplest C Program

Let's begin with the classic "Hello, World!" program:

#include <stdio.h>

int main() {
    printf("Hello, World!\n");
    return 0;
}

Save this as hello.c and compile it with GCC:

gcc -o hello hello.c

Now, let's examine its size with ls -l:

-rwxrwxr-x 1 chessman chessman 15960 Nov  7 13:16 hello
-rw-rw-r-- 1 chessman chessman    79 Nov  7 13:15 hello.c

15,960 bytes! That's shocking when you consider that our source code is merely 79 bytes. Let's put this in perspective:

Source code: 79 bytes
Executable: 15,960 bytes
Ratio: The executable is roughly 202 times larger than the source code!

Introduction to the ELF Format

Before we dive into the specifics, it's important to understand that our executable is an ELF (Executable and Linkable Format) file, the standard binary format for executables on Linux. We'll explore ELF in great detail in Chapter 2, but for now, let's understand its basic structure.

An ELF file consists of several key components:

ELF Header
Program Header Table
Various Sections
Section Header Table

Let's use readelf -h hello to peek at the ELF header:

This header alone is 64 bytes! We'll explore these fields in detail in Chapter 2, "ELF: Demystifying the Executable Format."

Executable Files: Not Just Your Code

An executable file on Linux is not merely a raw dump of your compiled C code. Instead, it's a meticulously organized structure containing various segments of information crucial for the operating system to load and execute your program.

These segments serve diverse purposes:

Code Segment (.text): This section houses the heart of your program - the compiled machine instructions generated from your C code. It's where the printf function call and the loop logic in a more complex program would reside.
Data Segments (.data, .rodata, .bss): These segments hold the variables and constants used by your program. Initialized global variables find their home in .data, constant values (like the string "Hello, World!") reside in .rodata, and uninitialized global variables are allocated space in .bss.
Header Information: Executable files begin with a header that acts as a guide for the operating system. It contains metadata like architecture type, entry point, and section layout details.
Symbol Table: This table plays a critical role in linking (which we'll explore in-depth in later posts). It maps function and variable names used in your code to their corresponding addresses within the executable. This mapping is essential for resolving references between different parts of your program or when linking with external libraries.
Relocation Information: This section comes into play when your program is linked and again when it is loaded into memory. It contains instructions for the linker (at build time) and the dynamic loader (at load time) to adjust memory addresses within the code, ensuring that references to functions, variables, and data structures point to the correct locations.
Debugging Information: If you compile your program with debugging symbols (using the -g flag with gcc), the executable file will also include debug information. This information allows debuggers like gdb to correlate machine instructions back to your original C code, making it possible to step through your program line by line and inspect variables during execution.

Examining the Sections

Let's use objdump -h hello to look at the sections in our executable:

That's a lot of sections! Let's break down the most important ones and understand why they're necessary:

Essential Code Sections

.text Section (The Code)

The .text section contains the actual machine code. Notice several interesting points:

Our printf call has been optimized to puts (we'll explore compiler optimizations in later chapters)
The function prologue and epilogue handle stack frame setup
The actual code is much larger than our simple C source would suggest

We'll explore the details of code sections more thoroughly in Chapter 3, "Where Your C Code Lives: Understanding ELF Sections."

.rodata Section (Read-only Data)

This section contains our string constant "Hello, World!" along with other read-only data. The string is null-terminated and aligned according to the system's requirements.

Dynamic Linking Infrastructure

Our executable needs several sections to support dynamic linking:

.interp Section

$ readelf -p .interp hello

String dump of section '.interp':
  [     0]  /lib64/ld-linux-x86-64.so.2

This section specifies the dynamic linker that will load our program. We'll explore dynamic linking in detail in Chapter 9, "Dynamic Linking in C: Shrinking Executables and Sharing Code."

Dynamic Symbol Sections

These sections (.dynsym, .dynstr) contain information about functions we use from shared libraries. The symbol table's role will be covered extensively in Chapter 7, "Symbols: The Linker's Address Book."

Runtime Support Sections

Initialization and Finalization

$ readelf -d hello | grep INIT
 0x000000000000000c (INIT)               0x1000
 0x0000000000000019 (INIT_ARRAY)         0x3db8
 0x000000000000001b (INIT_ARRAYSZ)       8 (bytes)

These sections (.init, .init_array, .fini, .fini_array) handle program initialization and cleanup. We'll explore how these sections work before main() is called in Chapter 4, "Before main(): The Secret Life of Global Variables in C."

Exception Handling Support

$ readelf -S hello | grep -A2 ".eh_frame"
  [17] .eh_frame_hdr    PROGBITS         0000000000002014  00002014
       0000000000000044  0000000000000000   A       0     0     4
       [Containing entries for all functions]

The .eh_frame and .eh_frame_hdr sections support C++ exceptions and stack unwinding. While our simple C program doesn't use exceptions, these sections are included to support interoperability with C++ code and for proper stack traces during crashes.

Understanding the Size Contributors

Let's break down where all those bytes go:

$ size --format=GNU hello
      text       data        bss      total filename
       367       1609          8       1984 hello

But this only tells part of the story. Let's get a more detailed view:

Core Program Components (~0.3KB): Includes machine code, read-only data, initialized data, and BSS section.
Dynamic Linking Support (~1KB): Comprises dynamic symbol table, string tables, global offset table, and procedure linkage table.
Runtime Support (~0.3KB): Contains exception handling frames, init/fini arrays, and debug information.
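Metadata and Headers (~0.7KB): Composed of ELF header, program headers, and section headers.

Together these categories account for only about 2.3KB of the 15,960-byte file. Most of the remainder is likely alignment padding: the addresses and offsets seen earlier (0x1000, 0x2014, 0x3db8) show the pieces spread across separate page-sized regions, with unused space in between.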

Can We Make It Smaller?

Yes! Let's try some optimization techniques:

Basic Size Optimization

$ gcc -Os -o hello_small hello.c
$ strip hello_small
$ ls -l hello_small
-rwxrwxr-x 1 chessman chessman 14472 Nov  7 13:35 hello_small

The -Os flag optimizes for size, and strip removes the symbol table along with any debugging information.

Static Linking (for comparison)

$ gcc -static -o hello_static hello.c
$ ls -l hello_static
-rwxrwxr-x 1 chessman chessman 900344 Nov  7 13:37 hello_static

Static linking makes our executable much larger because the library code our program needs is copied directly into it! We'll explore the trade-offs between static and dynamic linking in Chapter 9.

Advanced Optimization (preview)

$ gcc -Os -fdata-sections -ffunction-sections -Wl,--gc-sections -o hello_opt hello.c
$ strip hello_opt
$ ls -l hello_opt
-rwxrwxr-x 1 chessman chessman 14464 Nov  7 13:38 hello_opt

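This asks the compiler to place each function and data item in its own section (-ffunction-sections, -fdata-sections) and the linker to discard sections that nothing references (-Wl,--gc-sections). This is section garbage collection, not link-time optimization, which GCC enables separately with -flto. For a program this small there is almost nothing to discard, so the result shrinks by only 8 bytes. We'll explore these techniques in Chapter 8, "Customizing the Layout: Introduction to Linker Scripts."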

Why Keep All This "Overhead"?

While our executable might seem bloated, each component serves crucial purposes:

Dynamic Linking Support: Provides mechanisms like shared libraries to allow multiple programs to use the same code, simplifying updates by patching shared code, and efficiently using system resources by loading shared code into memory once.
Runtime Infrastructure: Manages the lifecycle of a program from startup to shutdown, including initializing variables, setting up the execution environment, handling runtime errors, and offering hooks for debugging and performance profiling.
Platform Compatibility: Ensures that programs can run on different systems with minimal to no changes by defining standardized ways to load programs, incorporate security measures like address space layout randomization (ASLR), and integrate with system-level debugging tools.

Conclusion

Our journey through the "Hello, World!" program has revealed that modern executables are sophisticated containers that package not just our code, but also the infrastructure needed to:

Load the program correctly
Link to shared libraries
Initialize the runtime environment
Handle errors gracefully
Support debugging and profiling
Ensure platform compatibility

In the upcoming chapters, we'll dive deeper into each of these aspects:

Chapter 2 will explore the ELF format in detail
Chapter 3 will examine how different types of code and data are organized
Chapter 4 will reveal what happens before main() is called
Chapters 5-8 will cover linking, symbols, and memory layout
Chapters 9-12 will dive into dynamic linking and advanced topics

Understanding these concepts empowers us to:

Debug programs more effectively
Optimize executable size and loading time
Make informed decisions about linking and loading
Write more efficient and maintainable code
