登录查看更多内容

How Programs Are Built in the C Programming Language

ANJUM NAZIR

Cyber Security Architect | Blockchain Researcher | Founder & CEO Geeks Hub

发布日期: 2024年9月24日

On Linux and other similar environments, we generally use tons of commands, utilities, and tools on a daily basis. However, we often overlook the process of creating these applications or programs. In this article, I will give a walk-through of the build process for Linux and other similar systems. My article will focus on the compilation system in the C language.

Programs Are Translated by Other Programs into Different Forms

GNU C Compiler (GCC) is one of the most commonly used compilers in Linux-based systems. In Ubuntu, it can be found in the build-essential package. GCC reads the source file 'hello.c' and translates it into an executable object file named hello. An object file is one that contains object code. Object code contains a sequence of statements or instructions in a machine language (ML) or an intermediate language (IL), such as Register Transfer Language (RTL), Java Byte Code, Microsoft Intermediate Language (MSIL), etc.

The translation process

The diagram below illustrates the four phases in which a C program undergoes translation. The process comprises four phases (i) preprocessing, (ii) compilation, (iii) assembling, and (iv) linking, known collectively as the compilation process.

1. Preprocessing Phase

The preprocessor (cpp) modifies the original C program according to directives that begin with the # sign. The source code passes through this first phase. The preprocessing phase performs the following tasks.

Comments are removed.
Expansion of Macros
Expansion of the included files.

A preprocessor directive is basically instructions to the compiler to perform certain tasks before starting the actual compilation.

?On Linux / UNIX based systems

?$ gcc –E hello.c –o hello.i

?On Windows based systems

?> gcc.exe –E hello.c –o hello.i

2. Compilation Phase

The compiler (cc1) translates the text file 'hello.i' into the text file 'hello.s'. hello.s contains an assembly-language program now. Each statement in an assembly-language program exactly describes one low-level machine-language instruction in a standard text form. Assembly language is useful because it provides a common output language for different compilers for different high-level languages.

?On Linux / UNIX based systems

?$ gcc –S hello.i –o hello.s

?On Windows based systems

领英推荐

Why you should not use Ternary Operators in C

Hamish Laird 1 年前

Storage Classes in C

Yamil Garcia 1 年前

C++ Exercise 2: What is Concurrency and Why is it…

Surya Ambati 6 个月前

?> gcc.exe –S hello.i –o hello.s

3. Assembly Phase

Next, the assembler (as) translates hello.s into machine language instructions, packages them in a form known as a relocatable object program, and stores the result in the object file hello.o.

The 'hello.o' file is a binary file whose bytes encode machine language instructions rather than characters. If we were to view hello.o with a text editor, it would appear to be gibberish.

?On Linux / UNIX based systems

?$ gcc –c hello.s –o hello.o

?On Windows based systems

?> gcc.exe –c hello.i –o hello.s

??

4. Linking Phase

Notice that our hello program calls the printf function, which is part of the standard C library provided by every C compiler. The printf function resides in a separate precompiled object file called printf.o (generally libc library), which must be somehow merged or linked with our hello.o program. The linker (ld) handles this merging or linking process.

The result is the hello file, which is an executable object file (or simply executable) that is ready to be loaded into memory and executed by the system.

?On Linux / UNIX based systems

?$ gcc hello.o –o hello

?On Windows based systems

?> gcc.exe hello.o –o hello

Rizwan Ahmed Khan, Ph.D.

Incoming Professor at IBA, Karachi | AI for Humanity Advocate | Erasmus Mundus Scholar | Award-Winning AI Researcher | Passionate about Responsible AI & Machine Learning

5 个月

Ahmed Abdullah Khan

要查看或添加评论，请登录

ANJUM NAZIR的更多文章

Understanding Object Files

2024年9月24日

Understanding Object Files

In my previous article I demonstrated how programs are built in the C programming language. Click here to read the full…
Lastline leverages the open-source QEMU (Quick EMUlator) emulator

2015年9月19日

Lastline leverages the open-source QEMU (Quick EMUlator) emulator

The Lastline platform provides a full-system emulation approach to detecting malware and potential breach risks. At the…
Linux Rapid Tracks Online Training

2015年8月26日

Linux Rapid Tracks Online Training

Many individuals professional workers cannot manage to take full time training due to their professional / social…
Network Simulator 2 (NS2) Online Training

2015年8月18日

Network Simulator 2 (NS2) Online Training

Network Simulator 2 (ns2) is an open-source event-driven simulator designed for research in computer networks. This is…

1 条评论

How Programs Are Built in the C Programming Language

ANJUM NAZIR

Cyber Security Architect | Blockchain Researcher | Founder & CEO Geeks Hub

Programs Are Translated by Other Programs into Different Forms

The translation process

1. Preprocessing Phase

2. Compilation Phase

领英推荐

3. Assembly Phase

??

4. Linking Phase

ANJUM NAZIR的更多文章

社区洞察

其他会员也浏览了

The Power of C.

Introduction to C Programming: A Beginner’s Guide

The Rust programming language will make it into the Linux kernel

A High-Level Introduction to the C Programming Language

Old Programming Languages

A memory location is more important to a computer than the data stored in that location…

C, Windows, Red Team and Me.

Top 5 Programming Languages for Computer Engineers

Difference between C and C#

Programs Are Translated by Other Programs into Different Forms

The translation process

1. Preprocessing Phase

2. Compilation Phase

领英推荐

3. Assembly Phase

??

4. Linking Phase

ANJUM NAZIR的更多文章

Understanding Object Files

Lastline leverages the open-source QEMU (Quick EMUlator) emulator

Linux Rapid Tracks Online Training

Network Simulator 2 (NS2) Online Training

社区洞察

其他会员也浏览了

The Power of C.

Introduction to C Programming: A Beginner’s Guide

The Rust programming language will make it into the Linux kernel

A High-Level Introduction to the C Programming Language

Old Programming Languages

A memory location is more important to a computer than the data stored in that location…

C, Windows, Red Team and Me.

Top 5 Programming Languages for Computer Engineers

Difference between C and C#