登录查看更多内容

The Clone Trick of Programs

Hongbin Li

PhD, PEng, CFSE, PMP, SMIEEE

发布日期: 2022年1月29日

The clone trick in the title refers to the supernatural ability that the monkey king possesses in the story of the Journey to the West, one of the most popular Chinese traditional novels. When the monkey king needs a helper, he clones himself and goes on to other places while the cloned replica stays on current task. Wouldn’t it be nice if we could have such an ability? We could have our replica continue working while we enjoy life and free time. In the digital world, running programs often perform exactly the same trick as the monkey king.

A program is normally an executable machine code file; when it is loaded by the Operating System (OS), it is read into the memory and executed by the processor. A running program is called a process, an entity that consumes computer resources, such as memory, processor time, and I/O devices. A computer can have many running processes at the same time. The OS manages the processes and schedules them to run based on their priorities.?

A program is written to perform a function for users or the OS. Some functions are unique and singular such as a calculator or video player; some functions are of server type and require calling other programs to fulfill users’ requests. For example, a shell program displays a command prompt and waits for users’ inputs. Upon receiving users’ commands, the shell program executes the corresponding commands, displays outputs, then returns to command prompt and waits for user inputs again.?

Another example is a TCP server program. It listens to the incoming data on an Internet socket; when a connection request is received, the program accepts the connection and responds to the request.? There could be multiple requests at the same time; the server program should be designed to handle each request while continuing to listen on the socket for new requests.?

In above shell and server examples, the programs have two distinct functions: 1) Wait for user command input/request; 2) Respond to the command input/request. Programs can use the clone trick to create a new process to fulfill these two functions.

In Linux, a program can use fork() system call to create a new process. When fork() is called, the OS creates a new child process, by duplicating the memory segments of the original process, which becomes the parent process. Now both the parent and child processes share and execute the same code.?

At first glance, it seems strange that the program needs to create a new process to execute a new program. Why can’t the same program execute the new program? The reason is the new program to be executed is in a separate executable file. To execute a separate program, a new process must be created; fork() can be considered a quick and easy way to create a new process by duplicating memory segments of the existing process, basically a copy-paste in process creation. If we were to avoid creating a new child process, we will need to include the code for all kinds of possible user commands in the shell program, which increases memory usage and reduces modularity.?

As the intent of a fork is to have two processes carry out different tasks, the program needs to be able to distinguish between the parent and the child process. This is done by the different return values to the parent and child process: the child process ID to the parent process and 0 to the child process. The program can check the return value and execute different codes. The child process can run the exec() system call to load a new executable file, which transforms itself into a completely new process with memory segments from the new program. Because in practice, the fork() is almost always followed by an exec(), it would be a waste of time to replicate the memory segments and then replace them with the new program. So the OS uses a copy-on-write technique, which replicates the pointers to the memory segments and only creates new memory segments when there are writes to the memory.

领英推荐

Linking and Generating Executable Binaries for RISC-V

SiBrain Technologies Pvt Ltd 1 年前

Compilation Method for Rockchip Driver

Forlinx Embedded Technology Co.,Ltd. 5 个月前

cURL Demystified

Ashutosh Dongaonkar 4 个月前

The separation between the fork() and exec() allows the child process to set up the runtime environment before executing the new program. For example, if the user command needs to redirect output to a file instead of the standard output device, the child process can close the file descriptor of the standard output and then open a new file descriptor for the file. The command program is executed as usual, but the output will go to the file as the new file descriptor replaces the standard output device.

Once the child process is created, the parent process can choose to continue running in parallel or perform the wait() system call, which suspends the parent process and waits for the child process to finish. In the case of the shell program, the shell stops taking user input when a user command is being executed. So the child process finishes execution first, then the parent process continues execution.

For the server program, concurrent execution is required. The child process handles the requests while the parent process listens to the new requests at the same time. This would require both processes to have their own sockets for receiving and sending data. When the server process listens to a socket, it runs the accept() system call, which returns a new file descriptor that corresponds to a new socket when a connection request is received. Then, the server program runs fork() to create a child process with the replicated listening socket and new connection socket. The child process would handle the request via the new socket while the parent process continues accepting connection requests on the original socket.

The clone ability provided by the system calls from the OS seems to give running programs a sense of life. They can now create child processes to help with tasks, a supernatural ability in the digital world.

Reference

[1] The Linux Programming Interface. Michael Kerrisk, no starch press, 2010.

要查看或添加评论，请登录

Hongbin Li的更多文章

Vector Representation

2024年11月3日

Vector Representation

Vectors are a foundational concept in linear algebra, representing ordered lists of numbers. While simple in structure,…
The Recursive Cycle for Ultra Reliability

2024年9月7日

The Recursive Cycle for Ultra Reliability

Introduction Some modern-day technological systems have very low tolerance for failures because their consequences can…
Embracing Failure: The Hidden Key to Learning and Growth

2024年8月13日

Embracing Failure: The Hidden Key to Learning and Growth

During my time working at chemical sites, I participated the "Incident Learning Process," a critical method used to…

2 条评论
Ontario Regulation 429/04: Enhancing Benefits for Class A Consumers and Promoting Clean Energy

2024年7月25日

Ontario Regulation 429/04: Enhancing Benefits for Class A Consumers and Promoting Clean Energy

I learned recent updates to this Ontario regulation last month at a PEO event in June 2024. This change provides an…
Why Greatness Cannot Be Planned: Embracing the Power of Open Exploration

2024年6月9日

Why Greatness Cannot Be Planned: Embracing the Power of Open Exploration

Note: Following write-up is generated by ChatGPT by feeding it with an initial outline, follow-up comments, and…
Supply Demand Cycles

2022年11月8日

Supply Demand Cycles

In microeconomics, the free market is considered to be equipped with an invisible hand, sending demand signals via…
From Bush Soul to Jung’s Writing

2022年9月29日

From Bush Soul to Jung’s Writing

The above pictures were taken at the Royal Saskatchewan Museum. The first picture is a statue of an Indigenous figure;…
Psychologists’ Interpretation of Meaning

2022年7月17日

Psychologists’ Interpretation of Meaning

We all love jokes. Jokes are funny.
Complex Number Magic

2022年5月17日

Complex Number Magic

The above figure is a sketch of a treasure hunt puzzle in George Gamow’s book “One two three … infinity”[1]. According…
Program Microscope

2022年2月28日

Program Microscope

Any C source code needs to be compiled into an object file and then linked to an executable file before it can be run…

See all articles

The Clone Trick of Programs

Hongbin Li

PhD, PEng, CFSE, PMP, SMIEEE

领英推荐

Hongbin Li的更多文章

社区洞察

其他会员也浏览了

cURL Demystified

C++20: The Library

Creating a Command-Line Program in C++: argc vs cxxopts

Day 16: File System Module in Node.js

Tales from the developer's command line

C++ Core Guidelines: A Short Detour to Contracts in C++20

Topic - 4 (Introduction to Linked Lists)

shadcn-ui/ui codebase analysis: How does shadcn-ui CLI work? — Part 2.1

HackTheBox – Starting Point (Tier 1) Crocodile

What happens when you type ls -l in the shell ?

领英推荐

Hongbin Li的更多文章

Vector Representation

The Recursive Cycle for Ultra Reliability

Embracing Failure: The Hidden Key to Learning and Growth

Ontario Regulation 429/04: Enhancing Benefits for Class A Consumers and Promoting Clean Energy

Why Greatness Cannot Be Planned: Embracing the Power of Open Exploration

Supply Demand Cycles

From Bush Soul to Jung’s Writing

Psychologists’ Interpretation of Meaning

Complex Number Magic

Program Microscope

社区洞察

其他会员也浏览了

cURL Demystified

C++20: The Library

Creating a Command-Line Program in C++: argc vs cxxopts

Day 16: File System Module in Node.js

Tales from the developer's command line

C++ Core Guidelines: A Short Detour to Contracts in C++20

Topic - 4 (Introduction to Linked Lists)

shadcn-ui/ui codebase analysis: How does shadcn-ui CLI work? — Part 2.1

HackTheBox – Starting Point (Tier 1) Crocodile

What happens when you type ls -l in the shell ?