登录查看更多内容

Linux File I/O

Gabriel M.

Linux Systems Engineer | IT Infrastructure | Security | Virtualization | Automation | AI | C and Shell Scripting

发布日期: 2020年8月19日

When using system calls for dealing with file I/O, open(), read(), write() and close() are the four functions used to perform their namesake operations. These functions make use of file descriptors to reference open files. A file descriptor can be think of as a small "handle" used to get to the file and is represented by a non-negative integer. The cool thing about the Linux I/O model is that it is a universal I/O model, that is, a file descriptor can refer to all types of files, that is, terminals, devices, pipes, sockets, FIFOs, as well as regular files.

The three basic file descriptors, standard input, output and error, are made available to running programs, which inherit them from the shell that runs them. These descriptors are identified by their default ID, being stdin id 0, stdout id 1 and stderr id 2.

This is why those numbers are mostly seen right after I/O redirection operators inside shell scripts. As an example, the standard output from a ls command could be redirected to ls_output.txt file and, at the same time, the error output for that command could be redirected to another file ls_output_errors.txt, as follows:

ls > ls_output.txt 2> ls_output_errors.txt

For placing both outputs to the same file, the above command could be rewritten as:

ls > ls_ouptut_all.txt 2>&1

Where >2&1 means "redirect file descriptor 2 to the same place as file descriptor 1".

For using I/O file descriptors inside a program, this small C code shows how this can be done. The following program simply copies data from input to output, making use of the four system calls shown above.

#include <stdio.h>
#include <sys/types.h>
#include <sys/stat.h>
#include <fcntl.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

int main(int argc, char *argv[])
{
    int in = 0;
    int out = 0;
    int openFlags = 0;
    mode_t filePermissions = 0;
    ssize_t bytesRead = 0;
    char copyBuffer[4096];
    
    /* Check command line args */
    if (argc != 3 || strcmp(argv[1], "--help") == 0)
    {
        fprintf(stdout,"Usage: %s src-file dst-file\n", argv[0]);
        exit(EXIT_FAILURE);
    }
    
    /* Try to open input/output files */
    in = open(argv[1], O_RDONLY);
    if ( in == -1) {
        fprintf(stdout, "Error opening %s\n", argv[1]);
        exit(EXIT_FAILURE);
    }
    
    openFlags = O_CREAT | O_WRONLY | O_TRUNC;
    filePermissions = S_IRUSR | S_IWUSR |     /* user is rw- */
                S_IRGRP |                     /* group is r-- */
                S_IROTH;                      /* others are r-- */


    out = open(argv[2], openFlags, filePermissions );
    if ( out == -1) {
        fprintf(stdout, "Error opening %s\n", argv[2]);
        exit(EXIT_FAILURE);
    }
                    
    /* While we have data to read from source (or don't get input error)  */
    while ( ( bytesRead = read( in, copyBuffer, COPY_BUFFER_SIZE )) > 0 ) 
        /* we write to the output and check if all was written */
        if ( write( out, copyBuffer, bytesRead ) != bytesRead ) 
            fprintf(stdout, "Fatal error! Could not write whole buffer\n");
        
    /* Did we read anything? */
    if ( bytesRead == -1)
    {
        fprintf(stdout, "Fatal error [read]\n");
        exit(EXIT_FAILURE);
    }
    
    if (close( in ) == -1) 
    {
        fprintf(stdout, "Error closing input!\n");
        exit(EXIT_FAILURE);
    }
    
    if (close( out ) == -1)
    {
        fprintf(stdout, "Error closing output!\n");
        exit(EXIT_FAILURE);
    }
    
    exit(EXIT_SUCCESS);
}

The resulting program can be run as:

gcc copy.c -o copy 

./copy copy.c copy2.c 
./copy /dev/tty1 copy_of_terminal.log 
./copy netinst.iso copy.iso

The interesting thing here is that, because of the universal I/O model, the system calls that were used inside the program do not care about where the data is coming from or going to. They just see "input" and "output", where these could be a regular file, as in the example above, as well as terminals (/dev/tty) or other device/file types. That happens because all the details regarding wetter data comes from/to file system or device are handled by the kernel and the programmer does not need to care about that.

Happy coding!

-- FIN

要查看或添加评论，请登录

Gabriel M.的更多文章

Docker 101

2025年2月15日

Docker 101

A (tiny) introduction to Docker. If you want to get your hands dirty with some Docker content, this quick introduction…
Building custom kernels for Linux + updating the system firmware.

2025年2月14日

Building custom kernels for Linux + updating the system firmware.

In this post I'll show you how you can replace your current Linux kernel with a new one built entirely from the…
Debian complained about missing firmware during installation? Add missing 'non-free' firmware to installation image! :)

2021年2月12日

Debian complained about missing firmware during installation? Add missing 'non-free' firmware to installation image! :)

If you have installed Debian, you might have faced the screen below: And probably got very frustrated, because your…
Using Traps in Bash Scripts

2020年10月29日

Using Traps in Bash Scripts

Imagine that you have a script to perform some critical operation, one that would render the system completely unusable…

2 条评论
Safer Bash Scripting Tips

2020年10月29日

Safer Bash Scripting Tips

I hope you are all well and safe. As "stay safe" is the most used expression during this COVID-19 period, I thought it…

1 条评论
Pointers and 2D-arrays in C

2020年10月17日

Pointers and 2D-arrays in C

When taking the Engineering or CS undergrad path, like 1+1 leads to 2 (let's keep it simple here, shall we?), students…

2 条评论
Linux Kernel Inotify Subsystem

2020年10月12日

Linux Kernel Inotify Subsystem

When dealing with Unix-style filesystems there must be a data structure that is capable of describing an object from…
Shifting bits in C

2020年8月7日

Shifting bits in C

Have you ever asked yourself how much memory space do you waste when writing your code? Sometimes you just need a…
Improving your Linux box security

2020年6月1日

Improving your Linux box security

Did you know that more than never, during these quarantine days, there is a lot more malicious activities undergoing…
Build UnrealEngine on Linux, making it use less disk space! :)

2020年4月28日

Build UnrealEngine on Linux, making it use less disk space! :)

If you are playing with the amazing Unreal Engine on your Linux box, you might have noticed that the final size after…

See all articles

Linux File I/O

Gabriel M.

Linux Systems Engineer | IT Infrastructure | Security | Virtualization | Automation | AI | C and Shell Scripting

Gabriel M.的更多文章

社区洞察

其他会员也浏览了

Brief on Linux process

Understand Leaky Vessels (CVE-2024-21626)

LINUX BOOT PROCESS

Day 06/90 : File Permissions and Access Control Lists

RHEL: "Increase or Decrease Static Partition Size in Linux using "resize2fs" without losing the Data"

ELF Linux Executable PLT and GOT Tables

How to Use the Less Command?

ls-l command: understanding what happens in the shell

What happens when you type ls -l in the shell

Linux File Hierarchy

Gabriel M.的更多文章

Docker 101

Building custom kernels for Linux + updating the system firmware.

Debian complained about missing firmware during installation? Add missing 'non-free' firmware to installation image! :)

Using Traps in Bash Scripts

Safer Bash Scripting Tips

Pointers and 2D-arrays in C

Linux Kernel Inotify Subsystem

Shifting bits in C

Improving your Linux box security

Build UnrealEngine on Linux, making it use less disk space! :)

社区洞察

其他会员也浏览了

Brief on Linux process

Understand Leaky Vessels (CVE-2024-21626)

LINUX BOOT PROCESS

Day 06/90 : File Permissions and Access Control Lists

RHEL: "Increase or Decrease Static Partition Size in Linux using "resize2fs" without losing the Data"

ELF Linux Executable PLT and GOT Tables

How to Use the Less Command?

ls-l command: understanding what happens in the shell

What happens when you type ls -l in the shell

Linux File Hierarchy