Unix domain socket explained

In client-server computing, a Unix domain socket is a Berkeley socket that allows data to be exchanged between two processes executing on the same Unix or Unix-like host computer.[1] This is similar to an Internet domain socket that allows data to be exchanged between two processes executing on different host computers.

Regardless of the range of communication (same host or different host),[2] Unix computer programs that perform socket communication are similar. The only range of communication difference is the method to convert a name to the address parameter needed to bind the socket's connection. For a Unix domain socket, the name is a /[[Path (computing)|path]]/[[filename]]. For an Internet domain socket, the name is an [[IP address]]:[[Port (computer networking)|Port number]]. In either case, the name is called an address.[3]

Two processes may communicate with each other if each obtains a socket. The server process binds its socket to an address, opens a listen channel, and then continuously loops. Inside the loop, the server process is put to sleep while waiting to accept a client connection.[4] Upon accepting a client connection, the server then executes a read system call that will block wait. The client connects to the server's socket via the server's address. The client process then writes a message for the server process to read. The application's algorithm may entail multiple read/write interactions. Upon completion of the algorithm, the client executes exit[5] and the server executes close.[6]

For a Unix domain socket, the socket's address is a /path/filename identifier. The server will create /path/filename on the filesystem to act as a lock file semaphore. No I/O occurs on this file when the client and server send messages to each other.[7]

History

Sockets first appeared in Berkeley Software Distribution 4.2 (1983).[8] It became a POSIX standard in 2000.[8] The application programming interface has been ported to virtually every Unix implementation and most other operating systems.[8]

Socket instantiation

Both the server and the client must instantiate a socket object by executing the socket system call. Its usage is:[9] int socket(int domain, int type, int protocol);

The domain parameter should be one of the following common ranges of communication:[10]

  1. Within the same host by using the constant AF_UNIX
  2. Between two hosts via the IPv4 protocol by using the constant AF_INET
  3. Between two hosts via the IPv6 protocol by using the constant AF_INET6
  4. Within the same host or between two hosts via the Stream Control Transmission Protocol by using the constant SOCK_SEQPACKET

The Unix domain socket label is used when the domain parameter's value is AF_UNIX. The Internet domain socket label is used when the domain parameter's value is either AF_INET or AF_INET6.[11]

The type parameter should be one of two common socket types: stream or datagram.[10] A third socket type is available for experimental design: raw.

  1. SOCK_STREAM will create a stream socket. A stream socket provides a reliable, bidirectional, and connection-oriented communication channel between two processes. Data are carried using the Transmission Control Protocol (TCP).[10]
  2. SOCK_DGRAM will create a datagram socket. A Datagram socket does not guarantee reliability and is connectionless. As a result, the transmission is faster. Data are carried using the User Datagram Protocol (UDP).[12]
  3. SOCK_RAW will create an Internet Protocol (IP) datagram socket. A Raw socket skips the TCP/UDP transport layer and sends the packets directly to the network layer.[13]

For a Unix domain socket, data (network packets) are passed between two connected processes via the transport layer — either TCP or UDP.[14] For an Internet domain socket, data are passed between two connected processes via the transport layer and the Internet Protocol (IP) of the network layer — either TCP/IP or UDP/IP.[14]

The protocol parameter should be set to zero for stream and datagram sockets.[2] For raw sockets, the protocol parameter should be set to IPPROTO_RAW.[15]

socket return value

socket_fd = socket(int domain, int type, int protocol);

Like the regular-file [[open (system call)|open]] system call, the socket system call returns a file descriptor.[2] The return value's suffix _fd stands for file descriptor.

Server bind to /path/filename

After instantiating a new socket, the server binds the socket to an address. For a Unix domain socket, the address is a /path/filename.

Because the socket address may be either a /path/filename or an IP_address:Port_number, the socket application programming interface requires the address to first be set into a structure. For a Unix domain socket, the structure is:[16] struct sockaddr_un

The _un suffix stands for unix. For an Internet domain socket, the suffix will be either _in or _in6. The sun_ prefix stands for socket unix.[16]

Computer program to create and bind a stream Unix domain socket:[7]

  1. include
  2. include
  3. include
  4. include
  5. include
  6. include
  7. include
  8. include

/* Should be 91 characters or less. Some Unix-like are slightly more. *//* Use /tmp directory for demonstration only. */ char *socket_address = "/tmp/mysocket.sock";

void main(void)

The second parameter for bind is a pointer to struct sockaddr. However, the parameter passed to the function is the address of a struct sockaddr_un. struct sockaddr is a generic structure that is not used. It is defined in the formal parameter declaration for bind. Because each range of communication has its own actual parameter, this generic structure was created as a cast placeholder.[17]

Server listen for a connection

After binding to an address, the server opens a listen channel to a port by executing listen. Its usage is:[18] int listen(int server_socket_fd, int backlog);

Snippet to listen:if (listen(server_socket_fd, 4096)

-1) assert(0);

For a Unix domain socket, listen most likely will succeed and return 0. For an Internet domain socket, if the port is in use, listen returns -1.[18]

The backlog parameter sets the queue size for pending connections.[19] The server may be busy when a client executes a connect request. Connection requests up to this limit will succeed. If the backlog value passed in exceeds the default maximum, then the maximum value is used.[18]

Server accept a connection

After opening a listen channel, the server enters an infinite loop. Inside the loop is a system call to accept, which puts itself to sleep.[4] The accept system call will return a file descriptor when a client process executes connect.[20]

Snippet to accept a connection:int accept_socket_fd;

while (1)

Server I/O on a socket

When accept returns a positive integer, the server engages in an algorithmic dialog with the client.

Stream socket input/output may execute the regular-file system calls of [[read (system call)|read]] and [[write (system call)|write]].[6] However, more control is available if a stream socket executes the socket-specific system calls of send and recv. Alternatively, datagram socket input/output should execute the socket-specific system calls of sendto and recvfrom.[21]

For a basic stream socket, the server receives data with read(accept_socket_fd) and sends data with write(accept_socket_fd).

Snippet to illustrate I/O on a basic stream socket:int accept_socket_fd;

while (1)

  1. define BUFFER_SIZE 1024

void server_algorithmic_dialog(int accept_socket_fd)

Server close a connection

The algorithmic dialog ends when either the algorithm concludes or read(accept_socket_fd) returns < 1.[6] To close the connection, execute the close system call:[6]

Snippet to close a connection:int accept_socket_fd;

while (1)

Snippet to illustrate the end of a dialog:

  1. define BUFFER_SIZE 1024

void server_algorithmic_dialog(int accept_socket_fd)

Client instantiate and connect to /path/filename

Computer program for the client to instantiate and connect a socket:[5]

  1. include
  2. include
  3. include
  4. include
  5. include
  6. include
  7. include
  8. include

/* Must match the server's socket_address. */char *socket_address = "/tmp/mysocket.sock";

void main(void)

Client I/O on a socket

If connect returns zero, the client can engage in an algorithmic dialog with the server. The client may send stream data via write(client_socket_fd) and may receive stream data via read(client_socket_fd).

Snippet to illustrate client I/O on a stream socket:

  1. define BUFFER_SIZE 1024

void client_algorithmic_dialog(int client_socket_fd)

Notes and References

  1. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . Sockets are a method of IPC that allow data to be exchanged between applications, either on the same host (computer) or on different hosts connected by a network. . 978-1-59327-220-3 . 1149.
  2. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1150.
  3. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . The server binds its socket to a well-known address (name) so that clients can locate it. . 1150.
  4. Book: Unix Network Programming . Stevens . Richard W. . Fenner . Bill . Rudoff . Andrew M. . Pearson Education . 2004 . 3rd . 81-297-0710-1 . Normally, the server process is put to sleep in the call to accept, waiting for a client connection to arrive and be accepted. . 14.
  5. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1169.
  6. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1159.
  7. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1166.
  8. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1149.
  9. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1153.
  10. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1151.
  11. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1197.
  12. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1152.
  13. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1184.
  14. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1181.
  15. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1153.
  16. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1165.
  17. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1154.
  18. Web site: Linux manual page for listen .
  19. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1157.
  20. Web site: Linux manual page for accept .
  21. Book: Kerrisk , Michael . The Linux Programming Interface . No Starch Press . 2010 . 978-1-59327-220-3 . 1160.