03. TCP Server and Client
Become familiar with the socket API. The code is incomplete and incorrect because network programming is more than calling APIs, which you’ll learn as you go.
3.1 Prerequisites
Get familiar with Linux
Although the principles of network programming are the same, there are many platform differences on Windows & MacOS. For beginners, it’s most convenient to just use Linux, even if you have no Linux experience. You don’t need to know much about Linux to program in it.
- Get a Linux environment either via VirtualBox, WSL, or cloud providers (VPS).
- Learn how to edit, move, copy, and delete files. You don’t have to write code in Linux. Learn how to copy files into Linux, or share files with a VM.
- Compile code with
g++
. You don’t have to mess with build systems like makefiles.
$ g++ -Wall -Wextra -Og -g foo.cpp bar.cpp -o prog
$ ./prog
Basic programming skills
- C programming concepts: array, struct, memory, pointers.
- Debugging skills:
- Print stuff with
printf()
; verify conditions withassert()
. - Inspect syscalls with
strace
. - Inspect live programs or core dumps with
gdb
, show stack traces and etc.
- Print stuff with
C++ features are used only for optional, minor conveniences like vector & string. You don’t need to know C++, but you do need to know about dynamic arrays:
struct MyString { char *data; size_t length; size_t capacity; };
Learn how to get documentation
This is not a reference book, we will not include every detail about the socket API.
man socket.2
This command shows the man page for the socket()
syscall. On Linux, all socket API methods are syscalls. Man pages are
divided into several sections, as specified by the numerical suffix.
Examples:
man read.2
returns theread()
syscall (section 2 is for syscalls).man read
returns theread
shell command (in section 1; not what you want).man socket.2
return thesocket()
syscall.man socket.7
returns the socket interface overview, not the syscall.
Man pages are great for looking up things you already know, but not for learning new things. There are great online resources for learning, such as Beej’s Guide.
3.2 Create a TCP Server
Let’s make the pseudo code real: Read data from the client, write a response, that’s it.
= socket()
fd
bind(fd, address)
listen(fd)while True:
= accept(fd)
conn_fd
do_something_with(conn_fd) close(conn_fd)
Step 1: Obtain a socket handle
The socket()
syscall takes 3 integer arguments.
int fd = socket(AF_INET, SOCK_STREAM, 0);
AF_INET
is for IPv4. UseAF_INET6
for IPv6 or dual-stack sockets.SOCK_STREAM
is for TCP. UseSOCK_DGRAM
for UDP.- The 3rd argument is 0 and useless for our purposes.
The combination of the 3 arguments determines the socket protocol:
Protocol | Arguments |
---|---|
IPv4+TCP | socket(AF_INET, SOCK_STREAM, 0) |
IPv6+TCP | socket(AF_INET6, SOCK_STREAM, 0) |
IPv4+UDP | socket(AF_INET, SOCK_DGRAM, 0) |
IPv6+UDP | socket(AF_INET6, SOCK_DGRAM, 0) |
man socket.2
lists all the flags, but only certain
combinations are accepted. We’ll only be using TCP, so you can forget
about those arguments for now. By the way, man ip.7
tells
you how to create TCP/UDP sockets and the required
#include
s.
Step 2: Set socket options
There are many options that change the behavior of a socket, such as
TCP no delay, IP QoS, etc. (none are our concern). These options are set
via the socksockopt()
API. Like the bind()
API, this just passes a parameter to the OS as the actual socket has not
been created yet.
int val = 1;
(fd, SOL_SOCKET, SO_REUSEADDR, &val, sizeof(val)); setsockopt
- The combination of the 2nd & 3rd arguments specifies which option to set.
- The 4th argument is the option value.
- Different options use different types, so the size of the option value is also needed.
In this case, we set the SO_REUSEADDR
option to an
int
value of 1
, this option accepts a boolean
value of 0
or 1
. What does this do? This is
related to delayed packets and TCP TIME_WAIT
. Understanding
this requires a non-trivial amount of TCP knowledge, you can read the explanations.
The effect of SO_REUSEADDR
is important: if it’s not set
to 1
, a server program cannot bind to the same IP:port it
was using after a restart. This is generally undesirable TCP behavior.
You should enable SO_REUSEADDR
for all listening sockets!
Even if you don’t understand what exactly it is.
You can look up other socket options in man socket.7
,
man ip.7
, man tcp.7
, but don’t expect to
understand them all.
Step 3: Bind to an address
We’ll bind to the wildcard address 0.0.0.0:1234
. This is
just a parameter for listen()
.
struct sockaddr_in addr = {};
.sin_family = AF_INET;
addr.sin_port = ntohs(1234); // port
addr.sin_addr.s_addr = ntohl(0); // wildcard IP 0.0.0.0
addrint rv = bind(fd, (const sockaddr *)&addr, sizeof(addr));
if (rv) { die("bind()"); }
struct sockaddr_in
holds an IPv4:port pair stored as big
endian numbers, converted by ntohs()
and
ntohl()
. For example, 1.2.3.4
is represented
by ntohl(0x01020304)
.
struct sockaddr_in {
uint16_t sin_family; // AF_INET
uint16_t sin_port; // port in big endian
struct in_addr sin_addr; // IPv4
};
struct in_addr {
uint32_t s_addr; // IPv4 in big endian
};
For IPv6, use struct sockaddr_in6
instead. The
addr
argument accepts both address types, so the method
also needs the struct size because they are different.
struct sockaddr_in6 {
uint16_t sin6_family; // AF_INET6
uint16_t sin6_port; // port in big endian
uint32_t sin6_flowinfo; // ignore
struct in6_addr sin6_addr; // IPv6
uint32_t sin6_scope_id; // ignore
};
struct in6_addr {
uint8_t s6_addr[16]; // IPv6
};
struct sockaddr_in
and struct sockaddr_in6
have different sizes, so the struct size (addrlen
) is
needed.
int bind(int sockfd, const struct sockaddr *addr, socklen_t addrlen);
struct sockaddr
is not used anywhere, just type cast
struct sockaddr_in
or struct sockaddr_in6
to
this pointer type to match the function prototype.
Step 4: Listen
All the previous steps are just passing parameters. The socket is
actually created after listen()
. The OS will automatically
handle TCP handshakes and place established connections in a queue. The
application can then retrieve them via accept()
.
// listen
= listen(fd, SOMAXCONN);
rv if (rv) { die("listen()"); }
The 2nd argument is the size of the queue, which in our case is
SOMAXCONN
. SOMAXCONN
is defined as 128 on
Linux, which is sufficient for us.
Step 5: Accept connections
The server enters a loop that accepts and processes each client connection.
while (true) {
// accept
struct sockaddr_in client_addr = {};
socklen_t addrlen = sizeof(client_addr);
int connfd = accept(fd, (struct sockaddr *)&client_addr, &addrlen);
if (connfd < 0) {
continue; // error
}
(connfd);
do_something(connfd);
close}
The accept()
syscall also returns the peer’s address.
The addrlen
argument is both the input and output size.
Step 6: Read & write
Our dummy processing is just 1 read()
and 1
write()
.
static void do_something(int connfd) {
char rbuf[64] = {};
ssize_t n = read(connfd, rbuf, sizeof(rbuf) - 1);
if (n < 0) {
("read() error");
msgreturn;
}
("client says: %s\n", rbuf);
printf
char wbuf[] = "world";
(connfd, wbuf, strlen(wbuf));
write}
You can replace read
/write
with
send
/recv
. The difference is that
send
/recv
has an extra argument to pass some
optional flags that we don’t need.
ssize_t write(int fd, const void *buf, size_t len);
ssize_t send(int fd, const void *buf, size_t len, int flags);
For now, we have ignored the return value of write()
and
there is no error handling. We’ll write real programs in the next
chapter.
3.3 Create a TCP Client
Write something, read back from the server, then close the connection.
int fd = socket(AF_INET, SOCK_STREAM, 0);
if (fd < 0) {
("socket()");
die}
struct sockaddr_in addr = {};
.sin_family = AF_INET;
addr.sin_port = ntohs(1234);
addr.sin_addr.s_addr = ntohl(INADDR_LOOPBACK); // 127.0.0.1
addrint rv = connect(fd, (const struct sockaddr *)&addr, sizeof(addr));
if (rv) {
("connect");
die}
char msg[] = "hello";
(fd, msg, strlen(msg));
write
char rbuf[64] = {};
ssize_t n = read(fd, rbuf, sizeof(rbuf) - 1);
if (n < 0) {
("read");
die}
("server says: %s\n", rbuf);
printf(fd); close
INADDR_LOOPBACK
is defined as 0x7f000001
,
which is the address 127.0.0.1
.
Compile our programs with the following command line:
g++ -Wall -Wextra -O2 -g 03_server.cpp -o server
g++ -Wall -Wextra -O2 -g 03_client.cpp -o client
Run ./server
in one window and then run
./client
in another window:
$ ./server
client says: hello
$ ./client
server says: world
3.4 More on socket API
Some important but not immediately relevant things.
Understand `struct sockaddr`
Let’s look at these function prototypes:
int accept(int sockfd, struct sockaddr *addr, socklen_t len);
int connect(int sockfd, const struct sockaddr *addr, socklen_t len);
int bind(int sockfd, const struct sockaddr *addr, socklen_t len);
We never used struct sockaddr
, instead we forcibly cast
either struct sockaddr_in
or
struct sockaddr_in6
to this pointer type. Here is how these
structs are defined.
// pointless
struct sockaddr {
unsigned short sa_family; // AF_INET, AF_INET6
char sa_data[14]; // useless
};
// IPv4:port
struct sockaddr_in {
sa_family_t sin_family; // AF_INET
uint16_t sin_port; // port number, big endian
struct in_addr sin_addr; // IPv4 address
};
// IPv6:port
struct sockaddr_in6 {
sa_family_t sin6_family; // AF_INET6
uint16_t sin6_port; // port number, big endian
uint32_t sin6_flowinfo;
struct in6_addr sin6_addr; // IPv6 address
uint32_t sin6_scope_id;
};
// can store both sockaddr_in & sockaddr_in6
struct sockaddr_storage {
sa_family_t ss_family; // AF_INET, AF_INET6
char __some_padding[__BIG_ENOUGH_NUMBER];
};
The socket API is weird in that it defines many pointless types.
struct sockaddr
has no use at all;struct sockaddr *
is practically justvoid *
.struct sockaddr_storage
is supposed to hold both address types, which can be trivially replaced byunion { struct sockaddr_in v4; struct sockaddr_in6 v6 }
.struct sockaddr_in
&struct sockaddr_in6
are the only useful and concrete structs.sin_addr
&sin6_addr
are pointlessly nested structs with just a single field.*_family
is practically a 16-bit integer, yet it has its own type.
What the API wants to achieve can be expressed by a simple tagged
union
:
struct fictional_sane_sockaddr {
uint16_t family; // tag: AF_INET, AF_INET6
uint16_t port;
union {
struct { uint8_t ipv4[4]; };
struct { uint8_t ipv6[16]; /* ... */ };
};
};
// warning: not compatible with `struct sockaddr_*`
Syscalls, APIs, and libraries
On Linux, each socket function is a syscall wrapper in libc. The socket API is called BSD socket and is supported by all major platforms. On Windows, the API is mostly the same, with minor differences like different function names.
There are also socket libraries, but they are not as useful as you
might think; the main complexity is not the API, but the rest of the
things like protocols, event loops. So a library won’t do much. The
socket API is simple and contains only a few methods. The only scary
part is struct sockaddr_*
.
Specify the local address before `connect()`
bind()
can also be used on the client socket before
connect()
to specify the source address. Without this, the
OS will automatically select a source address. This is useful for
selecting a particular source address if multiple ones are available. If
the port in bind()
is zero, the OS will automatically pick
a port.
Get the address of each side
If you are using wildcard IP or letting the OS pick the port, you
don’t know the exact address. Use getsockname()
to retrieve
the local address of a TCP connection. Use getpeername()
to
retrieve the remote address (the same address returned from
accept()
).
int getsockname(int fd, struct sockaddr *addr, socklen_t *addrlen); // local
int getpeername(int fd, struct sockaddr *addr, socklen_t *addrlen); // remote
Domain name resolution
getaddrinfo()
resolves a domain name into IP addresses.
There is a sample program in its man page.
Unlike other socket APIs, this is not a Linux syscall and is
implemented in libc because name resolution is a complicated and
high-level function on Linux. It involves reading a bunch of files such
as /etc/resolv.conf
and /etc/hosts
before
querying a DNS server with UDP.
Socket and inter-process communication (IPC)
There are mechanisms that allow processes within the same machine to communicate such as Unix domain sockets, pipes, etc. They are just a computer network confined to a single machine, so the programming techniques are the same.
Unix domain sockets share the same API with network sockets. You can
create either packet-based or byte-stream-based Unix domain sockets,
like UDP or TCP. A Unix domain socket is created with different flags on
the socket()
method and uses
struct sockaddr_un
, but the rest is the same. Read
man unix.7
for more info.
A pipe is a one-way byte stream. So you need a protocol like a TCP socket, which is not as trivial as you might think. You’ll learn about protocols in the next chapter.
Variants of read & write
We used read
/write
syscalls for sockets.
They are the most generic IO interface also usable for disk files,
pipes, etc. I list some variants of read
/write
just for your information.
Reading | Writing | Description |
---|---|---|
read |
write |
Read/write with a single continuous buffer. |
readv |
writev |
Read/write with multiple buffers. |
recv |
send |
Has an extra flag. |
recvfrom |
sendto |
Also get/set the remote address (packet-based). |
recvmsg |
sendmsg |
readv /writev
with more flags and controls. |
recvmmsg |
sendmmsg |
Multiple
recvmsg /sendmmsg in 1 syscall. |
Source code: