
Using Node.js with Nginx for High-Load Networking


When it comes to building high-throughput web applications, Nginx and Node.js are a natural match. Both are designed around an event-driven model and can easily break through the C10K bottleneck of traditional web servers such as Apache. The default configuration already delivers high concurrency, but if you want to push past a few thousand requests per second on cheap hardware, there is still work to be done.

This article assumes that readers use Nginx's HttpProxyModule as a reverse proxy in front of upstream Node.js servers. We will cover sysctl tuning on Ubuntu 10.04 and later, as well as tuning of the Node.js application and of Nginx itself. A Debian system can reach the same goal, but the tuning steps differ.

Network Tuning

If you do not first understand the underlying transport mechanism of Nginx and Node.js and optimize it in a targeted way, even the most detailed tuning of the two applications may be in vain. Normally, Nginx connects the client and the upstream application through TCP sockets.

The system imposes many thresholds and limits on TCP, set through kernel parameters. The default values of these parameters are chosen for general-purpose use and do not meet the high-traffic, short-lived connection pattern that a web server requires.


Here are some TCP parameters that are candidates for tuning. To make them take effect, put them in /etc/sysctl.conf, or in a new configuration file such as /etc/sysctl.d/99-tuning.conf, and then run sysctl -p to have the kernel load them. We use sysctl-cookbook to do this tedious work.

Note that the values listed here are safe to use, but it is still recommended that you study the meaning of each parameter so you can choose a more appropriate value for your load, hardware, and usage.

net.ipv4.ip_local_port_range = 1024 65000
net.ipv4.tcp_tw_reuse = 1
net.ipv4.tcp_fin_timeout = 15
net.core.netdev_max_backlog = 4096
net.core.rmem_max = 16777216
net.core.somaxconn = 4096
net.core.wmem_max = 16777216
net.ipv4.tcp_max_syn_backlog = 20480
net.ipv4.tcp_max_tw_buckets = 400000
net.ipv4.tcp_no_metrics_save = 1
net.ipv4.tcp_rmem = 4096 87380 16777216
net.ipv4.tcp_syn_retries = 2
net.ipv4.tcp_synack_retries = 2
net.ipv4.tcp_wmem = 4096 65536 16777216
vm.min_free_kbytes = 65536
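
For example, assuming the settings above were saved as /etc/sysctl.d/99-tuning.conf (that filename is only an example), they can be applied without a reboot and spot-checked like this:

sudo sysctl -p /etc/sysctl.d/99-tuning.conf   # load this specific file into the running kernel
sysctl net.ipv4.ip_local_port_range           # print one value to confirm it took effect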

Let's highlight a few of the important ones.

net.ipv4.ip_local_port_range

To serve a downstream client on behalf of an upstream application, Nginx must open two TCP connections: one to the client and one to the application. When a server receives many connections, the system's available ports can be exhausted quickly. Increasing the range in net.ipv4.ip_local_port_range makes more ports available. If an error like "possible SYN flooding on port 80. Sending cookies" shows up in /var/log/syslog, it may mean the system cannot find an available port for a pending connection; raising net.ipv4.ip_local_port_range can reduce this error.

net.ipv4.tcp_tw_reuse

When a server has to cycle through a large number of TCP connections, many of them end up in the TIME_WAIT state, which means the connection itself is closed but its resources have not yet been released. Setting net.ipv4.tcp_tw_reuse to 1 lets the kernel reuse such connections for new connections when it is safe to do so, which is much cheaper than establishing brand-new ones.

net.ipv4.tcp_fin_timeout

This is the minimum time a dying connection waits in the FIN-WAIT-2 state before the kernel reclaims it. Lowering it speeds up recycling.

How to check connection status

Use netstat:

netstat -tan | awk '{print $6}' | sort | uniq -c

Or use ss:

ss -s

Nginx

As the load on the web server grows, we start hitting some strange limits in Nginx. Connections are dropped, and the kernel keeps reporting SYN floods. Meanwhile the load average and CPU usage are tiny, and the server could obviously handle more connections, which is frustrating.

On investigation, we found a very large number of connections in the TIME_WAIT state. This is the output from one of the servers:

ss -s
Total: 388 (kernel 541)
TCP: 47461 (estab 311, closed 47135, orphaned 4, synrecv 0, timewait 47135/0), ports 33938

Transport Total IP    IPv6
*         541   -     -
RAW       0     0     0
UDP       13    10    3
TCP       326   325   1
INET      339   335   4
FRAG      0     0     0


There are 47,135 TIME_WAIT connections! Moreover, ss shows that they are all closed connections. This suggests the server has burned through most of its available ports, and implies that it is allocating a new port for every connection. Tuning the network helped a little, but there were still not enough ports.

After further research, I found the documentation for the upstream module's keepalive directive, which reads:

  • Sets the maximum number of idle keepalive connections to upstream servers. These connections are preserved in the cache of each worker process.

Interesting. In theory, this setting minimizes connection waste by passing requests over already-cached connections. The documentation also mentions that proxy_http_version should be set to "1.1" and the "Connection" header cleared. On further research this proved a good idea, because HTTP/1.1 greatly optimizes TCP connection usage compared with HTTP/1.0, and Nginx proxies with HTTP/1.0 by default.

After modifying as the documentation suggests, our upstream configuration became this:

upstream backend_nodejs {
    server nodejs-3:5016 max_fails=0 fail_timeout=10s;
    server nodejs-4:5016 max_fails=0 fail_timeout=10s;
    server nodejs-5:5016 max_fails=0 fail_timeout=10s;
    server nodejs-6:5016 max_fails=0 fail_timeout=10s;
    keepalive 512;
}

I also modified the proxy settings in the server section as the documentation suggests. In addition, I added proxy_next_upstream to skip failed servers, adjusted the client-facing keepalive_timeout, and turned off the access log. The configuration became this:

server {
    listen 80;
    server_name fast.gosquared.com;

    client_max_body_size 16M;
    keepalive_timeout 10;

    location / {
        proxy_next_upstream error timeout http_500 http_502 http_503 http_504;
        proxy_set_header Connection "";
        proxy_http_version 1.1;
        proxy_pass http://backend_nodejs;
    }

    access_log off;
    error_log /dev/null crit;
}

With the new configuration in place, the number of sockets occupied by the servers dropped by 90%; requests are now carried over far fewer connections. The new output looks like this:

ss -s

Total: 558 (kernel 604)
TCP: 4675 (estab 485, closed 4183, orphaned 0, synrecv 0, timewait 4183/0), ports 2768

Transport Total IP    IPv6
*         604   -     -
RAW       0     0     0
UDP       13    10    3
TCP       492   491   1
INET      505   501   4

Node.js

Thanks to its event-driven design and asynchronous I/O, Node.js handles large numbers of connections and requests out of the box. There are other tuning avenues, but this article focuses mainly on the process side of Node.js.

Node is single-threaded and does not use multiple cores automatically. In other words, the application cannot exploit the full capacity of the server on its own.

Clustering Node processes

We can modify the application so that it forks multiple processes that all accept connections on the same port, allowing the load to span multiple cores. Node has a cluster module that provides all the tools necessary to achieve this, but getting it into an application still takes some manual work. If you are using express, eBay's cluster2 module may be worth a look.
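
To make that concrete, here is a minimal sketch of a clustered HTTP server built on the cluster module. It is an illustration, not our production code: the port 5016 matches the upstream block above, the file name is made up, and the N-1 worker count anticipates the advice in the next section.

// app.js - minimal clustering sketch (hypothetical example)
var cluster = require('cluster');
var http = require('http');
var os = require('os');

// Leave one core free for the kernel scheduler (see the next section).
var numWorkers = Math.max(os.cpus().length - 1, 1);

if (cluster.isMaster) {
  // The master only forks and supervises the workers.
  for (var i = 0; i < numWorkers; i++) {
    cluster.fork();
  }
  // Replace any worker that dies so capacity stays constant.
  cluster.on('exit', function (worker) {
    console.log('worker ' + worker.process.pid + ' died, forking a new one');
    cluster.fork();
  });
} else {
  // Every worker listens on the same port; connections are distributed among them.
  http.createServer(function (req, res) {
    res.writeHead(200, {'Content-Type': 'text/plain'});
    res.end('handled by worker ' + process.pid + '\n');
  }).listen(5016);
}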

Prevent context switching

When running multiple processes, make sure each CPU core is busy with only one process at a time. As a rule of thumb, if the CPU has N cores, spawn N-1 application processes. That way each process gets a reasonable time slice, and one core stays free for the kernel scheduler to run other tasks. We should also ensure that essentially nothing other than Node.js runs on the server, to avoid CPU contention.

We once made the mistake of deploying two Node.js applications on the same server, each of which then spawned N-1 processes. The applications competed with each other for the CPUs, and the system load rose sharply. Even though our servers are all 8-core machines, the performance overhead of the context switching was clearly noticeable. Context switching is when the CPU suspends the current task in order to run another: the kernel must save all the state of the current process, then load and execute a different one. To solve the problem, we reduced the number of processes each application spawned so that they could share the CPUs fairly, and the system load dropped:

[Graph: system load (blue line) falling below the number of CPU cores (red line) after the process count was reduced]

Notice in the graph above how the system load (blue line) drops below the number of CPU cores (red line). We saw the same thing on every other server. Since the total workload remained constant, the performance improvement can only be attributed to the reduction in context switches.
