Home Web Front-end JS Tutorial Introduction to the cluster module in Node.js

Introduction to the cluster module in Node.js

Jul 04, 2018 am 11:20 AM
node.js

This article mainly introduces the introduction of the cluster module in Node.js. It has a certain reference value. Now I share it with you. Friends in need can refer to it.

Node’s single-threaded design is no longer available In order to fully "squeeze" machine performance, Node has added a built-in module cluster, which can implement cluster functions by managing a bunch of child processes through a parent process. This article mainly introduces an in-depth analysis of Node.js cluster Module, interested friends can refer to

cluster module overview

node instances are single-threaded operations. In server-side programming, multiple node instances are usually created to handle client requests to improve system throughput. For such multiple node instances, we call it a cluster.

With the help of node's cluster module, developers can obtain the benefits of cluster services without modifying the original project code with almost no modifications.

The cluster has the following two common implementation solutions, and the cluster module that comes with node adopts the second solution.

Option 1: Multiple node instances and multiple ports

The node instances in the cluster each listen to different ports, and then the reverse proxy implements the request to multiple Distribution of ports.

  1. Advantages: Simple implementation, each instance is relatively independent, which is good for service stability.

  2. Disadvantages: increased port occupancy, communication between processes is more troublesome.

Option 2: The main process forwards the request to the child process

In the cluster, create a main process (master) and several child processes ( worker). The master monitors client connection requests and forwards them to workers according to specific policies.

  1. Advantages: Usually only one port is occupied, communication is relatively simple, and the forwarding strategy is more flexible.

  2. Disadvantages: The implementation is relatively complex and requires high stability of the main process.

Getting Started Example

In the cluster module, the main process is called master and the child process is called worker.

The example is as follows, create server instances with the same number of CPUs to handle client requests. Note that they are all listening on the same port.

// server.js
var cluster = require('cluster');
var cpuNums = require('os').cpus().length;
var http = require('http');

if(cluster.isMaster){
 for(var i = 0; i < cpuNums; i++){
  cluster.fork();
 }
}else{
 http.createServer(function(req, res){
  res.end(`response from worker ${process.pid}`);
 }).listen(3000);

 console.log(`Worker ${process.pid} started`);
}
Copy after login

Create batch script: ./req.sh.

#!/bin/bash

# req.sh
for((i=1;i<=4;i++)); do  
 curl http://127.0.0.1:3000
 echo ""
done
Copy after login

The output is as follows. As you can see, the responses come from different processes.

response from worker 23735
response from worker 23731
response from worker 23729
response from worker 23730

cluster Module implementation principle

To understand the cluster module, we mainly need to understand three questions:

  1. How do master and workers communicate?

  2. How to achieve port sharing for multiple server instances?

  3. Multiple server instances, how to distribute requests from clients to multiple workers?

The following will be introduced based on the schematic diagram. For source code level introduction, you can refer to the author's github.

Question 1: How to communicate between master and worker

This question is relatively simple. The master process creates worker processes through cluster.fork(). Cluster.fork() internally creates child processes through child_process.fork().

In other words:

  1. The master process and the worker process are the relationship between parent and child processes.

  2. The master process and the worker process can communicate through the IPC channel. (Important)

Question 2: How to implement port sharing

In the previous example, servers created in multiple wokers listened to the same port. port 3000. Generally speaking, if multiple processes listen to the same port, the system will report an error.

Why is our example okay?

The secret is that in the net module, the listen() method is specially processed. Depending on whether the current process is a master process or a worker process:

  1. master process: listen to requests normally on this port. (No special processing)

  2. #Worker process: Create a server instance. Then send a message to the master process through the IPC channel, so that the master process also creates a server instance and listens for requests on this port. When a request comes in, the master process forwards the request to the server instance of the worker process.

To sum up, it is: the master process listens to a specific port and forwards customer requests to the worker process.

As shown below:

Question 3: How to distribute requests to multiple workers

Every When the worker process creates a server instance to listen for requests, it will be registered on the master through the IPC channel. When a client request arrives, the master will be responsible for forwarding the request to the corresponding worker.

Which worker will it be forwarded to specifically? This is determined by the forwarding strategy. It can be set through the environment variable NODE_CLUSTER_SCHED_POLICY, or passed in when cluster.setupMaster(options).

默认的转发策略是轮询(SCHED_RR)。

当有客户请求到达,master会轮询一遍worker列表,找到第一个空闲的worker,然后将该请求转发给该worker。

master、worker内部通信小技巧

在开发过程中,我们会通过 process.on('message', fn) 来实现进程间通信。

前面提到,master进程、worker进程在server实例的创建过程中,也是通过IPC通道进行通信的。那会不会对我们的开发造成干扰呢?比如,收到一堆其实并不需要关心的消息?

答案肯定是不会?那么是怎么做到的呢?

当发送的消息包含cmd字段,且改字段以NODE_作为前缀,则该消息会被视为内部保留的消息,不会通过message事件抛出,但可以通过监听'internalMessage'捕获。

以worker进程通知master进程创建server实例为例子。worker伪代码如下:

// woker进程
const message = {
 cmd: &#39;NODE_CLUSTER&#39;,
 act: &#39;queryServer&#39;
};
process.send(message);
Copy after login

master伪代码如下:

worker.process.on(&#39;internalMessage&#39;, fn);
Copy after login

相关链接

官方文档:https://nodejs.org/api/cluster.html

Node学习笔记:https://github.com/chyingp/nodejs-learning-guide

以上就是本文的全部内容,希望对大家的学习有所帮助,更多相关内容请关注PHP中文网!

相关推荐:

nodejs中实现路由功能的方法

对于Nodejs的Http模块的解析

nodejs中模块定义的介绍

The above is the detailed content of Introduction to the cluster module in Node.js. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Detailed graphic explanation of the memory and GC of the Node V8 engine Detailed graphic explanation of the memory and GC of the Node V8 engine Mar 29, 2023 pm 06:02 PM

This article will give you an in-depth understanding of the memory and garbage collector (GC) of the NodeJS V8 engine. I hope it will be helpful to you!

An article about memory control in Node An article about memory control in Node Apr 26, 2023 pm 05:37 PM

The Node service built based on non-blocking and event-driven has the advantage of low memory consumption and is very suitable for handling massive network requests. Under the premise of massive requests, issues related to "memory control" need to be considered. 1. V8’s garbage collection mechanism and memory limitations Js is controlled by the garbage collection machine

Let's talk about how to choose the best Node.js Docker image? Let's talk about how to choose the best Node.js Docker image? Dec 13, 2022 pm 08:00 PM

Choosing a Docker image for Node may seem like a trivial matter, but the size and potential vulnerabilities of the image can have a significant impact on your CI/CD process and security. So how do we choose the best Node.js Docker image?

Let's talk in depth about the File module in Node Let's talk in depth about the File module in Node Apr 24, 2023 pm 05:49 PM

The file module is an encapsulation of underlying file operations, such as file reading/writing/opening/closing/delete adding, etc. The biggest feature of the file module is that all methods provide two versions of **synchronous** and **asynchronous**, with Methods with the sync suffix are all synchronization methods, and those without are all heterogeneous methods.

Node.js 19 is officially released, let's talk about its 6 major features! Node.js 19 is officially released, let's talk about its 6 major features! Nov 16, 2022 pm 08:34 PM

Node 19 has been officially released. This article will give you a detailed explanation of the 6 major features of Node.js 19. I hope it will be helpful to you!

Let's talk about the GC (garbage collection) mechanism in Node.js Let's talk about the GC (garbage collection) mechanism in Node.js Nov 29, 2022 pm 08:44 PM

How does Node.js do GC (garbage collection)? The following article will take you through it.

Let's talk about the event loop in Node Let's talk about the event loop in Node Apr 11, 2023 pm 07:08 PM

The event loop is a fundamental part of Node.js and enables asynchronous programming by ensuring that the main thread is not blocked. Understanding the event loop is crucial to building efficient applications. The following article will give you an in-depth understanding of the event loop in Node. I hope it will be helpful to you!

What should I do if node cannot use npm command? What should I do if node cannot use npm command? Feb 08, 2023 am 10:09 AM

The reason why node cannot use the npm command is because the environment variables are not configured correctly. The solution is: 1. Open "System Properties"; 2. Find "Environment Variables" -> "System Variables", and then edit the environment variables; 3. Find the location of nodejs folder; 4. Click "OK".

See all articles