This article introduces the Node.js Stream API, starting from the basics and working toward more advanced usage. The details follow below.
Basic introduction
In Node.js there are two ways to read a file: fs.readFile and fs.createReadStream.
fs.readFile is the one every Node.js user knows best: it is simple to understand and easy to use. Its drawback is that all of the data is read into memory first, so once a large file is involved, reading this way becomes very inefficient.
fs.createReadStream, by contrast, reads the data through a Stream. It splits the file (data) into chunks and emits specific events as it goes; we can listen for those events and write handlers for them. This approach is less convenient than the first, but it is very efficient.
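To make the difference concrete, here is a minimal sketch comparing the two approaches (the file name './big.file' is just a placeholder):

var fs = require('fs');

// fs.readFile buffers the whole file in memory before the callback fires
fs.readFile('./big.file', function(err, data) {
  if (err) throw err;
  console.log('read ' + data.length + ' bytes in one go');
});

// fs.createReadStream hands the file over chunk by chunk via events
fs.createReadStream('./big.file')
  .on('data', function(chunk) {
    console.log('got a chunk of ' + chunk.length + ' bytes');
  })
  .on('end', function() {
    console.log('done');
  });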
In fact, Stream is not only used for file handling in Node.js; it shows up in other places too, for example in process.stdin/stdout, http, TCP sockets, zlib, and crypto.
This article is a summary of my learning about the Stream API in Node.js. I hope it will be useful to everyone.
Features
Event-based communication
Streams can be connected together with pipe
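For example, the zlib module mentioned above exposes streams that slot straight into a pipe chain; a minimal gzip sketch (file names are placeholders):

var fs = require('fs');
var zlib = require('zlib');

// Each pipe() call returns its target, so streams chain naturally
fs.createReadStream('./test1.txt')
  .pipe(zlib.createGzip())
  .pipe(fs.createWriteStream('./test1.txt.gz'));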
Types
Readable Stream: a readable data stream
Writable Stream: a writable data stream
Duplex Stream: a bidirectional data stream that can be read from and written to at the same time
Transform Stream: a duplex stream that also transforms (processes) the data passing through it
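To illustrate the last type, here is a minimal sketch of a Transform stream built on the built-in stream module (the uppercasing logic is my own example, not from the original article):

var stream = require('stream');

// A Transform stream is readable and writable at once:
// data written to it comes out the other side, transformed.
var upperCaser = new stream.Transform();
upperCaser._transform = function(chunk, encoding, callback) {
  this.push(chunk.toString().toUpperCase());
  callback();
};

// stdin and stdout are themselves streams, so everything pipes together
process.stdin.pipe(upperCaser).pipe(process.stdout);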
Events
Events of readable data streams
readable: emitted when there is data available to be read from the stream
data: for a stream that has not been explicitly paused, attaching a 'data' event listener switches it into flowing mode, and the stream hands data out as soon as it can
end: emitted when the data has been read to the end. Note that it must not be confused with writableStream.end(): a writable stream has no 'end' event, only an .end() method
close: emitted when the data source has been closed
error: emitted when an error occurs while reading data
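A minimal sketch wiring up these events (reusing the './test1.txt' file from the usage section below):

var fs = require('fs');
var rs = fs.createReadStream('./test1.txt');

// Attaching a 'data' listener switches the stream into flowing mode
rs.on('data', function(chunk) {
  console.log('read ' + chunk.length + ' bytes');
});
rs.on('end', function() {
  console.log('no more data');
});
rs.on('error', function(err) {
  console.error('read failed: ' + err.message);
});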
Events of writable data streams
drain: after writable.write(chunk) has returned false, 'drain' is emitted once the buffered data has been flushed and it is safe to write again
finish: emitted after the end() method has been called and all buffered data has been flushed; similar to the 'end' event of a readable stream, it marks the end of the writing process
pipe: emitted when the stream is passed as the target of a pipe() call
unpipe: emitted when the stream is removed as a pipe() target via unpipe()
error: emitted when an error occurs while writing data
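A minimal sketch of the write/'drain' handshake ('./out.txt' and the loop count are my own illustration):

var fs = require('fs');
var ws = fs.createWriteStream('./out.txt');

var i = 100000;
function write() {
  var ok = true;
  while (i > 0 && ok) {
    i--;
    ok = ws.write('line ' + i + '\n'); // returns false when the buffer is full
  }
  if (i > 0) {
    ws.once('drain', write); // wait for 'drain', then keep writing
  } else {
    ws.end(); // 'finish' fires after all buffered data is flushed
  }
}
ws.on('finish', function() {
  console.log('all data written');
});
write();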
State
A readable stream has two modes: flowing and paused. The ways to switch between them are as follows:
Paused mode -> flowing mode
Attach a listener for the 'data' event
Call the resume() method
Call the pipe() method
Note: when switching into flowing mode, if there is no 'data' event listener and no pipe() destination, the data will be lost.
Flowing mode -> paused mode
If there is no pipe() destination, call the pause() method
If there are pipe() destinations, remove all 'data' event listeners and call unpipe() to detach every pipe() destination
Note: merely removing the 'data' event listeners does not automatically put the stream into paused mode. Likewise, when pipe() destinations exist, calling pause() does not guarantee the stream stays paused; once those destinations request data, the stream may start providing it again.
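A minimal sketch of switching between the two modes ('./test1.txt' as before):

var fs = require('fs');
var rs = fs.createReadStream('./test1.txt');

// Attaching a 'data' listener puts rs into flowing mode
rs.on('data', function(chunk) {
  console.log('got ' + chunk.length + ' bytes, pausing for a second');
  rs.pause(); // switch to paused mode: no more 'data' events for now
  setTimeout(function() {
    rs.resume(); // back to flowing mode
  }, 1000);
});
rs.on('end', function() {
  console.log('done');
});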
Usage
Read and write files
var fs = require('fs');

// Create a readable stream
var rs = fs.createReadStream('./test1.txt');
// Create a writable stream
var ws = fs.createWriteStream('./test2.txt');

// Listen for the readable stream's end event
rs.on('end', function() {
  console.log('read test1.txt successfully!');
});
// Listen for the writable stream's finish event
ws.on('finish', function() {
  console.log('write test2.txt successfully!');
});

// Switch the readable stream into flowing mode and pipe it into the writable stream
rs.pipe(ws);

Read a CSV file and upload its data (I have written this in production)

var fs = require('fs');
var es = require('event-stream');
var csv = require('csv');

var parser = csv.parse();
var transformer = csv.transform(function(record) {
  return record.join(',');
});

var data = fs.createReadStream('./demo.csv');

data
  .pipe(parser)
  .pipe(transformer)
  // Process the data passed on by the previous stream
  .pipe(es.map(function(data, callback) {
    // upload() is the author's own function, defined elsewhere
    upload(data, function(err) {
      callback(err);
    });
  }))
  // Equivalent to listening for the previous stream's end event
  .pipe(es.wait(function(err, body) {
    process.stdout.write('done!');
  }));
More usage examples
You can refer to https://github.com/jeresig/node-stream-playground . After opening the example site, just click "add stream" to see the results.
Common Pitfalls
Writing a file with rs.pipe(ws) does not append the content of rs to the end of ws; it overwrites the original content of ws with the content of rs (see the append sketch after this list).
A stream that has ended or been closed cannot be reused; you must create a new stream.
The pipe() method returns the target stream, i.e. a.pipe(b) returns b, so when you attach event listeners, make sure you are listening on the stream you actually mean.
If you want to listen on several streams chained together with pipe(), you have to write:
data
  .on('end', function() {
    console.log('data end');
  })
  .pipe(a)
  .on('end', function() {
    console.log('a end');
  })
  .pipe(b)
  .on('end', function() {
    console.log('b end');
  });
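As noted in the first pitfall above, pipe() truncates the target file by default. A minimal sketch of appending instead, using the flags option of fs.createWriteStream:

var fs = require('fs');
var rs = fs.createReadStream('./test1.txt');
// flags: 'a' opens the target file for appending instead of truncating it
var ws = fs.createWriteStream('./test2.txt', { flags: 'a' });
rs.pipe(ws);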
Commonly used class libraries
event-stream: has a functional-programming feel to it; I personally like it.
awesome-nodejs#streams: I have not used any other stream libraries myself, so if you need one, just look here.