File opening methods
When we use the open() function to open a file, there are several opening modes.
'r'->Read-only
'w'->Write-only. If the file already exists, it will be cleared. If it does not exist, it will be created.
'a'-> Append, write to the end of the file
'b'-> Binary mode, such as opening images, audio, and word files.
'+'->Update (readable and writable)
This one with the '+' sign is a bit difficult to understand, let's go through the code to get a feel for it.
with open('foo.txt', 'w+') as f: f.write('bar\n') f.seek(0) data = f.read()
As you can see, the above code can not only be written, but also read. Note that you must first locate the beginning, f.seek(0), otherwise you will read empty data.
Some people may be confused. Since the '+' sign is readable and writable, what is the difference between 'w+' and 'r+'.
That is,
'w+' will clear and create (the file will be cleared if it already exists, and created if it does not exist.)
'r+' will not clear or create
Do not open text files in binary mode
First look at the "weird" phenomenon of the code below.
Assume that under Windows, I have a f.txt file with the following content.
hello world
Code one,
with open('f.txt', 'r') as f: print f.readlines() with open('f.txt', 'rb') as f: print f.readlines()
Output
['hello\n', 'world\n'] ['hello\r\n', 'world\r\n']
Code 2,
with open('f.txt', 'rb') as f: data = f.read() with open('f.txt', 'w') as f: f.write(data)
Open the file and it becomes like this,
hello^M world^M
First of all, understand the concepts of line feed character '\n' and carriage return character '\r'.
'\n', newline character (LF, Line-Feed), refers to a new line.
'\r', carriage return (CR, Carriage-Return), refers to returning to the beginning of the line.
Because the newline identifiers under different systems are different.
windows->'\r\n' unix->'\n' mac->'\r'
This is why txt under windows has '^M' at the end of the line when it is opened in linux.
This is why I ran a script under Linux to export the game data and opened it in local windows and it became one line.
In fact, text files are also binary files, which are text-encoded binary files. The text files process some invisible characters to increase readability.
In python, you can get the newline identifier of the current system through os.linesep. For example, under Windows, os.linesep is '\r\n'.
When operating the newline flag in python, it doesn't matter what platform it is on, just use '\n'. Python will automatically convert it to different flags according to different systems.
With the above theoretical basis, we can analyze the "weird" phenomenon of the code at the beginning of this article.
In code 1, for files opened in text mode, the newline flag will be processed by python into '\n', while when opened in binary mode, it will be left intact.
In code two, open in binary mode and write in text mode. When opened in binary, it is still '\r\n', and when writing in text mode, python will convert '\n' into '\r\n', so it is actually equivalent to writing '\r\r \n', so there is an extra '^M'.
For more articles related to the usage of the open() function in Python to specify the file opening method, please pay attention to the PHP Chinese website!