Home > Backend Development > XML/RSS Tutorial > Detailed introduction on how to edit xml files

Detailed introduction on how to edit xml files

黄舟
Release: 2017-03-29 15:48:23
Original
2126 people have browsed it

XMLThe document can contain foreign characters such as Norwegian or French (Chinese is also acceptable! This part still cannot be translated according to the original text, some of the following content is written by myself)

For For your parser to understand these characters, you must adopt a consistent character encoding standard in the XML document.

-------------------------------------------------- ------------------------------------

Windows 95/98 Notepad
Windows 95/98 Notepad cannot save files in Unicode encoding format.

You can use Notepad to edit and save XML documents containing foreign characters (for example: Norwegian or French or Chinese)

<?xml version="1.0"?>
<note>
<from>小奀</from>
<to>小林</to>
<message>晚上一起去火锅呀</message>
</note>
Copy after login

But if you open this with a browser and edited it with Notepad XML document, an error will occur.

-------------------------------------------------- ------------------------------------

in Windows 95/98 Notepad Using encoding
Windows 95/98 Notepad to edit XML files must set the encodingattribute.

In order to avoid errors, you can add an encoding attribute to the XML document declaration to indicate the encoding type of this XML document, but do not use Unicode encoding.

The following encoding type will not cause errors, and Chinese characters will be displayed normally:

<?xml version="1.0" encoding="gb2312"?>
Copy after login

The following encoding types will not cause errors, and Chinese characters will be displayed normally:

<?xml version="1.0" encoding="gbk"?>
Copy after login

The following encoding type will not cause errors, and Chinese characters will not be displayed normally (garbled characters):

<?xml version="1.0" encoding="windows-1252"?>
Copy after login
Copy after login

The following encoding types will not cause errors, and Chinese characters will be displayed abnormally (garbled characters) :

<?xml version="1.0" encoding="ISO-8859-1"?>
Copy after login
Copy after login

The following encoding types will not cause errors, and Chinese characters are displayed normally:

<?xml version="1.0" encoding="UTF-8"?>
Copy after login
Copy after login

The following encoding types will cause errors:

<?xml version="1.0" encoding="UTF-16"?>
Copy after login
Copy after login

--------------------------------------------- ----------------------------------

Using Windows 2000 Notepad
Windows 2000 Notepad can save files in Unicode encoding format.

Notepad for Windows 2000 supports Unicodecharacter set. If you use Win2000 Notepad to save the XML document in Unicode encoding format (please note that there is no encoding information in the XML declaration):

<?xml version="1.0"?>
<note><from>小奀</from><to>小林</to><message>晚上一起去火锅呀</message></note>
Copy after login

---------------- -------------------------------------------------- --------------

Windows 2000 Notepad Encoding
Windows 2000 Notepad can also save files in "UTF-16" encoding format.

If you declare the encoding attribute in the XML document and save the file in Unicode encoding format, an error may occur.

The following code will cause an error:

<?xml version="1.0" encoding="windows-1252"?>
Copy after login
Copy after login

The following code will cause an error:

<?xml version="1.0" encoding="ISO-8859-1"?>
Copy after login
Copy after login

The following code will cause an error:

<?xml version="1.0" encoding="UTF-8"?>
Copy after login
Copy after login

The following file; note_encode_utf16_u.xml, will display normally in IE5.0+, but will cause an error in Netscape 6.2 browser.

<?xml version="1.0" encoding="UTF-16"?>
Copy after login
Copy after login

---------------------------------------- -------------------------------------

Error message
When using IE5.0 or higher to browse XML documents, you may encounter two different encoding errors:

An invalid character was found in the text content (An invalid character was found in text content).

If your XML document does not match the encoding format of your XML document, an error may occur. Usually, the XML document contains some "non-English" characters and uses single-byte encoding Editor, and the encoding format of the XML document is not set in the declaration of the XML document.

Conversion from the current encoding format to another encoding format is not supported (Switch from current encoding to specified encoding not supported ).

If the XML document is saved in Unicode/UTF-16 encoding format, but the declaration of the XML document sets the encoding format to be some single-byte encoding (such as Windows-1252, ISO-8859-1 or UTF -8); or the XML document is saved in a single-byte encoding format, but the declaration of the XML document sets the encoding format to be some Unicode/UTF-16 encoding form. Both situations will cause errors.

-------------------------------------------------- ------------------------------------

Conclusion
Conclusion: After saving Before setting the encoding format of the document in the declaration of the XML document, some of my suggestions to avoid errors:

Use an editor that supports Unicode encoding format.
Make sure you know which encoding format you are using.
Use attribute declarations to set the encoding format in XML documents.

The above is the detailed content of Detailed introduction on how to edit xml files. For more information, please follow other related articles on the PHP Chinese website!

Related labels:
source:php.cn
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Popular Tutorials
More>
Latest Downloads
More>
Web Effects
Website Source Code
Website Materials
Front End Template