Batch change the encoding method in meta information of HTML files
Release: 2016-07-25 09:08:18
Original
1128 people have browsed it
Sometimes the encoding method of the html file is different from the encoding method specified in the meta information. You can use this code to fix it. This program relies on jsoup and commons-io packages
- import java.io.File;
- import java.io.FileWriter;
- import java.io.IOException;
- import java.io.Writer;
- import java.util.Iterator;
-
- import org.apache. commons.io.FileUtils;
- import org.jsoup.Jsoup;
- import org.jsoup.nodes.Document;
- import org.jsoup.nodes.Element;
- import org.jsoup.select.Elements;
-
- public class main {
-
- /**
- * @param args
- * @throws IOException
- */
- public static void main(String[] args) throws IOException {
- // TODO Auto-generated method stub
-
- File input = new File("C:\Users\jack\Desktop \New Folder\jdk-zh");
- Iterator it = FileUtils.iterateFiles(input, null, true);
- while (it.hasNext()) {
- File file = it.next();
- Document doc = Jsoup.parse(file, "gb2312");
- Elements content = doc.getElementsByAttributeValueStarting("content", "text/html;");
- for (Element meta : content) {
- meta.attr("content ", "text/html; charset=utf-8");
- System.out
- .println("Modify content--------" + file.getName() + "---");
- }
- FileUtils.writeStringToFile(file, doc.html(),"utf-8");
- }
- }
- }
-
Copy code
|
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Latest Articles by Author
-
2024-10-22 09:46:29
-
2024-10-13 13:53:41
-
2024-10-12 12:15:51
-
2024-10-11 22:47:31
-
2024-10-11 19:36:51
-
2024-10-11 15:50:41
-
2024-10-11 15:07:41
-
2024-10-11 14:21:21
-
2024-10-11 12:59:11
-
2024-10-11 12:17:31