How Can I Decode HTML Entities in Java?-javaTutorial-php.cn

How Can I Decode HTML Entities in Java?

Linda Hamilton

Release： 2024-12-24 15:19:09

Original

952 people have browsed it

How Can I Decode HTML Entities in Java?

Decoding HTML Entities in Java

Unescaping HTML character entities in Java allows you to convert special characters, such as "" " and "">"," back to their original Unicode characters. In .NET, this is achieved using the HttpUtility.HtmlDecode method.

Java Equivalent

Java does not provide a direct equivalent to the .NET HttpUtility.HtmlDecode method. However, you can use third-party libraries like Apache Commons Lang to achieve this functionality.

Solution Using Apache Commons StringEscapeUtils

import org.apache.commons.lang3.StringEscapeUtils;

String html = "&quot;&amp;nbsp;&quot;&quot;&amp;gt;";
String unescapedHtml = StringEscapeUtils.unescapeHtml4(html);

System.out.println(unescapedHtml); // Outputs: " " >

Copy after login

The StringEscapeUtils.unescapeHtml4() method decodes HTML 4.0 entities, converting special characters back to their Unicode representations.

The above is the detailed content of How Can I Decode HTML Entities in Java?. For more information, please follow other related articles on the PHP Chinese website!