[Recommended] A library for parsing HTML data in Objective-C (scraping web page data)

Jun 24, 2016, 11:24 AM

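TFHpple is a third-party library for parsing HTML data. In my experience it works reasonably well, but the project has to be configured before the library can be used.

Configuration

1. Link libxml2.tbd

2. Set the build path (for libxml2 this usually means adding $(SDKROOT)/usr/include/libxml2 to Header Search Paths)

Usage

An example illustrates the steps, using this page: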

http://so.gushiwen.org/guwen/book_2.aspx

 

1. Create a TFHpple object; data is the data returned by the website

TFHpple *htmlParser = [[TFHpple alloc] initWithHTMLData:data];
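
The article does not show where data comes from. The following is a minimal sketch of one way to obtain it, assuming a synchronous download is acceptable; ParserForPage is a hypothetical helper name and not part of TFHpple.

```objective-c
#import <Foundation/Foundation.h>
#import "TFHpple.h"

// Hypothetical helper (not part of TFHpple): downloads a page and wraps
// the bytes in a parser. Returns nil if the URL is invalid or the download fails.
static TFHpple *ParserForPage(NSString *urlString) {
    NSURL *url = [NSURL URLWithString:urlString];
    if (url == nil) {
        return nil;
    }
    // Synchronous download for brevity; run this off the main thread in a real app.
    NSError *error = nil;
    NSData *data = [NSData dataWithContentsOfURL:url options:0 error:&error];
    if (data == nil) {
        NSLog(@"Download failed: %@", error);
        return nil;
    }
    return [[TFHpple alloc] initWithHTMLData:data];
}
```

It could be called as ParserForPage(@"http://so.gushiwen.org/guwen/book_2.aspx"). Since the example URL is plain HTTP, App Transport Security on iOS 9 and later will block the request unless the app declares an exception.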

 

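2. Use the searchWithXPathQuery: method to extract the useful data (look up XPath separately if it is unfamiliar)

NSArray *elements = [htmlParser searchWithXPathQuery:@"//div[@class='shileft']/div[@class='bookcont']"];

This gives us the data for the Analects (论语).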
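
The query above returns the enclosing div. If only the chapter links are needed, the XPath can be narrowed to the a tags inside it. This is a sketch under the assumption that the page keeps the markup shown later in this article; ChapterTitles is a hypothetical helper name.

```objective-c
#import <Foundation/Foundation.h>
#import "TFHpple.h"

// Hypothetical helper: collects the text of every <a> inside the bookcont div.
static NSArray<NSString *> *ChapterTitles(TFHpple *parser) {
    // "//a" after the div selects all <a> descendants, however deeply nested.
    NSArray *anchors = [parser searchWithXPathQuery:@"//div[@class='bookcont']//a"];
    NSMutableArray<NSString *> *titles = [NSMutableArray array];
    for (TFHppleElement *anchor in anchors) {
        NSString *title = anchor.content;
        if (title.length > 0) {
            [titles addObject:title];
        }
    }
    return titles;
}
```

Selecting the a nodes directly avoids walking the children tree by hand, at the cost of tying the code more closely to the page's current structure.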

 

3. Retrieve and inspect an element (i is an index into the array returned in step 2)

TFHppleElement *element = [elements objectAtIndex:i];
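
Instead of indexing one element at a time, the whole result array can be walked with fast enumeration. This is only a sketch; DumpElements is a hypothetical helper name, and it logs the properties without interpreting them.

```objective-c
#import <Foundation/Foundation.h>
#import "TFHpple.h"

// Hypothetical helper: logs a one-line summary of every element in a search result.
static void DumpElements(NSArray *elements) {
    for (TFHppleElement *element in elements) {
        NSLog(@"tag=%@ attributes=%@ children=%lu",
              element.tagName,
              element.attributes,
              (unsigned long)element.children.count);
    }
}
```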

A TFHppleElement object has many properties; each is introduced briefly below.

1. raw

@property (nonatomic, copy, readonly) NSString *raw

raw is the page data, including the HTML markup:

<div class="bookcont">&#13;        <ul>&#13;         &#13;              <span><a href="/guwen/bookv_19.aspx">学而篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_20.aspx">为政篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_21.aspx">八佾篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_22.aspx">里仁篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_23.aspx">公冶长篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_24.aspx">雍也篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_25.aspx">述而篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_26.aspx">泰伯篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_27.aspx">子罕篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_28.aspx">乡党篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_29.aspx">先进篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_30.aspx">颜渊篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_31.aspx">子路篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_32.aspx">宪问篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_33.aspx">卫灵公篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_34.aspx">季氏篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_35.aspx">阳货篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_36.aspx">微子篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_37.aspx">子张篇</a></span>&#13;               &#13;              <span><a href="/guwen/bookv_38.aspx">尧曰篇</a></span>&#13;              &#13;        </ul>&#13;        </div>

2. content is the actual text of the page, without the HTML markup:

学而篇                             为政篇                             八佾篇                             里仁篇                             公冶长篇                             雍也篇                             述而篇                             泰伯篇                             子罕篇                             乡党篇                             先进篇                             颜渊篇                             子路篇                             宪问篇                             卫灵公篇                             季氏篇                             阳货篇                             微子篇                             子张篇                             尧曰篇

3. tagName is the HTML tag name

The output here is just div

4. attributes holds the element's attributes

class = bookcont;
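
Individual attribute values can be read from the attributes dictionary or through objectForKey:. The sketch below assumes it is given a elements such as the chapter links on this page; HrefsByTitle is a hypothetical helper name.

```objective-c
#import <Foundation/Foundation.h>
#import "TFHpple.h"

// Hypothetical helper: maps each link's text to the value of its href attribute.
static NSDictionary<NSString *, NSString *> *HrefsByTitle(NSArray *anchors) {
    NSMutableDictionary<NSString *, NSString *> *result = [NSMutableDictionary dictionary];
    for (TFHppleElement *anchor in anchors) {
        NSString *title = anchor.content;
        // objectForKey: looks the name up in the element's attributes.
        NSString *href = [anchor objectForKey:@"href"];
        if (title.length > 0 && href.length > 0) {
            result[title] = href;
        }
    }
    return result;
}
```

The href values on this page are relative (for example /guwen/bookv_19.aspx), so they would still need to be resolved against the site's base URL before being requested.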

5. children holds the child nodes

(    "{\n    nodeContent = \"\\n        \";\n    nodeName = text;\n}",    "{\n    nodeChildArray =     (\n                {\n            nodeContent = \"\\n         \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_19.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5b66\\U800c\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5b66\\U800c\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_19.aspx\\\">\\U5b66\\U800c\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5b66\\U800c\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_19.aspx\\\">\\U5b66\\U800c\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_20.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = 
\"\\U4e3a\\U653f\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U4e3a\\U653f\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_20.aspx\\\">\\U4e3a\\U653f\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U4e3a\\U653f\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_20.aspx\\\">\\U4e3a\\U653f\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_21.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U516b\\U4f7e\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U516b\\U4f7e\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_21.aspx\\\">\\U516b\\U4f7e\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U516b\\U4f7e\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_21.aspx\\\">\\U516b\\U4f7e\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n           
         nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_22.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U91cc\\U4ec1\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U91cc\\U4ec1\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_22.aspx\\\">\\U91cc\\U4ec1\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U91cc\\U4ec1\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_22.aspx\\\">\\U91cc\\U4ec1\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_23.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U516c\\U51b6\\U957f\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U516c\\U51b6\\U957f\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_23.aspx\\\">\\U516c\\U51b6\\U957f\\U7bc7</a>\";\n                }\n 
           );\n            nodeContent = \"\\U516c\\U51b6\\U957f\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_23.aspx\\\">\\U516c\\U51b6\\U957f\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_24.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U96cd\\U4e5f\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U96cd\\U4e5f\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_24.aspx\\\">\\U96cd\\U4e5f\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U96cd\\U4e5f\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_24.aspx\\\">\\U96cd\\U4e5f\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_25.aspx\";\n                        }\n                    );\n                    nodeChildArray =            
         (\n                                                {\n                            nodeContent = \"\\U8ff0\\U800c\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U8ff0\\U800c\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_25.aspx\\\">\\U8ff0\\U800c\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U8ff0\\U800c\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_25.aspx\\\">\\U8ff0\\U800c\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_26.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U6cf0\\U4f2f\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U6cf0\\U4f2f\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_26.aspx\\\">\\U6cf0\\U4f2f\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U6cf0\\U4f2f\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_26.aspx\\\">\\U6cf0\\U4f2f\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n    
            {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_27.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5b50\\U7f55\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5b50\\U7f55\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_27.aspx\\\">\\U5b50\\U7f55\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5b50\\U7f55\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_27.aspx\\\">\\U5b50\\U7f55\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_28.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U4e61\\U515a\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U4e61\\U515a\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a 
href=\\\"/guwen/bookv_28.aspx\\\">\\U4e61\\U515a\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U4e61\\U515a\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_28.aspx\\\">\\U4e61\\U515a\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_29.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5148\\U8fdb\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5148\\U8fdb\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_29.aspx\\\">\\U5148\\U8fdb\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5148\\U8fdb\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_29.aspx\\\">\\U5148\\U8fdb\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_30.aspx\";\n                        }\n  
                  );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U989c\\U6e0a\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U989c\\U6e0a\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_30.aspx\\\">\\U989c\\U6e0a\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U989c\\U6e0a\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_30.aspx\\\">\\U989c\\U6e0a\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_31.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5b50\\U8def\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5b50\\U8def\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_31.aspx\\\">\\U5b50\\U8def\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5b50\\U8def\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_31.aspx\\\">\\U5b50\\U8def\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n             
  \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_32.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5baa\\U95ee\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5baa\\U95ee\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_32.aspx\\\">\\U5baa\\U95ee\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5baa\\U95ee\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_32.aspx\\\">\\U5baa\\U95ee\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_33.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U536b\\U7075\\U516c\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = 
\"\\U536b\\U7075\\U516c\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_33.aspx\\\">\\U536b\\U7075\\U516c\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U536b\\U7075\\U516c\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_33.aspx\\\">\\U536b\\U7075\\U516c\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_34.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5b63\\U6c0f\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5b63\\U6c0f\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_34.aspx\\\">\\U5b63\\U6c0f\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5b63\\U6c0f\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_34.aspx\\\">\\U5b63\\U6c0f\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                           
 attributeName = href;\n                            nodeContent = \"/guwen/bookv_35.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U9633\\U8d27\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U9633\\U8d27\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_35.aspx\\\">\\U9633\\U8d27\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U9633\\U8d27\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_35.aspx\\\">\\U9633\\U8d27\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_36.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5fae\\U5b50\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5fae\\U5b50\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_36.aspx\\\">\\U5fae\\U5b50\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5fae\\U5b50\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a 
href=\\\"/guwen/bookv_36.aspx\\\">\\U5fae\\U5b50\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_37.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5b50\\U5f20\\U7bc7\";\n                            nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5b50\\U5f20\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_37.aspx\\\">\\U5b50\\U5f20\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5b50\\U5f20\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_37.aspx\\\">\\U5b50\\U5f20\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n               \\n              \";\n            nodeName = text;\n        },\n                {\n            nodeChildArray =             (\n                                {\n                    nodeAttributeArray =                     (\n                                                {\n                            attributeName = href;\n                            nodeContent = \"/guwen/bookv_38.aspx\";\n                        }\n                    );\n                    nodeChildArray =                     (\n                                                {\n                            nodeContent = \"\\U5c27\\U66f0\\U7bc7\";\n         
                   nodeName = text;\n                        }\n                    );\n                    nodeContent = \"\\U5c27\\U66f0\\U7bc7\";\n                    nodeName = a;\n                    raw = \"<a href=\\\"/guwen/bookv_38.aspx\\\">\\U5c27\\U66f0\\U7bc7</a>\";\n                }\n            );\n            nodeContent = \"\\U5c27\\U66f0\\U7bc7\";\n            nodeName = span;\n            raw = \"<span><a href=\\\"/guwen/bookv_38.aspx\\\">\\U5c27\\U66f0\\U7bc7</a></span>\";\n        },\n                {\n            nodeContent = \"\\n              \\n        \";\n            nodeName = text;\n        }\n    );\n    nodeContent = \"\\n         \\n              \\U5b66\\U800c\\U7bc7\\n               \\n              \\U4e3a\\U653f\\U7bc7\\n               \\n              \\U516b\\U4f7e\\U7bc7\\n               \\n              \\U91cc\\U4ec1\\U7bc7\\n               \\n              \\U516c\\U51b6\\U957f\\U7bc7\\n               \\n              \\U96cd\\U4e5f\\U7bc7\\n               \\n              \\U8ff0\\U800c\\U7bc7\\n               \\n              \\U6cf0\\U4f2f\\U7bc7\\n               \\n              \\U5b50\\U7f55\\U7bc7\\n               \\n              \\U4e61\\U515a\\U7bc7\\n               \\n              \\U5148\\U8fdb\\U7bc7\\n               \\n              \\U989c\\U6e0a\\U7bc7\\n               \\n              \\U5b50\\U8def\\U7bc7\\n               \\n              \\U5baa\\U95ee\\U7bc7\\n               \\n              \\U536b\\U7075\\U516c\\U7bc7\\n               \\n              \\U5b63\\U6c0f\\U7bc7\\n               \\n              \\U9633\\U8d27\\U7bc7\\n               \\n              \\U5fae\\U5b50\\U7bc7\\n               \\n              \\U5b50\\U5f20\\U7bc7\\n               \\n              \\U5c27\\U66f0\\U7bc7\\n              \\n        \";\n    nodeName = ul;\n    raw = \"<ul>&#13;\\n         &#13;\\n              <span><a href=\\\"/guwen/bookv_19.aspx\\\">\\U5b66\\U800c\\U7bc7</a></span>&#13;\\n               
&#13;\\n              <span><a href=\\\"/guwen/bookv_20.aspx\\\">\\U4e3a\\U653f\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_21.aspx\\\">\\U516b\\U4f7e\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_22.aspx\\\">\\U91cc\\U4ec1\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_23.aspx\\\">\\U516c\\U51b6\\U957f\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_24.aspx\\\">\\U96cd\\U4e5f\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_25.aspx\\\">\\U8ff0\\U800c\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_26.aspx\\\">\\U6cf0\\U4f2f\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_27.aspx\\\">\\U5b50\\U7f55\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_28.aspx\\\">\\U4e61\\U515a\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_29.aspx\\\">\\U5148\\U8fdb\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_30.aspx\\\">\\U989c\\U6e0a\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_31.aspx\\\">\\U5b50\\U8def\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_32.aspx\\\">\\U5baa\\U95ee\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_33.aspx\\\">\\U536b\\U7075\\U516c\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_34.aspx\\\">\\U5b63\\U6c0f\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_35.aspx\\\">\\U9633\\U8d27\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a 
href=\\\"/guwen/bookv_36.aspx\\\">\\U5fae\\U5b50\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_37.aspx\\\">\\U5b50\\U5f20\\U7bc7</a></span>&#13;\\n               &#13;\\n              <span><a href=\\\"/guwen/bookv_38.aspx\\\">\\U5c27\\U66f0\\U7bc7</a></span>&#13;\\n              &#13;\\n        </ul>\";\n}",    "{\n    nodeContent = \"\\n        \";\n    nodeName = text;\n}")

6. firstChild, the first child node

{    nodeContent = "\n        ";    nodeName = text;}
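
As the output shows, firstChild here is a whitespace-only text node rather than the ul tag. A minimal sketch of skipping such nodes follows; TagChildren is a hypothetical helper name.

```objective-c
#import <Foundation/Foundation.h>
#import "TFHpple.h"

// Hypothetical helper: returns only the children that are real tags,
// skipping the text nodes that hold the whitespace between tags.
static NSArray<TFHppleElement *> *TagChildren(TFHppleElement *element) {
    NSMutableArray<TFHppleElement *> *result = [NSMutableArray array];
    for (TFHppleElement *child in element.children) {
        if (![child isTextNode]) {
            [result addObject:child];
        }
    }
    return result;
}
```

When the tag name is known in advance, [element firstChildWithTagName:@"ul"] is a shorter way to reach the same node.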

 

The properties above all concern the HTML markup. In practice we mostly use the content property and then process the resulting NSString object.
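
The article does not say how the string is processed. One simple possibility, sketched here as an assumption, is to split content on whitespace and discard the empty pieces; TitlesFromContent is a hypothetical helper name.

```objective-c
#import <Foundation/Foundation.h>

// Hypothetical helper: splits an element's content on whitespace and newlines
// and drops the empty pieces left by runs of consecutive separators.
static NSArray<NSString *> *TitlesFromContent(NSString *content) {
    NSCharacterSet *separators = [NSCharacterSet whitespaceAndNewlineCharacterSet];
    NSMutableArray<NSString *> *titles = [NSMutableArray array];
    for (NSString *piece in [content componentsSeparatedByCharactersInSet:separators]) {
        if (piece.length > 0) {
            [titles addObject:piece];
        }
    }
    return titles;
}
```

For the content shown earlier, TitlesFromContent(element.content) should yield one entry per chapter title. This only works because the titles themselves contain no spaces; text with internal spaces would need a different separator.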

 

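With that, we have retrieved the data and processed it into the form we want. TFHppleElement is an important class, but its detailed usage is not covered here.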

 
