html - scrapy返回的response为str对象,如何转换成response提取目标值?
阿神
阿神 2017-04-17 17:44:42
0
2
706

通过抓取,最后返回的对象为以下内容,发现对象属性为string,现在应该如何提取?

{"r":0,
 "msg": ["<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19564906\">\n<img src=\"https:\/\/pic2.zhimg.com\/35acee23dc6b42ee9abead8f2d00c9a5_xs.jpg\" alt=\"Android \u6e38\u620f\">\n<strong>Android \u6e38\u620f<\/strong>\n<\/a>\n<p><\/p>\n\n<a id=\"t::-4897\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19568806\">\n<img src=\"https:\/\/pic4.zhimg.com\/23c6509df_xs.jpg\" alt=\"Unity\uff08\u6e38\u620f\u5f15\u64ce\uff09\">\n<strong>Unity\uff08\u6e38\u620f\u5f15\u64ce\uff09<\/strong>\n<\/a>\n<p>Unity \u662f\u4e00\u79cd\u96c6\u6210\u7684\u521b\u4f5c\u5de5\u5177\uff0c\u9488\u5bf93D\u6e38\u620f\u548c\u5176\u4ed6\u4ea4\u4e92\u5185\u5bb9\uff08\u2026<\/p>\n\n<a id=\"t::-6199\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19552038\">\n<img src=\"https:\/\/pic2.zhimg.com\/bec983801_xs.jpg\" alt=\"\u76db\u5927\u7f51\u7edc\">\n<strong>\u76db\u5927\u7f51\u7edc<\/strong>\n<\/a>\n<p>\u4e0a\u6d77\u76db\u5927\u7f51\u7edc\u53d1\u5c55\u6709\u9650\u516c\u53f8\uff08\u7b80\u79f0\u201c\u76db\u5927\u7f51\u7edc\u201d\uff09\u662f\u4e2d\u56fd\u7684\u7f51\u7edc\u6e38\u620f\u8fd0\u2026<\/p>\n\n<a id=\"t::-611\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19583842\">\n<img src=\"https:\/\/pic3.zhimg.com\/097b126eebd99490cdab183c0389b30e_xs.jpg\" alt=\"\u72ec\u7acb\u6e38\u620f\">\n<strong>\u72ec\u7acb\u6e38\u620f<\/strong>\n<\/a>\n<p>\u72ec\u7acb\u6e38\u620f\u6307\u6e38\u620f\u5f00\u53d1\u8005\u6ca1\u6709\u6e38\u620f\u516c\u53f8\u6216\u6e38\u620f\u53d1\u884c\u5546\u63d0\u4f9b\u7684\u85aa\u8d44\uff0c\u5fc5\u987b\u72ec\u2026<\/p>\n\n<a id=\"t::-11190\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19590019\">\n<img src=\"https:\/\/pic2.zhimg.com\/8ef8852e1_xs.jpg\" alt=\"\u6865\u724c\">\n<strong>\u6865\u724c<\/strong>\n<\/a>\n<p>\u7231\u597d\u8005\u904d\u53ca\u5168\u4e16\u754c\u7684\u4e00\u79cd\u6251\u514b\u6e38\u620f\uff0c\u4e16\u754c\u8303\u56f4\u3001\u6d32\u9645\u8303\u56f4\u90fd\u8bbe\u6709\u6865\u724c\u534f\u2026<\/p>\n\n<a id=\"t::-13245\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19600554\">\n<img src=\"https:\/\/pic1.zhimg.com\/bb3b26f06cf1339294eb24761761f364_xs.jpg\" alt=\"\u6e38\u620f\u4ea7\u4e1a\">\n<strong>\u6e38\u620f\u4ea7\u4e1a<\/strong>\n<\/a>\n<p><\/p>\n\n<a id=\"t::-16821\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19550880\">\n<img src=\"https:\/\/pic4.zhimg.com\/ae43a82db_xs.jpg\" alt=\"\u827a\u7535 (EA)\">\n<strong>\u827a\u7535 (EA)<\/strong>\n<\/a>\n<p>\u7f8e\u56fd\u827a\u7535\u662f\u5168\u7403\u8457\u540d\u7684\u4e92\u52a8\u5a31\u4e50\u8f6f\u4ef6\u5236\u4f5c\u4e0e\u53d1\u884c\u516c\u53f8\uff0c\u603b\u90e8\u4f4d\u4e8e\u7f8e\u56fd\u52a0\u2026<\/p>\n\n<a id=\"t::-218\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19632659\">\n<img src=\"https:\/\/pic3.zhimg.com\/9ab3edd2b3164d91de0099b2282bbd4a_xs.jpg\" alt=\"\u523a\u5ba2\u4fe1\u6761\uff08Assassin's Creed\uff09\">\n<strong>\u523a\u5ba2\u4fe1\u6761\uff08Assassin's Creed\uff09<\/strong>\n<\/a>\n<p>\u2014\u2014\u613f\u6d1e\u5bdf\u4e4b\u7236\u6307\u5f15\u6211\u7b49\u3002 \u4e07\u7269\u7686\u865a\uff0c\u4e07\u4e8b\u7686\u5141\u3002 \u8fd9\u662f\u6211\u4eec\u7684\u7956\u5148\u2026<\/p>\n\n<a id=\"t::-27576\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19660478\">\n<img src=\"https:\/\/pic1.zhimg.com\/922bbf820_xs.jpg\" alt=\"\u6587\u660e\uff08\u7cfb\u5217\u6e38\u620f\uff09\">\n<strong>\u6587\u660e\uff08\u7cfb\u5217\u6e38\u620f\uff09<\/strong>\n<\/a>\n<p>\u300a\u6587\u660e\u300b\u6700\u65e9\u7531\u72ec\u7acb\u5f00\u53d1\u8005\u5f00\u53d1\uff0c\u540e\u7ecfMicroprose\uff0c\u518d\u5230F\u2026<\/p>\n\n<a id=\"t::-36841\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19553255\">\n<img src=\"https:\/\/pic3.zhimg.com\/4b784a796_xs.jpg\" alt=\"Xbox\">\n<strong>Xbox<\/strong>\n<\/a>\n<p>Xbox \u662f\u5fae\u8f6f\u6240\u5f00\u53d1\u3001\u9500\u552e\u7684\u5bb6\u7528\u6e38\u620f\u4e3b\u673a\u3002<\/p>\n\n<a id=\"t::-1015\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19552479\">\n<img src=\"https:\/\/pic4.zhimg.com\/aea34d9b3_xs.jpg\" alt=\"\u661f\u9645\u4e89\u9738\">\n<strong>\u661f\u9645\u4e89\u9738<\/strong>\n<\/a>\n<p>\u300a\u661f\u9645\u4e89\u9738\u300b\u662f\u7531\u66b4\u96ea\u5a31\u4e50\u5236\u4f5c\u53d1\u884c\u7684\u4e00\u6b3e\u8457\u540d\u5373\u65f6\u6218\u7565\u6e38\u620f\u3002\u8fd9\u662f\u661f\u2026<\/p>\n\n<a id=\"t::-757\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19625841\">\n<img src=\"https:\/\/pic4.zhimg.com\/8d6a36a3f_xs.jpg\" alt=\"Cocos2d-x\">\n<strong>Cocos2d-x<\/strong>\n<\/a>\n<p>Cocos2d-x\u662f\u4e00\u4e2a\u5f00\u6e90\u7684\u79fb\u52a82D\uff08\u76ee\u524d\u5df2\u7ecf\u67093D\u7248\u672c\uff09\u6e38\u2026<\/p>\n\n<a id=\"t::-25295\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19839210\">\n<img src=\"https:\/\/pic1.zhimg.com\/47206e784_xs.jpg\" alt=\"\u7cbe\u7075\u5b9d\u53ef\u68a6\uff08Pok\u00e9mon\uff09\">\n<strong>\u7cbe\u7075\u5b9d\u53ef\u68a6\uff08Pok\u00e9mon\uff09<\/strong>\n<\/a>\n<p>\u7cbe\u7075\u5b9d\u53ef\u68a6\u7cfb\u5217\uff08Pok\u00e9mon\uff0c\u30dd\u30b1\u30c3\u30c8\u30e2\u30f3\u30b9\u30bf\u30fc\uff09\uff0c\u53c8\u79f0\u53e3\u888b\u2026<\/p>\n\n<a id=\"t::-98461\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19785574\">\n<img src=\"https:\/\/pic3.zhimg.com\/ae975386a_xs.jpg\" alt=\"Ingress\uff08\u589e\u5f3a\u73b0\u5b9e\u6e38\u620f\uff09\">\n<strong>Ingress\uff08\u589e\u5f3a\u73b0\u5b9e\u6e38\u620f\uff09<\/strong>\n<\/a>\n<p>Ingress \u662f\u4e00\u6b3e\u4fb5\u5165\u5f0f\u865a\u62df\u73b0\u5b9e\u6e38\u620f\u3001\u5927\u578b\u591a\u4eba\u7535\u5b50\u6e38\u620f\uff0c\u4e2d\u2026<\/p>\n\n<a id=\"t::-78940\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19670052\">\n<img src=\"https:\/\/pic2.zhimg.com\/24e0bcc51_xs.jpg\" alt=\"\u6881\u6b22\">\n<strong>\u6881\u6b22<\/strong>\n<\/a>\n<p>\u6881\u6b22 <\/p>\n\n<a id=\"t::-40034\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19558999\">\n<img src=\"https:\/\/pic3.zhimg.com\/05912f7847b0654a695c8bc68d348ba2_xs.jpg\" alt=\"\u7b2c\u4e00\u4eba\u79f0\u89c6\u89d2\u5c04\u51fb\u6e38\u620f\uff08FPS\uff09\">\n<strong>\u7b2c\u4e00\u4eba\u79f0\u89c6\u89d2\u5c04\u51fb\u6e38\u620f\uff08FPS\uff09<\/strong>\n<\/a>\n<p>\u7b2c\u4e00\u4eba\u79f0\u5c04\u51fb\uff08First-person shooter\uff09\u6e38\u620f\u662f\u2026<\/p>\n\n<a id=\"t::-2937\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19552043\">\n<img src=\"https:\/\/pic1.zhimg.com\/df7985174_xs.jpg\" alt=\"\u76db\u5927\u521b\u65b0\u9662\">\n<strong>\u76db\u5927\u521b\u65b0\u9662<\/strong>\n<\/a>\n<p><\/p>\n\n<a id=\"t::-612\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19556258\">\n<img src=\"https:\/\/pic3.zhimg.com\/389f27418bb3b6e5baddb06b00d98f36_xs.jpg\" alt=\"\u6e38\u620f\u5f15\u64ce\">\n<strong>\u6e38\u620f\u5f15\u64ce<\/strong>\n<\/a>\n<p><\/p>\n\n<a id=\"t::-2000\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19554169\">\n<img src=\"https:\/\/pic4.zhimg.com\/d74afc7906aa5209503de78978777997_xs.jpg\" alt=\"\u7535\u5b50\u6e38\u620f\">\n<strong>\u7535\u5b50\u6e38\u620f<\/strong>\n<\/a>\n<p><\/p>\n\n<a id=\"t::-1314\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>","<p class=\"item\"><p class=\"blk\">\n<a target=\"_blank\" href=\"\/topic\/19564209\">\n<img src=\"https:\/\/pic3.zhimg.com\/d3f7f95975ae3ff5cfeedad9a4febe56_xs.jpg\" alt=\"\u6e38\u620f\u754c\u9762\u8bbe\u8ba1\">\n<strong>\u6e38\u620f\u754c\u9762\u8bbe\u8ba1<\/strong>\n<\/a>\n<p><\/p>\n\n<a id=\"t::-4657\" href=\"javascript:;\" class=\"follow meta-item zg-follow\"><i class=\"z-icon-follow\"><\/i>\u5173\u6ce8<\/a>\n\n<\/p><\/p>"]
}

<type 'str'>

如何转换为response对象,然后提取中间的目标值?

阿神
阿神

闭关修行中......

membalas semua(2)
黄舟

Ini ialah JSON, ubah ia menjadi:

import json

content = '...你那堆东西'
result = json.loads(content)

print result['msg']

Ini akan menukar rentetan format JSON yang anda perolehi kepada kamus result dan kemudian memprosesnya seperti biasa.

迷茫

Apa yang dikembalikan ialah data json, yang boleh dihuraikan terus menggunakan perpustakaan json python.
Saya rasa merangkak data json adalah yang paling menyeronokkan untuk diproses
Banyak data json yang saya rangkak baru-baru ini ialah data json yang dijana oleh permintaan seterusnya

.
Muat turun terkini
Lagi>
kesan web
Kod sumber laman web
Bahan laman web
Templat hujung hadapan