json - python中用正则表达式去掉字符串中的冒号

Question

初学python，最近尝试爬数据，json字符串的value中有冒号，需要去掉。我的代码如下。 a和b都是value中会有冒号的字符串 {代码...} 代码执行结果是只剩 Customer Experience + Innovation (CX+I) Intern Brands'，...

大家讲道理 · Answer

import re
result = re.sub('^(Title|cmp|cmpesc:)(.+):(.*)',
                '\1\2\3',
                "Title:'Intern: Customer Experience + Innovation (CX+I) Intern Brands'")

print(result) # Title:'Intern Customer Experience + Innovation (CX+I) Intern Brands'

PHPz · Answer

Dalam kes ini:

''.join(re.split('(?


Itu bagus

巴扎黑 · Answer

Memang betul, saya salah baca soalan....

高洛峰 · Answer

Tak perlu buang titik bertindih, cuma jadikan kamus~

>>> a = "Title:'Intern: Customer Experience + Innovation (CX+I) Intern Brands'";\
b = "cmp:'Adecco: USA',cmpesc:'Adecco: USA'"
>>> dict([s.split(':',1) for s in a.split(',')])
{'Title': "'Intern: Customer Experience + Innovation (CX+I) Intern Brands'"}
>>> dict([s.split(':',1) for s in b.split(',')])
{'cmpesc': "'Adecco: USA'", 'cmp': "'Adecco: USA'"}
>>>

Tulis sebagai fungsi

a = "Title:'Intern: Customer Experience + Innovation (CX+I) Intern Brands'"
b = "cmp:'Adecco: USA',cmpesc:'Adecco: USA'"

def fn(x):
    return dict((s.split(':',1) for s in x.replace("'","").split(',')))

print(fn(a))
print(fn(b))

# {'Title': 'Intern: Customer Experience + Innovation (CX+I) Intern Brands'}
# {'cmp': 'Adecco: USA', 'cmpesc': 'Adecco: USA'}