python - コロン付きの文字列で辞書を作成する

Question

次のような文字列 s があるとします。

s = 'Title: A title Date: November 23 1234 Other: Other information'

次のような辞書を作成することは可能ですか:

{'Title':'A title','Date':'November 23 1234','Other':'Other information'}

最初は単純にコロンのある場所で分割するだけだと思っていましたが、タイトルの値が何であるかわからないため、タイトル自体にコロンが含まれている可能性があります。残念ながら、この情報のソースもコンマで区切られていないので、それも面倒です. EG、どうすればそれを行うことができますか:

s = 'Title: Example: of a title Date: November 23 1234 Other: Other information'

その例のタイトルはですExample: of a title。

この質問を確認しましたが、私の質問に対する回答ではありませんでした。

前もって感謝します。

score 3 · Accepted Answer

import re
from itertools import izip

s = 'Title: Example: of a title Date: November 23 1234 Other: Other information'

keys = ['Title', 'Date', 'Other']
pattern = re.compile('({})\s+'.format(':|'.join(keys)))

print dict(izip(*[(i.strip() for i in (pattern.split(s)) if i)]*2))

アウト：

{'Date:': 'November 23 1234 ',
 'Other:': 'Other information',
 'Title:': 'Example: of a title '}

score 1 · Accepted Answer

あなたは正規表現でそれを行うことができます:

>>> import re
>>> 
>>> s = 'Title: A title Date: November 23 1234 Other: Other information'
>>> matches = re.findall(r'(\w+): ((?:\w+\s)+)', s)
>>> 
>>> dict(matches)
    {'Date': 'November 23 1234 ', 'Other': 'Other ', 'Title': 'A title '}

score 0 · Accepted Answer

コロンが複数ある（ネストされている可能性がある）ため、コロンで分割することはできません。

キーワード（、、Title）が修正されている場合はDate、Other次の正規表現を試すことができます。

import re
reg_ex = re.compile("Title\:(.+)Date\:(.+)Other\:(.+)")
reg_ex.match(s).groups() #(' A title ', ' November 23 1234 ', ' Other information')

python - コロン付きの文字列で辞書を作成する

3 に答える 3

Related

Reference