python - リスト内のアイテムへのアクセス

Question

Twitterのハッシュタグを集めてみました。エンティティを取得するために必要なドキュメントを読む https://dev.twitter.com/docs/platform-objects/tweets

"entities":
{
    "hashtags":[],
    "urls":[],
    "user_mentions":[]
}

現在、Entities dict とハッシュタグリストにアクセスできます。

for line in iter(my_tweet_file)
    tweetionary = json.loads(line)
    print tweetionary["entities"]
    print tweetionary["entities"]["hashtags"]

しかし、ハッシュタグリスト内の項目を正しく解析できません。テキスト値 (次の例では lin と Scot) に興味があります。

[{u'indices': [41, 45], u'text': u'lin'}, {u'indices': [55, 60], u'text': u'Scot'}]

ハッシュタグリストから抽出したテキストの辞書を作成したいと考えています。

ありがとう、デニー

score 0 · Accepted Answer

ビルトインを使用してこれをうまく行うことができますCounter()：

from collections import Counter

extracted = [{u'indices': [41, 45], u'text': u'lin'},
             {u'indices': [55, 60], u'text': u'Scot'}]

count = Counter([d['text'] for d in extracted])

#Note: For python 2.x remove brackets around print statements
print(count['lin'])
print(count.most_common())

出力：

1
[('Scot', 1), ('lin', 1)]

python - リスト内のアイテムへのアクセス

1 に答える 1

Related

Reference