python - Extracting data from div tags in Python

Question

I'm trying to scrape data from a web page where some of the data is nested inside div tags.

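import urllib2
# Assumed import: the bs4 package (.stripped_strings in the answer requires it)
from bs4 import BeautifulSoup

url = 'http://london2012.fiba.com/pages/eng/fe/12/olym/p/gid/26/grid/A/rid/9087/sid/6233/game.html'
boxurl = urllib2.urlopen(url).read()
soup = BeautifulSoup(boxurl)

linescoreA = soup.find("div", {"class": "scoreA"})

print linescoreA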

This prints:

<div class="scoreA">
<div class="period">19</div>
<div class="period">22</div>
<div class="period">22</div><div class="period">26</div>
<div class="final">89</div>
<div class="clear"></div>
</div>

That's where I get stuck. How do I get the data out of the div tags?

score 3 · Accepted Answer

To get just the text data, use .stripped_strings, which yields each non-empty text node with surrounding whitespace removed:

print list(linescoreA.stripped_strings)
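
As a rough illustration of the line above, here is a minimal Python 3 sketch. It assumes the bs4 package is installed and uses the HTML from the question as an inline string rather than fetching the URL; the names html and values are only illustrative.

```python
from bs4 import BeautifulSoup

# HTML copied from the question's output, inlined so the sketch runs offline.
html = """
<div class="scoreA">
<div class="period">19</div>
<div class="period">22</div>
<div class="period">22</div><div class="period">26</div>
<div class="final">89</div>
<div class="clear"></div>
</div>
"""

# "html.parser" is the parser bundled with Python, so no extra dependency is needed.
soup = BeautifulSoup(html, "html.parser")
linescoreA = soup.find("div", {"class": "scoreA"})

# stripped_strings is a generator; whitespace-only text nodes are skipped.
values = list(linescoreA.stripped_strings)
print(values)  # ['19', '22', '22', '26', '89']
```

The empty "clear" div contributes nothing because it contains no text.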

score 1 · Accepted Answer

Try:

# Iterate over the nested div tags only; bare text nodes have no findAll()
for node in soup.find("div", {"class": "scoreA"}).findAll("div"):
    print ''.join(node.findAll(text=True))

Or what about:

for node in soup.find("div", {"class": "scoreA"}):
    print node.string

Sorry, I can't test this for you.
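
For readers who want the period scores and the final score separately, the loop idea above might be sketched as follows in Python 3. This assumes the bs4 package and the HTML shown in the question; the names score_div, periods, and final are hypothetical and not part of the original answers.

```python
from bs4 import BeautifulSoup

# HTML copied from the question's output, inlined so the sketch runs offline.
html = """
<div class="scoreA">
<div class="period">19</div>
<div class="period">22</div>
<div class="period">22</div><div class="period">26</div>
<div class="final">89</div>
<div class="clear"></div>
</div>
"""

soup = BeautifulSoup(html, "html.parser")
score_div = soup.find("div", class_="scoreA")

# Select child divs by class so whitespace text nodes are never visited.
periods = [
    int(div.get_text(strip=True))
    for div in score_div.find_all("div", class_="period")
]
final = int(score_div.find("div", class_="final").get_text(strip=True))

print(periods)  # [19, 22, 22, 26]
print(final)    # 89
```

Filtering by class avoids the empty "clear" div and the newline text nodes that a plain loop over the parent tag would also visit.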
