python - XML から pandas データフレームへ

Question

次のような数千行の XML ファイルがあります。

<Word x1="206" y1="120" x2="214" y2="144" font="Times-Roman" style="font-size:22pt">WORD</Word>

私はそれを（すべての属性）に変換したいpandas dataframe。それを行うには、美しいスープを使用してファイルをループし、値を行ごとに挿入するか、列として挿入するリストを作成します。ただし、私が説明したことを達成するためのよりpythonicな方法があるかどうかを知りたいです。前もって感謝します。

コード例:

x1list=[]
x2list=[]

for word in soup.page.findAll('word'):
    x1list.append(int(word['x1']))
    x2list.append(int(word['x2']))
df=DataFrame({'x1':x1list,'x2':x2list})

score 3 · Accepted Answer

これを試して：

DataFrame.from_records([(int(word['x1']), int(word['x2']))
                        for word in soup.page.findAll('word')],
                       columns=('x1', 'x2'))

python - XML から pandas データフレームへ

1 に答える 1

Related

Reference