python - Creating a sparse matrix in Python

Translated from: https://stackoverflow.com/questions/37953877 2016-06-21T20:21:34.953

I am working with some data and want to build a sparse matrix from it that I can later use for clustering.

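# Read the tab-separated file; keep the term, ids, and quantity columns.
rows = []
with open('data', 'r') as fileHandle:
    for line in fileHandle:
        fields = line.rstrip('\n').split('\t')
        json_list = [fields[0], fields[1], fields[3]]
        rows.append(json_list)  # keep each row instead of discarding it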

Currently, the data looks like this:

term, ids, quantity
['buick', '123,234', '500']
['chevy', '345,456', '300']
['suv', '123', '100']

The output I need looks like this:

term, quantity, '123', '234', '345', '456', '567'
buick, 500, 1, 1, 0, 0, 0
chevy, 300, 0, 0, 1, 1, 0
suv,   100, 1, 0, 0, 0, 0

I tried using the numpy sparse matrix library, but without success.
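
For illustration, here is a minimal sketch of one way the desired table could be produced. It is not taken from the original thread. It assumes the rows are already in the [term, ids, quantity] form shown above and that the full list of id columns is known in advance (the desired header includes '567' even though no sample row uses it). The names columns, col_index, coo_rows, coo_cols, coo_vals, and to_dense are hypothetical. The sketch stores only the non-zero cells as (row, column, value) triplets, then expands them to a dense table purely to check the result.

```python
# Sample rows in the [term, ids, quantity] form shown in the question.
rows = [
    ['buick', '123,234', '500'],
    ['chevy', '345,456', '300'],
    ['suv', '123', '100'],
]

# Assumption: the column set is known up front; '567' appears in the
# desired header even though no sample row uses it.
columns = ['123', '234', '345', '456', '567']
col_index = {col_id: j for j, col_id in enumerate(columns)}

# COO-style triplets: one (row, col, value) entry per non-zero cell.
coo_rows, coo_cols, coo_vals = [], [], []
terms, quantities = [], []
for i, (term, ids, quantity) in enumerate(rows):
    terms.append(term)
    quantities.append(int(quantity))
    for col_id in ids.split(','):
        coo_rows.append(i)
        coo_cols.append(col_index[col_id.strip()])
        coo_vals.append(1)


def to_dense(n_rows, n_cols):
    """Expand the triplets into a list of lists, for checking only."""
    dense = [[0] * n_cols for _ in range(n_rows)]
    for r, c, v in zip(coo_rows, coo_cols, coo_vals):
        dense[r][c] = v
    return dense


dense = to_dense(len(rows), len(columns))
for term, quantity, indicator_row in zip(terms, quantities, dense):
    print(term, quantity, *indicator_row, sep=', ')
```

If SciPy is installed, the same triplets are in the form accepted by scipy.sparse.coo_matrix((coo_vals, (coo_rows, coo_cols)), shape=(len(rows), len(columns))), which would give a real sparse matrix suitable as clustering input. The quantity values are kept in a separate list here; whether they belong inside the matrix depends on the clustering setup.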
