python - DBpedia からの情報の抽出

Question

私はプロジェクトに取り組んでおり、DBpedia を利用したいと考えています。私は数百のDBpediaリンクを持っています

http://dbpedia.org/page/Federal_Emergency_Management_Agency

時間の有効活用とは:

それらのページをクロールして必要な情報を抽出するには?
Python から SPARQL クエリを使用してデータをクエリするには?

score 6 · Accepted Answer

まず、DBpedia リソースを識別する URI が

http://dbpedia.org/page/Federal_Emergency_Management_Agency

ページ付きですが、

http://dbpedia.org/resource/Federal_Emergency_Management_Agency

リソース付き。次に、SPARQL を使用して情報を取得する方がはるかに高速です。SPARQL は RDF のクエリ言語であり、RDF データを取得しようとしています。FEMA に関する情報を取得するために SPARQL で行う必要があるのは、describe クエリだけです。

describe
  dbpedia:Federal_Emergency_Management_Agency

SPARQL の結果

Describe クエリは複数のリソースを記述できるため、たとえば次のように実行できます。

describe
  dbpedia:Federal_Emergency_Management_Agency
  dbpedia:Mount_Monadnock
  # more resources...

SPARQL の結果

一部のリソースに関する特定の情報のみが必要な場合でも、values関心のあるリソースを使用してプログラムで注入することで、選択クエリまたは構築クエリで同様のことを行うことができます。

select ?label where { 
  values ?resource {
    dbpedia:Federal_Emergency_Management_Agency # put your values in here and
    dbpedia:Mount_Monadnock                     # ?resource will be bound to each
  }
  ?resource rdfs:label ?label .
  filter( langMatches( lang(?label), "EN" ))
}

SPARQL の結果

コンストラクトを使用して、モデル内のこれらのトリプルを取得することもできます。

construct {
  ?resource rdfs:label ?label 
}
where { 
  values ?resource {
    dbpedia:Federal_Emergency_Management_Agency
    dbpedia:Mount_Monadnock
  }
  ?resource rdfs:label ?label .
  filter( langMatches( lang(?label), "EN" ))
}

SPARQL の結果

python - DBpedia からの情報の抽出

1 に答える 1

Related

Reference