php - サイトのすべてのページからタイトルと説明を抽出する方法はありますか?

Question

私はウェブサイトの2つのバージョンを持っています:

もう ftp/code でアクセスできないものと、ftp/code でアクセスできるほぼ同じものですが、タイトルと説明のタグが取り除かれています。直接アクセスできなくなったサイトをクロールし、すべてのページのタイトル/説明のメタタグを抽出する方法 (PHP など) はありますか?

これらのタグを、アクセスできるサイトの新しいバージョンに挿入したいと考えています。

score 1 · Accepted Answer

You can use this to extract the meta description from a page:

$xpath = new DOMXPath($doc);
$description = $xpath->query('/html/head/meta[name@="description"]/@content');

This is an alternative solution:

$doc = new DOMDocument;
$doc->loadHTMLFile('http://example.com');

$title = $doc->getElementsByTagName('title');
$title = $title[0];

$metas = $doc->getElementsByTagName('meta');

foreach ($metas as $meta) {
  if (strtolower($meta->getAttribute('name')) == 'description') {
    $description = $meta->getAttribute('value');
  }
}

Source: #6113716

php - サイトのすべてのページからタイトルと説明を抽出する方法はありますか?

1 に答える 1

Related

Reference