php - 文字列から : 文字を削除

Question

タイトルを取得するために Web をクロールします

$title = strip_tags($link1->plaintext);

しかし、結果には次のような結果がありますData Mining&#58 Concepts and Techniques

それを削除するには&#58？ありがとうございました

score 2 · Accepted Answer

問題は、コロン:の文字エンティティ参照ですが、例が正しく終了していないことです (末尾のセミコロンがありません)。次の（かなり素朴な）正規表現を使用して、終了していない参照を修正できます。

$broken = "Data Mining&#58 Concepts and Techniques";
$fixed = preg_replace('/(&#x?[a-e0-9]+)\b/i', '$1;', $broken);

その後、次を使用できますhtml_entity_decode。

echo html_entity_decode($fixed); // Data Mining: Concepts and Techniques

score 0 · Accepted Answer

0

$title = str_replace("&#58", "", strip_tags($link1->plaintext));

于 2012-09-23T22:41:54.007 に答える

2 に答える 2