xml - XPath for nested elements that contain text() but not whitespace

Question

I have some poor XHTML that I need to parse with XPath. It looks like this:

<div class="foo">
  i need this text
  <br/>
  <br/>
  <span>sometext</span>
</div>

<div class="foo">
  <span>some other text</span>
  <span>sometext</span>
</div>

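I want to select all the content of the first div, the one containing "i need this text". My problem is that the div elements also contain whitespace and other nodes, so //div[@class="foo"]/text() returns empty (whitespace-only) strings for the second div as well. I want to ignore these empty results. How can I do that?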

score 19 · Accepted Answer

Use:

//div
   [.//text()
        [normalize-space() = 'i need this text']
   ]
    //text()[normalize-space()]

This selects every text-node descendant that is not whitespace-only of any div in the document that has a text-node descendant whose normalized string value is the string "i need this text".
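To make the two predicates concrete, here is a rough Python sketch that mimics the expression with the standard library. It is an emulation, not an XPath engine: xml.etree.ElementTree does not support text() or normalize-space() in its path syntax, so the filtering is done by hand. The <root> wrapper and the names normalize_space and select_text_nodes are assumptions made for this illustration.

```python
import re
import xml.etree.ElementTree as ET

# The question's two divs, wrapped in a hypothetical <root> element
# so that the snippet parses as a single document.
XHTML = """<root>
<div class="foo">
  i need this text
  <br/>
  <br/>
  <span>sometext</span>
</div>

<div class="foo">
  <span>some other text</span>
  <span>sometext</span>
</div>
</root>"""


def normalize_space(s):
    # Hand-written stand-in for XPath normalize-space():
    # XPath whitespace is space, tab, CR and LF only.
    return re.sub(r"[ \t\r\n]+", " ", s).strip(" \t\r\n")


def select_text_nodes(root, wanted):
    selected = []
    for div in root.iter("div"):
        # Descendant text nodes of this div, in document order.
        texts = list(div.itertext())
        # Outer predicate: some text node normalizes to `wanted`.
        if any(normalize_space(t) == wanted for t in texts):
            # Final step: keep only text nodes that are not whitespace-only.
            selected.extend(t for t in texts if normalize_space(t))
    # Note: with nested divs this list could repeat a text node once per
    # matching ancestor, whereas an XPath node-set has no duplicates.
    return selected


root = ET.fromstring(XHTML)
result = select_text_nodes(root, "i need this text")
print([normalize_space(t) for t in result])  # ['i need this text', 'sometext']
```

Only the first div passes the outer test, and the whitespace-only nodes around its br elements are dropped. The selected text nodes still carry their original surrounding whitespace; the sketch normalizes them only for printing.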

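The normalize-space() function takes a string (the string value of the context node, if no argument is given) and produces another string from it, in which all leading and trailing whitespace characters are removed and every inner group of adjacent whitespace characters is replaced by a single space.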
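The behaviour described above can be sketched in a few lines of Python. This is only an approximation written for illustration (the name normalize_space and the sample strings are made up here), but it follows the same rule: trim both ends, then collapse each inner run of whitespace to one space.

```python
import re

# XPath treats only these four characters as whitespace.
XPATH_WS = " \t\r\n"


def normalize_space(s):
    # Collapse every run of XPath whitespace to a single space ...
    collapsed = re.sub("[" + XPATH_WS + "]+", " ", s)
    # ... then drop what is left at the start and the end.
    return collapsed.strip(XPATH_WS)


samples = [
    "\n  i need this text\n  ",  # text node from the first div
    "some \t other\n\ntext",     # mixed inner whitespace
    "\n  ",                      # whitespace-only text node
]
for s in samples:
    print(repr(s), "->", repr(normalize_space(s)))
```

A whitespace-only node normalizes to the empty string, and XPath converts an empty string to false when it is used as a predicate. That is why text()[normalize-space()] keeps only the text nodes with real content.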

score -1 · Answer

Try this selector:

//span[@class='glyphicon glyphicon-list mr5']/..[contains(normalize-space(text()),'Applications')]
