php - PHP regex match between two patterns

Question

I'm trying to parse a log file that contains many traces, some of which span multiple lines.

Example:

[trace-123] <request>This is a log line</request>
[trace-124] <reply>This is another log line

this is part of "[trace-124]" still.</reply>
[trace-125] <request>final log line.</request>

I'm trying to use preg_match_all to get an array of all the traces.

$file = file_get_contents("traces.txt");
$tracePattern = "/(\[trace-[0-9]*+\]+[\s\S]*)(?<=\<\/reply>|\<\/request>)/";

preg_match_all($tracePattern,$file,$lines);

echo "<pre>";print_r($lines);echo "</pre>";

Ideally, I'd like the result to look like this:

Array
(
    [0] => [trace-123] <request>This is a log line</request>
    [1] => [trace-124] <reply>This is another log line

this is part of "[trace-124]" still.</reply>
    [2] => [trace-125] <request>final log line.</request>
)

But when I run it, I get an array in which a single element contains everything. My goal when writing the expression was basically to look for:

\[trace-[0-9]*\]

and capture everything from that match up to the next match.

I found that

\[trace-[0-9]*+\].*

works fairly well, but it breaks as soon as there is a newline.

score 3 · Accepted Answer

The following is probably a better approach here: split the text instead of matching it.

$results = preg_split('/\R(?=\[trace[^\]]*\])/', $text);
print_r($results);

See the working demo.

Output

Array
(
    [0] => [trace-123] <request>This is a log line</request>
    [1] => [trace-124] <reply>This is another log line

this is part of "[trace-124]" still.</reply>
    [2] => [trace-125] <request>final log line.</request>
)
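
For anyone who wants to run this without a traces.txt file, here is a minimal self-contained sketch of the same split. The sample log is inlined in a nowdoc, and the variable name $text is simply carried over from the snippet above. It assumes every record starts at the beginning of a line with a [trace...] tag.

```php
<?php
// Sample log from the question, inlined so the sketch runs on its own.
$text = <<<'LOG'
[trace-123] <request>This is a log line</request>
[trace-124] <reply>This is another log line

this is part of "[trace-124]" still.</reply>
[trace-125] <request>final log line.</request>
LOG;

// \R matches any line break; the lookahead (?=...) only allows the split
// when the next line begins with "[trace...]", so line breaks inside a
// record are left alone and the lookahead itself consumes nothing.
$results = preg_split('/\R(?=\[trace[^\]]*\])/', $text);

print_r($results);
```

The quoted "[trace-124]" inside the second record is not split on, because it is not directly preceded by a line break.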

score 2 · Accepted Answer

This works in MULTI_LINE mode. It strips leading whitespace and the trailing newline from each record.

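Edit: This assumes the anchor [trace- ] is either at the very beginning of a line, or separated from the beginning of the line only by non-newline whitespace.
That is the only record separator I could identify.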

 #  ^[^\S\n]*(\[trace-[^]]*\][^\n]*(?:(?!\s+\[trace-[^]]*\])\n[^\n]*)*)

 ^ [^\S\n]* 
 (
      \[trace- [^]]* \] [^\n]* 

      (?:
           (?! \s+ \[trace- [^]]* \] )
           \n [^\n]* 
      )*
 )

Output (enclosed in single quotes)

 '[trace-123] <request>This is a log line</request>'
 '[trace-124] <reply>This is another log line

 this is part of "[trace-124]" still.</reply>'
 '[trace-125] <request>final log line.</request>'
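
A rough sketch of how the compact form of this pattern could be used from PHP, assuming the / delimiter and the m modifier (PHP's spelling of multi-line mode). The variable names $pattern, $matches and $records are only illustrative. The records are taken from capture group 1, which is what leaves out the leading whitespace.

```php
<?php
// Sample log from the question, inlined so the sketch runs on its own.
$text = <<<'LOG'
[trace-123] <request>This is a log line</request>
[trace-124] <reply>This is another log line

this is part of "[trace-124]" still.</reply>
[trace-125] <request>final log line.</request>
LOG;

// m modifier: ^ matches at the start of every line, not just the string.
// The negative lookahead stops the record before a newline that is
// followed (after optional whitespace) by the next "[trace-...]" tag.
$pattern = '/^[^\S\n]*(\[trace-[^]]*\][^\n]*(?:(?!\s+\[trace-[^]]*\])\n[^\n]*)*)/m';

preg_match_all($pattern, $text, $matches);

// Group 1 holds each record without leading whitespace or trailing newline.
$records = $matches[1];
print_r($records);
```

The expanded, indented form of the pattern would additionally need the x modifier so that the whitespace inside it is ignored.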

score 0 · Accepted Answer

This should do it, with the s flag turned on:

(\[trace-[0-9]+\].*?<\/(?:reply|request)>)

Live demo
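
In PHP this might look like the sketch below, which wraps the pattern in / delimiters and appends the s modifier. The names $pattern, $matches and $traces are illustrative. Note the assumption baked into this approach: a record ends at the first </reply> or </request> after its tag, so a message that itself contains one of those closing tags would be cut short.

```php
<?php
// Sample log from the question, inlined so the sketch runs on its own.
$text = <<<'LOG'
[trace-123] <request>This is a log line</request>
[trace-124] <reply>This is another log line

this is part of "[trace-124]" still.</reply>
[trace-125] <request>final log line.</request>
LOG;

// s modifier: "." also matches newlines.
// .*? is lazy, so each match stops at the first closing tag it reaches
// instead of running on to the last one in the file.
$pattern = '/(\[trace-[0-9]+\].*?<\/(?:reply|request)>)/s';

preg_match_all($pattern, $text, $matches);

$traces = $matches[1];
print_r($traces);
```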

score 0 · Accepted Answer

The . symbol matches any character except the newline \n. You can try replacing it with (.|\s):

#\[trace-[0-9]*+\](.|\s)*#

Note: you can use non-capturing parentheses (?: ) here.

More simply, add the s flag:

#\[trace-[0-9]*+\].*#s
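
To see what the s flag changes on the sample data, here is a small comparison sketch. The first two patterns are the ones discussed in this thread. The third, with a lazy .*? and a lookahead for the next tag, is not part of the answer above; it is only one possible way to keep the greedy .* from swallowing every record, and it assumes each record starts on its own line.

```php
<?php
// Sample log from the question, inlined so the sketch runs on its own.
$text = <<<'LOG'
[trace-123] <request>This is a log line</request>
[trace-124] <reply>This is another log line

this is part of "[trace-124]" still.</reply>
[trace-125] <request>final log line.</request>
LOG;

// Without s: "." stops at each newline, so multi-line records are cut
// and the quoted "[trace-124]" inside the reply starts a match of its own.
$lineOnly = preg_match_all('#\[trace-[0-9]*+\].*#', $text, $a);

// With s: "." crosses newlines, and the greedy .* runs to the end of the
// text, so everything lands in a single match.
$dotAll = preg_match_all('#\[trace-[0-9]*+\].*#s', $text, $b);

// Assumed variation: lazy .*? that stops before a line break followed by
// the next "[trace-" tag, or at the very end of the text (\z).
$bounded = preg_match_all('#\[trace-[0-9]*+\].*?(?=\R\[trace-|\z)#s', $text, $c);

echo "$lineOnly $dotAll $bounded\n"; // prints "4 1 3"
print_r($c[0]);
```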
