string - How to use sed/grep to extract text between two words?

Question

I am trying to output a string that contains everything between two words of a string.

Input:

"Here is a String"

Output:

"is a"

Using

sed -n '/Here/,/String/p'

includes the endpoints, but I don't want to include them.

score 211 · Accepted Answer

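GNU grep also supports positive and negative lookahead and lookbehind when run with -P (Perl-compatible regular expressions). In your case, the command would be: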

echo "Here is a string" | grep -o -P '(?<=Here).*(?=string)'

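If Here and string occur more than once, you can choose whether to match from the first Here to the last string, or to match each pair individually. In regex terms, these are called a greedy match (the first case) and a non-greedy match (the second case).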

$ echo 'Here is a string, and Here is another string.' | grep -oP '(?<=Here).*(?=string)' # Greedy match
 is a string, and Here is another 
$ echo 'Here is a string, and Here is another string.' | grep -oP '(?<=Here).*?(?=string)' # Non-greedy match (Notice the '?' after '*' in .*)
 is a 
 is another

score 140 · Accepted Answer

sed -e 's/Here\(.*\)String/\1/'

answered 2012-11-06T00:14:09.013

score 84 · Accepted Answer

The accepted answer does not remove text that may come before Here or after String. This will:

sed -e 's/.*Here\(.*\)String.*/\1/'

The main difference is the addition of .* immediately before Here and immediately after String.
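To make the difference concrete, here is a small sketch that runs both substitutions on the same line. The sample input with extra text on either side (foo and bar) is made up for illustration; the brackets in the output only make the surrounding spaces visible.

```shell
input='foo Here is a String bar'

# Without the surrounding .*, only the matched part is replaced,
# so "foo " and " bar" survive in the result.
partial=$(printf '%s\n' "$input" | sed -e 's/Here\(.*\)String/\1/')

# With .* on both sides, the whole line is replaced by the captured group.
full=$(printf '%s\n' "$input" | sed -e 's/.*Here\(.*\)String.*/\1/')

printf '[%s]\n' "$partial"   # [foo  is a  bar]
printf '[%s]\n' "$full"      # [ is a ]
```

Note that the captured group still carries the spaces that sat next to the markers.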

score 46 · Accepted Answer

You can strip the strings in Bash alone:

$ foo="Here is a String"
$ foo=${foo##*Here }
$ echo "$foo"
is a String
$ foo=${foo%% String*}
$ echo "$foo"
is a
$

And if you have a GNU grep that includes PCRE, you can use zero-width assertions:

$ echo "Here is a String" | grep -Po '(?<=(Here )).*(?= String)'
is a

score 26 · Accepted Answer

If you have a long file with many multi-line occurrences, it is useful to number the lines first:

cat -n file | sed -n '/Here/,/String/p'
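As a rough illustration of what this produces, the sketch below builds a small throwaway file (its contents are invented for the example) and runs the same pipeline on it. The range prints whole lines, marker lines included, each prefixed with its line number from cat -n.

```shell
# Hypothetical sample file, created only for this demonstration.
tmpfile=$(mktemp)
printf '%s\n' 'intro' 'Here begins' 'middle' 'the String ends' 'outro' > "$tmpfile"

# Number every line first, then keep only the Here..String range.
numbered=$(cat -n "$tmpfile" | sed -n '/Here/,/String/p')
printf '%s\n' "$numbered"

rm -f "$tmpfile"
```

With this sample, lines 2 through 4 are printed, so the numbers show where in the file each occurrence sits.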

score 9 · Accepted Answer

This might work for you (GNU sed):

sed '/Here/!d;s//&\n/;s/.*\n//;:a;/String/bb;$!{n;ba};:b;s//\n&/;P;D' file

This prints each stretch of text between the two markers (in this case Here and String) on a new line, and preserves any newlines within the text.
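The one-liner is dense, so here is my reading of it, spelled out one command per line with comments. This is a sketch that assumes GNU sed; the wrapper name extract_between and the two-line sample input are made up for illustration.

```shell
extract_between() {
  # Assumes GNU sed: relies on \n in the replacement text and on the
  # empty regex // reusing the most recently used regular expression.
  sed '
    # Skip lines with no start marker.
    /Here/!d
    # Put a newline after the start marker, then drop everything up to it.
    s//&\n/
    s/.*\n//
    :a
    # Once the end marker is in the pattern space, jump to the final step.
    /String/bb
    # Otherwise print this line, load the next one, and check again.
    $!{
      n
      ba
    }
    :b
    # Put a newline before the end marker and print only the part before it.
    s//\n&/
    P
    # Discard what was printed and rerun the script on the remainder.
    D
  ' "$@"
}

# The markers sit on different lines in this sample.
result=$(printf '%s\n' 'foo Here is' 'more String bar' | extract_between)
printf '%s\n' "$result"
```

Because the final D restarts the script on whatever follows the end marker, a second Here later on the same line is picked up as well.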

score 8 · Accepted Answer

All of the above solutions have a flaw when the last search string is repeated elsewhere in the string. I have found it best to write a bash function.

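    function str_str {
      local str
      # Drop everything up to and including the first occurrence of $2.
      str="${1#*"$2"}"
      # Drop everything from the first remaining occurrence of $3 onward.
      str="${str%%"$3"*}"
      # Print the result without a trailing newline.
      printf '%s' "$str"
    }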

    # test it ...
    mystr="this is a string"
    str_str "$mystr" "this " " string"
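The repeated-marker case comes down to which parameter expansion operator trims the tail. A minimal sketch, using a made-up input in which the end marker appears twice:

```shell
s='Here is a String and another String'

tmp=${s#*Here }          # shortest prefix match: strip through the first "Here "
first=${tmp%% String*}   # longest suffix match: cut at the FIRST " String"
last=${tmp% String*}     # shortest suffix match: cut at the LAST " String"

printf '%s\n' "$first"   # is a
printf '%s\n' "$last"    # is a String and another
```

Choosing %% stops at the nearest end marker, while % runs on to the last one, which mirrors the non-greedy and greedy regex matches discussed earlier in the thread.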

score 4 · Accepted Answer

You can use \1 (see http://www.grymoire.com/Unix/Sed.html#uh-4):

echo "Hello is a String" | sed 's/Hello\(.*\)String/\1/g'

The contents inside the parentheses are stored as \1.
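Groups are numbered by the order of their opening parentheses, so more than one can be captured and reused. A small sketch with three groups, rearranged purely to show the numbering:

```shell
# \1 = "Hello", \2 = " is a ", \3 = "String"; the replacement reorders them.
swapped=$(printf '%s\n' 'Hello is a String' | sed 's/\(Hello\)\(.*\)\(String\)/\3\2\1/')
printf '%s\n' "$swapped"   # String is a Hello
```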
