regex - 2番目に一致した正規表現をsedで表示するにはどうすればよいですか

Question

このテキストがあるとします

233-CO のコードは、45-DFG とこの 45-GH の主な理由です。

これで、と\s[0-9]+-\w+に一致する正規表現ができました。233-CO45-DFG45-GH

3 番目の一致だけを表示するにはどうすればよい45-GHですか?

sed -re 's/\s[0-9]+-\w+/\3/g' file.txt

どこ\3に 3 番目の正規表現の一致があるはずです。

score 2 · Accepted Answer

使用は必須sedですか？grep配列を使用して、でそれを行うことができます：

text="The code for 233-CO is the main reason for 45-DFG and this 45-GH"
matches=( $(echo "$text" | grep -o -m 3 '\s[0-9]\+-\w\+') ) # store first 3 matches in array
echo "${matches[0]} ${matches[2]}" # prompt first and third match

score 0 · Accepted Answer

awkが受け入れられた場合、awk onlinerがあり、取得したい一致番号を指定すると、一致したstrが表示されます。

awk -vn=$n '{l=$0;for(i=1;i<n;i++){match(l,/\s[0-9]+-\w+/,a);l=substr(l,RSTART+RLENGTH);}print a[0]}' file

テスト

kent$  echo $STR     #so we have 7 matches in str                                                                                                  
The code for 233-CO is the main reason for 45-DFG and this 45-GH,foo 004-AB, bar 005-CC baz 006-DDD and 007-AWK

kent$  n=6       #now I want the 6th match

#here you go:
kent$   awk -vn=$n '{l=$0;for(i=1;i<=n;i++){match(l,/\s[0-9]+-\w+/,a);l=substr(l,RSTART+RLENGTH);}print a[0]}' <<< $STR
 006-DDD

score 0 · Accepted Answer

With grep for matching and sed for printing the occurrence:

$ egrep -o '\b[0-9]+-\w+' file | sed -n '1p'
233-CO

$ egrep -o '\b[0-9]+-\w+' file | sed -n '2p'
45-DFG

$ egrep -o '\b[0-9]+-\w+' file | sed -n '3p'
45-GH

Or with a little awk passing the occurrence to print using the variable o:

$ awk -v o=1 '{for(i=0;i++<NF;)if($i~/[0-9]+-\w+/&&j++==o-1)print $i}' file
233-CO

$ awk -v o=2 '{for(i=0;i++<NF;)if($i~/[0-9]+-\w+/&&j++==o-1)print $i}' file
45-DFG

$ awk -v o=3 '{for(i=0;i++<NF;)if($i~/[0-9]+-\w+/&&j++==o-1)print $i}' file
45-GH

score 0 · Accepted Answer

これはうまくいくかもしれません（GNU sed）：

sed -r 's/\b[0-9]+-[A-Z]+\b/\n&\n/3;s/.*\n(.*)\n.*/\1/' file

s/\b[0-9]+-[A-Z]+\b/\n&\n/3\n問題の 3 番目 (n) のパターンの先頭と末尾に (改行) を追加します。
s/.*\n(.*)\n.*/\1/パターンの前後のテキストを削除する

score 0 · Accepted Answer

パターンの最後の出現を見つけるには、これを使用できます。

$ sed -re 's/.*\s([0-9]+-\w+).*/\1/g' file
45-GH

regex - 2番目に一致した正規表現をsedで表示するにはどうすればよいですか

5 に答える 5

Related

Reference