r - 文字列からの文字抽出

Question

指定された文字までのすべての文字をどのように抽出しますか? 例として、「。」の前のすべてを抽出したいと思います。（限目）：

a<-c("asdasd.sss","segssddfge.sss","se.sss")

戻りたい:

asdasd segssddfge se

私は試した：

substr(a,1,".")

しかし、うまくいかないようです。

何か案は？

score 7 · Accepted Answer

非常に基本的なアプローチは次のとおりです。

sapply(strsplit(a, "\\."), `[[`, 1)
# [1] "asdasd"     "segssddfge" "se"

そしてもう一つ：

sub(".sss", "", a, fixed = TRUE)
# [1] "asdasd"     "segssddfge" "se" 
## OR sub("(.*)\\..*", "\\1", a) 
## And possibly other variations

score 4 · Accepted Answer

使用sub:

# match a "." (escape with "\" to search for "." as a normal "." 
# means "any character") followed by 0 to any amount of characters
# until the end of the string and replace with nothing ("")
sub("\\..*$", "", a)

subtrandを使用しgregexprます (1 つしかなく.、ベクトル内のすべての文字列に明確な一致があると仮定します)。

# get the match position of a "." for every string in "a" (returns a list)
# unlist it and get the substring of each from 1 to match.position - 1
substr(a, 1, unlist(gregexpr("\\.", a)) - 1)

score 2 · Accepted Answer

ここで使用する試みgsub

gsub(pattern='(.*)[.](.*)','\\1', c("asdasd.sss","segssddfge.sss","se.sss"))
[1] "asdasd"     "segssddfge" "se"

r - 文字列からの文字抽出

3 に答える 3

Related

Reference