string - R の階層に基づいて単語をグループ化する

翻译自：https://stackoverflow.com/questions/18706371 2013-09-09T20:20:03.797

169 次

例のように、単語のベクトルで階層を取得したいと思います。

# Start (in reality these will not be right next to each other)

words <- c("hello-world", "hello", "string", "sub-string", "custom-fields", 
           "custom", "hi-hat", "hat") 

# Result

highlevel <- c("hello-world", "sub-string", "custom-fields", "hi-hat")
lowerlevel <- c("hello", "string", "custom", "hat")

実際には、ビッグデータに直面することになり、これらをグループ化する効率的な方法を探しています。できればそちらもリンクさせていただきたいです。目標は、最初に高レベルの単語を検索し、見つからない場合は低レベルの単語を探すことです。

アイデア？

string - R の階層に基づいて単語をグループ化する

1 に答える 1

Related

Reference