I have a list of data frames on which I want to use table(). The list looks like this:
pronouns <- data.frame(pronounciation = c("juː","juː","juː","ju","ju","jə","jə","hɪm","hɪm","hɪm", "həm","ðɛm"), words = c("you","you","you","you","you","you","you","him","him","him","him","them"))
articles <- data.frame(pronounciation = c("ðiː","ði","ði","ðə","ðə","ði","ðə","eɪ","eɪ","æɪ","æɪ","eɪ","eɪ","eɪ","e"), words = c("the","the","the","the","the","the","the","a","a","a","a","a","a","a","a"))
numbers <- data.frame(pronounciation = c("wʌn","wʌn","wʌn","wʌn","wan","wa:n","tuː","tuː","tuː","tuː","tu","tu","tuː","tuː","θɹiː"), words = c("one","one","one","one","one","one","two","two","two","two","two","two","two","two","three"))
ls <- list(pronouns, articles, numbers)
ls[[1]]
pronounciation words
1 juː you
2 juː you
3 juː you
4 ju you
5 ju you
6 jə you
7 jə you
8 hɪm him
9 hɪm him
10 hɪm him
11 həm him
12 ðɛm them
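To make the starting point concrete, here is a small sketch of what table() already gives for one of these data frames. The data frame is rebuilt inline so the snippet runs on its own; the names `word_counts` and `cross` are only for illustration.

```r
# Rebuild the first data frame so the snippet is self-contained.
pronouns <- data.frame(
  pronounciation = c("juː","juː","juː","ju","ju","jə","jə","hɪm","hɪm","hɪm","həm","ðɛm"),
  words = c("you","you","you","you","you","you","you","him","him","him","him","them")
)

# Counts per word only: these are the totals wanted in the result.
word_counts <- table(pronouns$words)

# Cross-tabulation: rows are pronunciations, columns are words.
cross <- table(pronouns$pronounciation, pronouns$words)

print(word_counts)
print(cross)
```

The per-word totals come straight from the one-way table; the missing piece is picking, within each column of the cross-tabulation, the row with the highest count.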
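From this list of data frames, I want to extract a contingency table of $words using table(), while at the same time selecting the most common pronunciation for each word. The desired result is in ls_out: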
pronouns_out <- data.frame(pronounciation = c("juː","hɪm","ðɛm"), words = c("you","him","them"), occurence = c(7,4,1))
articles_out <- data.frame(pronounciation = c("ði","eɪ"), words = c("the","a"), occurence = c(7,8))
numbers_out <- data.frame(pronounciation = c("wʌn","tuː","θɹiː"), words = c("one","two","three"), occurence = c(6,8,1))
ls_out <- list(pronouns_out, articles_out, numbers_out)
ls_out[[1]]
pronounciation words occurence
1 juː you 7
2 hɪm him 4
3 ðɛm them 1
If two or more pronunciations are equally frequent (such as ði and ðə in ls[[2]]), one of them should be chosen at random.
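A rough base R sketch of the behaviour described, including the random tie-break, might look like the following. The helper name `most_common_pron` is hypothetical, and the snippet assumes every data frame has the columns `pronounciation` and `words` as above. It is only one possible approach, not necessarily the best one.

```r
# Rebuild the data frame that contains a tie so the snippet is self-contained.
articles <- data.frame(
  pronounciation = c("ðiː","ði","ði","ðə","ðə","ði","ðə","eɪ","eɪ","æɪ","æɪ","eɪ","eɪ","eɪ","e"),
  words = c("the","the","the","the","the","the","the","a","a","a","a","a","a","a","a")
)

# Hypothetical helper: collapse one data frame to one row per word.
most_common_pron <- function(df) {
  # Keep words in order of first appearance, as in the desired output.
  words <- unique(as.character(df$words))
  rows <- lapply(words, function(w) {
    prons <- as.character(df$pronounciation[df$words == w])
    counts <- table(prons)
    # All pronunciations that share the highest frequency.
    top <- names(counts)[counts == max(counts)]
    # Pick one candidate at random; indexing via sample.int() also
    # behaves correctly when there is only a single candidate.
    chosen <- top[sample.int(length(top), 1)]
    data.frame(pronounciation = chosen, words = w,
               occurence = length(prons), stringsAsFactors = FALSE)
  })
  do.call(rbind, rows)
}

set.seed(1)  # only to make the random tie-break reproducible
articles_out <- most_common_pron(articles)
print(articles_out)
```

Applied to the whole list, this would be `lapply(ls, most_common_pron)`. For "the", either ði or ðə can come back depending on the seed, since both occur three times.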
Any advice on this would be much appreciated.