Below is a partial screenshot of an American English corpus. What do the column names in the first row mean, for example CDcount?

As stated in the title.

They are just statistical measures.
FREQcount. This is the number of times the word appears in the corpus (i.e., out of a total of 51 million words).
CDcount. This is the number of films in which the word appears (i.e., it has a maximum value of 8,388).
FREQlow. This is the number of times the word appears in the corpus starting with a lowercase letter. This allows users to further match their stimuli.
CDlow. This is the number of films in which the word appears starting with a lowercase letter.
SUBTLWF. This is the word frequency per million words. It is the measure you would preferably use in your manuscripts, because it is a standard measure of word frequency independent of the corpus size. It is given with two-digit precision, in order not to lose precision of the frequency counts.
Lg10WF. This value is based on log10(FREQcount+1) and has four-digit precision. Because FREQcount is based on 51 million words, the following conversions apply for SUBTLEXUS:
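
The relationship between FREQcount and the two derived measures can be sketched in Python. This is an illustration, not the SUBTLEXus authors' own code: it assumes the corpus holds exactly 51 million words (the rounded figure quoted above), so values in the published file may differ slightly, and the function names `subtlwf` and `lg10wf` and the example count are made up for this sketch.

```python
import math

# Assumption: the corpus size is taken as exactly 51 million words,
# the rounded figure given in the column descriptions.
CORPUS_SIZE_MILLIONS = 51

def subtlwf(freq_count: int) -> float:
    """Frequency per million words, kept to two decimal places."""
    return round(freq_count / CORPUS_SIZE_MILLIONS, 2)

def lg10wf(freq_count: int) -> float:
    """log10(FREQcount + 1), kept to four decimal places."""
    return round(math.log10(freq_count + 1), 4)

# Hypothetical count, not taken from the real corpus file.
example_count = 5100
print(subtlwf(example_count))  # 100.0
print(lg10wf(example_count))
```

The `+ 1` inside the logarithm keeps the value defined for a count of zero, where it gives 0.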

http://expsy.ugent.be/subtlexus/

Follow-up question:

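People commonly say you need to master 2,700 words, which are the words that appear most often in American TV shows. How can I get those 2,700 words listed? Should I select the first column, FREQcount, and sort it in ascending order? Thanks for your reply.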
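
One possible way to build such a list, as a rough sketch: sort by FREQcount in descending rather than ascending order, so that the most frequent words come first, and keep the first 2,700 rows. The code below assumes the data has been exported as tab-separated text with a `Word` column next to `FREQcount`; the `Word` column name, the delimiter, the helper name `top_words`, and the three sample rows are assumptions for illustration, not taken from the real file.

```python
import csv
import io

def top_words(rows, n=2700):
    """Return the n most frequent words, highest FREQcount first."""
    ranked = sorted(rows, key=lambda r: int(r["FREQcount"]), reverse=True)
    return [r["Word"] for r in ranked[:n]]

# Made-up sample standing in for the real file; with the actual export
# you would pass open(path, newline="") to csv.DictReader instead.
sample = io.StringIO("Word\tFREQcount\nthe\t300\nzebra\t2\nyou\t500\n")
rows = list(csv.DictReader(sample, delimiter="\t"))

print(top_words(rows, n=2))  # ['you', 'the']
```

The same result can be had in a spreadsheet by sorting the FREQcount column from largest to smallest and copying the first 2,700 rows.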
