corpus / hellog～英語史ブログ

最終更新時間: 2026-07-15 01:27

2012-01-08 Sun

■ #986. COCA の "WORD AND PHRASE . INFO" [coca][corpus][dictionary][synonym][collocation][semantic_prosody][intensifier][web_service]

　COCA ( Corpus of Contemporary American English ) を運営する Mark Davies 氏が，年末に，COCAベースで語に関する諸情報を一覧できるサービス WORD AND PHRASE . INFO を公開した．語（lemma 頻度で上位60,000語以内に限る）を入力すると，ジャンルごとの生起頻度やそのコンコーダンス・ラインはもとより，WordNet に基づいた定義や類義語群までが画面上に現われる．ほとんどの項目がクリック可能で，さらなる機能へとアクセスできる．インターフェースが直感的で使いやすい．
　類義語研究や collocation 研究には相当に役立つ仕様になったのではないか．例えば，semantic_prosody を扱った[2011-03-12-1]の記事「#684. semantic prosody と文法カテゴリー」で，強意語 utterly, absolutely, perfectly, totally, completely, entirely, thoroughly についての研究を紹介したが，WORD AND PHRASE . INFO で utterly を入力すれば，これらの類義語群が左下ウィンドウに一覧される．あとは，各語をクリックしてゆくだけで，頻度や collocation の詳細が得られる．このような当たりをつけるのに効果を発揮しそうだ．

utterly by WORD AND PHRASE . INFO

Referrer (Inside): [2012-03-03-1]

Rank	Under 35		Over 35
Rank	Word	χ²	Word	χ²
1	mum	1409.3	yes	2365.0
2	fucking	1184.6	well	1059.8
3	my	762.4	mm	895.2
4	mummy	755.2	er	773.8
5	like	745.2	they	682.2
6	na as in wanna and gonna	712.8	said	538.3
7	goes	606.6	says	443.1
8	shit	410.1	were	385.8
9	dad	403.7	the	352.2
10	daddy	380.1	of	314.6
11	me	371.9	and	224.7
12	what	357.3	to	211.2
13	fuck	330.1	mean	155.0
14	wan as in wanna	320.6	he	144.0
15	really	277.0	but	139.0
16	okay	257.0	perhaps	136.0
17	cos	254.4	that	131.3
18	just	251.8	see	122.1
19	why	240.0	had	118.3

Rank	Characteristically male		Characteristically female
Rank	Word	χ²	Word	χ²
1	fucking	1233.1	she	3109.7
2	er	945.4	her	965.4
3	the	698.0	said	872.0
4	year	310.3	n't	443.9
5	aye	291.8	I	357.9
6	right	276.0	and	245.3
7	hundred	251.1	to	198.6
8	fuck	239.0	cos	194.6
9	is	233.3	oh	170.2
10	of	203.6	Christmas	163.9
11	two	170.3	thought	159.7
12	three	168.2	lovely	140.3
13	a	151.6	nice	134.4
14	four	145.5	mm	133.8
15	ah	143.6	had	125.9
16	no	140.8	did	109.6
17	number	133.9	going	109.0
18	quid	124.2	because	105.0
19	one	123.6	him	99.2
20	mate	120.8	really	97.6
21	which	120.5	school	96.3
22	okay	119.9	he	90.4
23	that	114.2	think	88.8
24	guy	108.6	home	84.0
25	da	105.3	me	83.5

corpus - hellog～英語史ブログ

■ #986. COCA の "WORD AND PHRASE . INFO" [coca][corpus][dictionary][synonym][collocation][semantic_prosody][intensifier][web_service]

■ #982. アメリカ英語の口語に頻出する flat adverb [adverb][adjective][register][corpus][ame_bre][americanisation][colloquialisation][grammar][flat_adverb]

■ #956. COCA N-Gram Search [cgi][web_service][coca][corpus][collocation][n-gram]

■ #955. 完璧な語呂合わせの2項イディオム [binomial][rhyme][corpus][coca][collocation][euphony][n-gram][suffix][proverb]

■ #954. 脚韻を踏む2項イディオム [binomial][rhyme][corpus][coca][collocation][euphony][n-gram][suffix][compound]

■ #953. 頭韻を踏む2項イディオム [binomial][alliteration][corpus][coca][collocation][euphony][n-gram]

■ #930. a large number of people の数の一致 [agreement][number][syntax][bnc][corpus]

■ #914. BNC による語彙の世代差の調査 [bnc][corpus][statistics][lltest][interjection]

■ #913. BNC による語彙の男女差の調査 [bnc][corpus][statistics][lltest][interjection][gender_difference]

■ #880. いかにもイギリス英語，いかにもアメリカ英語の単語 [corpus][ame_bre][ame][bre][flob][frown][text_tool][keyword]

■ #872. -ick or -ic [suffix][johnson][webster][corpus][google_books][spelling][n-gram]

■ #868. EDD Online [dialect][web_service][corpus][lmode][lexicography][edd][dictionary]

■ #854. 船や国名を受ける代名詞 she (3) [personal_pronoun][she][gender][personification][political_correctness][corpus][statistics][lexical_diffusion]

■ #845. 現代英語の語彙の起源と割合 [lexicology][loan_word][statistics][bnc][corpus]

■ #799. 海賊複数の <z> [plural][netspeak][suffix][corpus][z][alphabet]

■ #773. PPCMBE と COHA の比較 [corpus][coha][ppcmbe][lmode][adjective][comparison][inflection][representativeness]

■ #771. 名詞の単数形と複数形の頻度 [corpus][statistics][plural][countability]

■ #757. decline + 蜍募錐隧杣syntax] [gerund][bnc][corpus]

■ #738. inclusive superlative [superlative][contamination][syntax][corpus][ppceme]

■ #737. 構文の contamination [blend][contamination][syntax][superlative][bnc][corpus]