lexicology / hellog～英語史ブログ

最終更新時間: 2026-07-10 19:54

2012-07-01 Sun

■ #1161. 英語と日本語における語彙の音節数別割合 [lexicology][statistics][syllable][corpus][japanese]

　昨日の記事「#1160. MRC Psychological Database より各種統計を視覚化」 ([2012-06-30-1]) の (3) で，英語語彙を音節数により分別して，それぞれの頻度を出した．それによると，対象となった92767語の語彙全体における1音節語，2音節語，3音節語，4音節語の占める割合は，それぞれ13.46%，35.40%，29.91%，15.26%であり，合わせて94.03%に達する．とりわけ2音節語と3音節語を合わせて65.31%である．9万余という大規模な語彙で調査する限り，英語語彙の3分の2近くは2--3音節語であるということになる．
　一方，##348,349,355 の記事では，BNC や COLT のコーパスを用いて，最も頻度の高い数百語から数千語を対象に音節数調査を行なった．調査対象となる語彙の規模は格段に小さく，それに従って音節数別の割合も変わる．1音節語と2音節語が優勢であり，最大の6000語規模の調査でもこの2種類だけで68.7%を占める（「#349. BNC Word Frequency List による音節数の分布調査 (2)」 ([2010-04-11-1]) のグラフを参照）．対象とする語彙規模により，優勢な占有率を示す音節数が変動することがわかるが，全般的に，英語語彙においては1--3音節語が主要であることは間違いないだろう．
　では，日本語の語彙について，音節数別の割合はどうだろうか．加藤ほか (80) では，林大氏による『日本語アクセント辞典』の見出し語形に基づく拍数の分布の調査結果が要約されている．辞典の見出し語形であるから対称語彙は数万語の規模と思われる．以下のような結果が出た．

1拍 2拍 3拍 4拍 5拍 6拍 7拍 8拍 9拍 10拍計

0.3 4.8 22.7 38.8 17.7 11.0 3.3 1.2 0.2 0.1 100

　割合のピークは4拍語にあり，その前後の3拍語と5拍語を合わせて79.2%，6拍語を加えれば90.2%になる．英語の語彙の主たる構成要素が1--3音節語とすれば，日本語の語彙の主たる構成要素は3--5拍語となる．音節数でみる限り，英単語は相対的に短く，日本語単語は相対的に長いことがよくわかる．
　両言語間の際だった差異は，音韻数の差と音節構造の差に起因するといってよいだろう．音韻数については，[2012-02-12-1]の記事「#1021. 英語と日本語の音素の種類と数」で見たとおり，著しい差がある．また，音節構造については，日本語の音節がほぼ「子音＋母音」の1形式だけであるのに対して，英語の音節は，[2012-02-14-1]の記事「#1023. 日本語の拍の種類と数」で示唆したとおり，数万形式がある．
　日本語の語彙は，2拍語を基本としていると考えられる．和語でも漢語でも2±1拍語が多く，語彙の膨張に従って，その結合が増え，結果として4±1拍語が主流となってきた経緯がある．洋語についても，優勢な4拍語に合わせて「マスコミュニケーション」→「マスコミ」，「ハンガーストライキ」→「ハンスト」，「エンジンストップ」→「エンスト」と省略されることが多い．2拍語を基本とした日本語語彙の成立と，その後の発展については，小松 (48--62) が詳しい．

　・加藤彰彦，佐治圭三，森田良行編　『日本語概説』　おうふう，1989年．
　・小松秀雄　『日本語の歴史　青信号はなぜアオなのか』　笠間書院，2001年．

1拍	2拍	3拍	4拍	5拍	6拍	7拍	8拍	9拍	10拍	計
0.3	4.8	22.7	38.8	17.7	11.0	3.3	1.2	0.2	0.1	100

POS	FREQ	%
noun	7326	57.04%
verb	2501	19.47%
adjective	2420	18.84%
adverb	291	2.27%
preposition	68	0.53%
conjunction	21	0.16%
pronoun	15	0.12%
interjection	37	0.29%
past participle	57	0.44%
others	108	0.84%

Class A	Class B	Class C
affective	affectionate	academic
appositive	apposite	aesthetic
behavioural	mannerly	American
bibliographic	bookish	artistic
cardiac	hearty	British
causal	effectual	Christian
ceremonial	ceremonious	cinematic
commemorative	memorable	civil
conceptual	thoughtful	constitutional
connective	coherent	conventional
consonantal	consonant	critical
continental	continent	demonstrative
corrective	correct	diplomatic
cultural	cultured	dramatic
deductive	seductive	emotional
dental	toothsome	English
devotional	devout	ethical
doctrinal	docile	formal
durative	durable	French
elective	eligible	grammatical
entrepreneurial	enterprising	historical
evaluative	valid	human
experiential	experienced	legal
factual	accurate	literary
fiduciary	faithful	logical
financial	lucrative	Marxian
genealogical	genteel	moral
generative	degenerate	musical
generic	generous	parliamentary
governmental	ruly	philosophical
gustatory	tasteful	poetic
inflexional	flexible	professional
interrogative	inquisitive	rational
intonational	tuneful	religious
juridical	just	royal
legislative	legitimate	sanitary
manual	handy	scientific
mental	sane	social
methodological	methodical	spiritual
modal	modish	theatrical
morphological	shapely
nutritional	nutritious
observational	observant
olfactory	savoury
optical	sightly
ostensive	ostentatious
palatal	palatable
pecuniary	pecunious
pedagogic	pedantic
penitential	penitent
perceptual	perceptive
pictorial	picturesque
residential	homely
retributive	rewarding
semantic	significant
sensory	sensitive
sociological	sociable
stylistic	stylish
supervisory	watchful
syntactic	orderly
tactile	tactful
temporal	timely
theological	godly
urban	urbane
verbal	verbose
verificatory	veracious
visual	conspicuous
vocalic	equivocal
vocative	provocative
volitional	willing

lexicology - hellog～英語史ブログ

■ #1161. 英語と日本語における語彙の音節数別割合 [lexicology][statistics][syllable][corpus][japanese]

■ #1160. MRC Psychological Database より各種統計を視覚化 [lexicology][statistics][syllable][corpus]

■ #1159. MRC Psycholinguistic Database Search [cgi][web_service][lexicology][frequency][statistics]

■ #1158. MRC Psycholinguistic Database [web_service][lexicology][frequency][statistics]

■ #1148. 古英語の豊かな語形成力 [oe][lexicology][derivation][compound][compounding][word_formation][productivity][kenning]

■ #1132. 英単語の品詞別の割合 [lexicology][corpus][statistics]

■ #1129. 印欧祖語の分岐は紀元前5800--7800年？ [indo-european][archaeology][glottochronology][family_tree][lexicology]

■ #1128. glottochronology [glottochronology][history_of_linguistics][family_tree][lexicology]

■ #1103. GSL による Zipf's law の検証 [lexicology][statistics][frequency][zipfs_law][corpus]

■ #1100. Farsi の形容詞区分の通時的な意味合い [adjective][loan_word][lexicology][suffix][semantic_change][prediction_of_language_change][register][lexical_stratification]

■ #1099. 記述の形容詞と評価の形容詞 [adjective][loan_word][lexicology][lexical_stratification]

■ #1067. 初期近代英語と現代日本語の語彙借用 [lexicology][loan_word][borrowing][emode][renaissance][latin][japanese][linguistic_imperialism][lexical_stratification]

■ #989. 2011年の英語流行語大賞 [lexicology][ads][woy]

■ #985. 中英語の語彙の起源と割合 [lexicology][loan_word][statistics][me][sggk]

■ #912. 語の定義がなぜ難しいか (3) [morphology][terminology][word_formation][word][dictionary][lexicology][hapax_legomenon][ghost_word]

■ #879. Algeo の新語ソース調査から示唆される通時的傾向 [pde][word_formation][loan_word][statistics][lexicology][neologism]

■ #878. Algeo と Bauer の新語ソース調査の比較 [pde][word_formation][loan_word][statistics][lexicology][neologism]

■ #877. Algeo の現代英語の新語ソース調査 [pde][word_formation][loan_word][statistics][lexicology][neologism]

■ #876. 現代英語におけるかばん語の生産性は本当に高いか？ [blend][productivity][pde][pde_language_change][word_formation][statistics][lexicology]

■ #875. Bauer による現代英語の新語のソースのまとめ [loan_word][word_formation][lexicology][pde][pde_language_change][statistics][lexicology]