corpus / hellog (History of the English Language Blog)

Last updated: 2026-07-15 01:27

2014-04-15 Tue

■ #1814. Confirming the decline of the be perfect in the 18th--19th centuries with CLMET [perfect][clmet][corpus][syntax][be][auxiliary_verb][aspect][participle][lmode]

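In "#1653. The history of the be perfect" ([2013-11-05-1]), we saw that mutative verbs normally formed their perfect with be + past participle until the end of the 18th century. In the history of English, this be perfect is said to have gone into steady decline from around the end of the 18th century. CLMET3.0, introduced in "#1637. Investigating the distribution of between and betwixt in CLMET3.0" ([2013-10-20-1]), is a large balanced corpus of about 34 million words covering 1710--1920. It seems a well-suited resource for tracing this kind of language change, so I used it to check the decline of the be perfect.
This time I restricted the search to the seven mutative verbs taken up in the earlier post (arrive, become, come, fall, flee, grow; go) and collected examples from the corpus according to the three sub-periods of CLMET3.0 (1710--1780, 1780--1850, 1850--1920) and its six genre categories (Narrative fiction, Narrative non-fiction, Drama, Letters, Treatise, Other). The three period subcorpora are roughly equal in size, but the genre subcorpora are, as shown in the table in [2013-10-20-1], heavily skewed toward Narrative fiction, so the genre results need to be interpreted with caution. Below, (1)--(7) are stacked bar charts of the development for each verb, and (8) and (9) are stacked bar charts showing the shares by genre and by verb for the seven verbs combined. For (1)--(6), the Y-axis maximum has been set to the same value for ease of comparison. For the data file and frequency tables, see the source HTML.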
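The post does not say how the examples were retrieved, so the following is only a minimal sketch of one way to collect be perfect candidates for the seven verbs. The list of be forms, the participle spellings, the optional adverb slot, and the function name `count_be_perfect` are all my assumptions for illustration, not the query actually used, and every hit would still need manual checking (passives and adjectival uses will slip in).

```python
import re
from collections import Counter

# Forms of "be" to search for (an assumed list; contracted forms such as "'s"
# and archaic forms such as "art" are ignored in this sketch).
BE_FORMS = ["am", "is", "are", "was", "were", "be", "been", "being"]

# Past participles mapped to the seven verbs named in the post
# (the participle spellings are an assumption; variants like "fell" are not covered).
PARTICIPLES = {
    "arrived": "arrive",
    "become": "become",
    "come": "come",
    "fallen": "fall",
    "fled": "flee",
    "grown": "grow",
    "gone": "go",
}

# be form + optional single adverb (a small hypothetical set) + participle.
PATTERN = re.compile(
    r"\b(?:%s)\s+(?:(?:not|now|just|already)\s+)?(%s)\b"
    % ("|".join(BE_FORMS), "|".join(PARTICIPLES)),
    re.IGNORECASE,
)

def count_be_perfect(text):
    """Count be + participle candidates per verb in a plain-text string."""
    counts = Counter()
    for match in PATTERN.finditer(text):
        counts[PARTICIPLES[match.group(1).lower()]] += 1
    return counts

sample = "The coach is arrived, and my brother was not come; she has gone out."
print(count_be_perfect(sample))
```

Running the function once per period and genre subcorpus would give the kind of counts that feed the stacked bar charts; the have perfect ("has gone") is deliberately left unmatched here.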

Figure: Be Perfect with Seven Verbs

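The speed of the decline differs somewhat from verb to verb, but the overall impression is of a relatively gentle, steady decline rather than a sudden collapse. Note, however, that go in (7) did not fall off much during the Late Modern English period (as is clear from the survival of be gone as an idiom in Present-day English), and that, because its examples far outnumber those of the other verbs, it distorts to some degree the overall picture of the decline of the be perfect shown in (8) and (9).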

BRITISH		a	d	f	h	j	l	m	n	p	s	x	y	TOTAL
1600--49	files	0	10	0	0	0	10	0	0	10	0	0	0	30
1600--49	words	0	32,342	0	0	0	21,026	0	0	32,741	0	0	0	86,109
1650--99	files	0	10	11	10	10	10	21	10	0	10	75	10	177
1650--99	words	0	30,328	41,667	21,818	21,186	20,466	23,811	22,304	0	21,427	38,767	20,488	262,262
1700--49	files	0	10	11	10	11	10	14	10	0	10	77	10	173
1700--49	words	0	27,862	44,057	21,511	23,265	21,315	22,066	21,612	0	20,812	33,896	20,495	256,891
1750--99	files	10	10	10	10	10	10	20	10	0	10	70	11	181
1750--99	words	25,386	27,484	45,198	21,752	21,284	20,367	21,002	23,172	0	20,599	29,589	23,043	278,876
1800--49	files	10	10	10	10	11	10	10	10	0	10	25	10	126
1800--49	words	30,804	31,211	45,107	21,777	23,249	20,531	20,286	22,951	0	21,015	12,671	20,883	270,485
1850--99	files	10	10	10	10	10	10	10	10	0	10	26	10	126
1850--99	words	30,684	34,856	43,427	21,322	21,243	20,757	22,265	23,072	0	21,810	10,819	21,789	272,044
1900--49	files	10	11	10	10	10	10	10	10	0	10	29	10	130
1900--49	words	26,717	31,391	45,408	21,123	22,208	21,160	20,213	21,977	0	21,664	12,529	22,424	266,814
1950--99	files	10	11	10	10	10	10	13	10	0	10	28	10	132
1950--99	words	23,437	32,200	45,109	21,093	22,723	20,721	20,994	22,935	0	21,385	11,361	22,060	264,018
TOTAL	files	50	82	72	70	72	80	98	70	10	70	330	71	1,075
TOTAL	words	137,028	247,674	309,973	150,396	155,158	166,343	150,637	158,023	32,741	148,712	149,632	151,182	1,957,499
AMERICAN		a	d	f	h	j	l	m	n	p	s	x	y	TOTAL
1750--99	files	3	10	10	10	10	12	9	10	0	10	58	10	152
1750--99	words	9,214	29,980	38,980	21,271	21,896	41,177	23,541	22,265	0	20,668	27,860	21,315	278,167
1800--49	files	1	10	10	0	10	12	0	10	0	10	10	10	83
1800--49	words	2,822	40,568	44,676	0	21,476	33,409	0	37,107	0	20,904	20,739	20,695	242,396
1850--99	files	8	10	11	10	10	10	10	10	0	10	28	11	128
1850--99	words	24,480	32,721	44,394	21,056	22,436	28,506	20,547	21,994	0	21,311	11,361	23,419	272,225
1900--49	files	10	10	10	0	10	11	0	15	0	10	52	10	138
1900--49	words	30,460	52,514	53,430	0	21,661	21,607	0	22,802	0	20,984	25,021	20,731	269,210
1950--99	files	10	10	10	10	10	12	10	10	0	12	30	10	134
1950--99	words	29,563	31,037	44,382	21,051	22,109	25,517	22,617	23,069	0	25,623	11,961	21,654	278,583
TOTAL	files	32	50	51	30	50	57	29	55	0	52	178	51	635
TOTAL	words	96,539	186,820	225,862	63,378	109,578	150,216	66,705	127,237	0	109,490	96,942	107,814	1,340,581

I		1150--1250 (ME I)		1250--1350 (ME II)		1350--1420 (ME III)		1420--1500 (ME IV)
I		tokens	%	tokens	%	tokens	%	tokens	%
before V	ich	169	100	121	95	4	3	0	0
before V	I	0	0	6	5	135	97	253	100
before <h>	ich	171	100	105	97	3	2	0	0
before <h>	I	0	0	3	3	156	98	316	100
before C	ich	513	94	363	42	0	0	0	0
before C	I	33	6	494	58	1106	100	2043	100
EVERY		1150--1250 (ME I)		1250--1350 (ME II)		1350--1420 (ME III)		1420--1500 (ME IV)
EVERY		tokens	%	tokens	%	tokens	%	tokens	%
before V	everich	-		6	86	7	64	9	39
	everiche	-		1	14	0	0	0	0
	every	-		0	0	4	36	14	61
before <h>	everich	-		0	0	1	20	-
	everiche	-		1	100	1	20	-
	every	-		0	0	3	60	-
before C	everich	-		6	29	2	2	0	0
	everiche	-		10	48	2	2	0	0
	every	-		5	24	105	96	138	100
-LY		1150--1250 (ME I)		1250--1350 (ME II)		1350--1420 (ME III)		1420--1500 (ME IV)
-LY		tokens	%	tokens	%	tokens	%	tokens	%
before V	-lich	23	12	8	12	12	4	1	0
	-liche	162	87	51	77	23	8	21	5
	-ly	1	1	7	11	251	88	421	95
before <h>	-lich	13	18	7	21	1	2	0	0
	-liche	59	82	24	73	8	14	0	0
	-ly	0	0	2	6	49	84	76	100
before C	-lich	70	13	18	15	18	2	2	0
	-liche	468	85	93	77	39	5	23	2
	-ly	11	2	10	8	788	93	947	97

	shew forms	show forms	Total words
1710--1780	335	1,545	10,480,431
1780--1850	159	3,100	11,285,587
1850--1920	92	5,118	12,620,207

Period	Frequency	Corpus size
1710--1780	5 (5)	10,480,431 words
1780--1850	70 (18)	11,285,587
1850--1920	347 (6)	12,620,207

Genre	1710--1780	1780--1850	1850--1920
Narrative fiction	4,642,670 words	4,830,718	6,311,301
Narrative non-fiction	1,863,855	1,940,245	958,410
Drama	407,885	347,493	607,401
Letters	1,016,745	714,343	479,724
Treatise	1,114,521	1,692,992	1,782,124
Other	1,434,755	1,759,796	2,481,247
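To put a number on the skew toward Narrative fiction, the genre word counts in the table above can be turned into shares of each sub-period. This is a small sketch using only the figures from that table; the names `GENRE_WORDS` and `genre_shares` are mine. The column sums come out at 10,480,431, 11,285,587 and 12,620,207 words, matching the sub-period totals given in the other CLMET3.0 tables.

```python
# Word counts copied from the genre table above, one value per sub-period.
PERIODS = ["1710--1780", "1780--1850", "1850--1920"]
GENRE_WORDS = {
    "Narrative fiction": [4_642_670, 4_830_718, 6_311_301],
    "Narrative non-fiction": [1_863_855, 1_940_245, 958_410],
    "Drama": [407_885, 347_493, 607_401],
    "Letters": [1_016_745, 714_343, 479_724],
    "Treatise": [1_114_521, 1_692_992, 1_782_124],
    "Other": [1_434_755, 1_759_796, 2_481_247],
}

def genre_shares(table):
    """Return {genre: [percentage of each sub-period's total words]}."""
    totals = [sum(column) for column in zip(*table.values())]
    return {
        genre: [100 * n / total for n, total in zip(counts, totals)]
        for genre, counts in table.items()
    }

shares = genre_shares(GENRE_WORDS)
print("Genre".ljust(22), "  ".join(PERIODS))
for genre, row in shares.items():
    print(genre.ljust(22), "  ".join(f"{share:9.1f}%" for share in row))
```

Narrative fiction accounts for a little under half of the first two sub-periods and about half of the third, which is why raw genre-by-genre counts should be read with care.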

Sub-period	between	betwixt
1710--1780	4,869 words (464.58 wpm)	657 (62.69 wpm)
1780--1850	5,457 (483.54 wpm)	109 (9.66 wpm)
1850--1920	7,672 (607.91 wpm)	51 (4.04 wpm)
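The wpm (words per million) figures in the table above are raw counts normalized by sub-period size. As a sketch, assuming the sub-period sizes listed in the other CLMET3.0 tables (10,480,431, 11,285,587 and 12,620,207 words), the normalization can be reproduced as follows; the names `per_million`, `SIZES` and `HITS` are mine.

```python
# Sub-period sizes in words, taken from the other CLMET3.0 tables.
SIZES = {
    "1710--1780": 10_480_431,
    "1780--1850": 11_285_587,
    "1850--1920": 12_620_207,
}

# Raw counts copied from the between/betwixt table above.
HITS = {
    "1710--1780": {"between": 4869, "betwixt": 657},
    "1780--1850": {"between": 5457, "betwixt": 109},
    "1850--1920": {"between": 7672, "betwixt": 51},
}

def per_million(hits, corpus_size):
    """Normalize a raw count to occurrences per million words."""
    return hits / corpus_size * 1_000_000

for period, counts in HITS.items():
    cells = ", ".join(
        f"{word}: {per_million(n, SIZES[period]):.2f} wpm" for word, n in counts.items()
    )
    print(period, cells)
```

The same normalization applies to any of the raw counts on this page whenever sub-periods of different sizes are compared.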

	-o(u)r	-er(s)
E1 (1500--1569)	2	1
E2 (1570--1639)	3	5
E3 (1640--1710)	0	6

	LONGEST	LENGEST
O1	0	0
O2	0	2
O3	0	13
O4	0	3
M1	0	1
M2	0	0
M3	0	1
M4	0	1
E1	3	0
E2	4	0
E3	2	0

	LONGER	LENG(ER)
O1	0	1
O2	0	14
O3	0	45
O4	0	7
M1	0	14
M2	0	21
M3	11	26
M4	3	25
E1	11	6
E2	19	0
E3	46	0

	LONGER	LENG(ER)
CEECS1	31	6
CEECS2	37	0

■ #1814. Confirming the decline of the be perfect in the 18th--19th centuries with CLMET [perfect][clmet][corpus][syntax][be][auxiliary_verb][aspect][participle][lmode]

■ #1808. A tool for sorting ARCHER search results by period and genre (ARCHER Period-Genre Sorter) [cgi][web_service][corpus][archer][mode]

■ #1807. between and betwixt in ARCHER [spelling][corpus][archer][mode]

■ #1806. shew and show in ARCHER [spelling][corpus][archer][mode]

■ #1802. ARCHER 3.2 [corpus][archer][mode][frequency]

■ #1773. When the final ch was lost from ich, everich, and -lich [me][corpus][hc][phonetics][personal_pronoun][consonant][-ly]

■ #1752. interpretor → interpreter (2) [spelling][suffix][corpus][emode][hc][ppcme2][ppceme][archer][lc]

■ #1743. ICE Frequency Comparer [corpus][web_service][cgi][frequency][new_englishes][variety][ice]

■ #1739. AmE-BrE Diachronic Frequency Comparer [corpus][ame_bre][web_service][cgi][frequency][representativeness]

■ #1730. AmE-BrE 2006 Frequency Comparer [corpus][ame_bre][web_service][cgi][frequency][spelling]

■ #1716. shew and show (3) [spelling][corpus][clmet][representativeness]

■ #1712. as regards [preposition][conjunction][impersonal_verb][corpus][clmet]

■ #1669. When did longest replace lengest? [hc][corpus][adjective][comparison][i-mutation][analogy]

■ #1649. When did longer replace leng(er)? [hc][corpus][adjective][comparison][i-mutation][analogy]

■ #1637. Investigating the distribution of between and betwixt in CLMET3.0 [corpus][lmode][preposition][clmet]

■ #1626. Various interfaces to the Balanced Corpus of Contemporary Written Japanese (BCCWJ) [web_service][corpus][link][japanese]

■ #1621. The Middle English Grammar Corpus (MEG-C) [corpus][preposition][me_dialect]

■ #1567. Introducing several online corpora of English and Japanese [web_service][corpus][efl][link][japanese]

■ #1555. unbeknownst [phonetics][corpus][-st]

■ #1477. The Salamanca Corpus --- a corpus of Modern English dialects [corpus][emode][dialect][dialectology][caxton][popular_passage]