The Chinese/English Political Interpreting Corpus (CEPIC), with about 6.5 million word tokens in size, is designed for the study of Chinese/English political interpreting and translation. It consists of transcripts of speeches delivered by top political figures from Hong Kong, Beijing, Washington DC and London, as well as their translated/interpreted texts.
The main speech types of CEPIC include the reading of government reports such as policy addresses and budget speeches, Q&A at press conferences, parliamentary debates, as well as remarks delivered at bilateral meetings (For details, please refer to the section Basic Statistics). In particular, speeches in the Hong Kong subset were mostly interpreted from Cantonese into Putonghua and English, and those in the Beijing subset Putonghua to English. The other two subsets, i.e., Washington DC and London, mainly includes English speeches delivered in similar settings and can be regarded as reference subsets to the interpreted English speeches.
Data of CEPIC were collected in two ways: 1) Speech transcripts and their translations collected from government websites (“Raw”); and 2) A revised or newly transcribed version (when there are no readily available transcripts) of these speeches and their interpreted texts based on audios/videos collected from government websites and TV programme archives (“Annotated”).
The corpus features a parallel display of up to six versions of the same speech segment, aligned at paragraph level. Apart from POS tagging, the corpus is also annotated with different prosodic and paralinguistic features that are of concern to the study of spoken language as well as interpreting.
The CEPIC can be used to investigate matters relating to Chinese/English political translation/interpreting and political discourse at large. It can also serve students, teachers, as well as people working in political settings, in aspects of political speech delivery and translation/interpreting production. Users can also download search results from the corpus for their own teaching/research purposes.
The CEPIC consists of parallel representation of speech segments in Cantonese, Putonghua and English. The following table shows the number of words (word token) and unique words (type) in each language.
Table 1. The composition of the CEPIC by language
Word (Word Token) | Unique Word (Type) | |
---|---|---|
Chinese | 2,578,911 | 83,312 |
Cantonese Putonghua |
1,072,368 1,506,541 |
61,837 30,320 |
English | 3,815,083 | 32,748 |
Total | 6,393,994 | 116,060 |
The main speech types of CEPIC include the reading of government reports such as policy addresses and budget speeches, Q&A at press conferences, parliamentary debates, as well as remarks delivered at bilateral meetings. The following table shows the current composition of the corpus and some basic statistics of each subset.
Table 2. The composition of the CEPIC by speech types
Speech Type | Word (Word Token) | |
---|---|---|
1 | HK SAR Policy Addresses (HKPA) | 1,290,774 |
2 | Press Conferences of HK SAR Policy Addresses (HKPAPC) | 326,194 |
3 | HK SAR Budget Speeches (HKBS) | 1,167,530 |
4 | Press Conferences of HK SAR Budge Speeches (HKBSPC) | 419,236 |
5 | PRC Reports on the Work of the Government (PRCWoG) | 782,794 |
6 | Press Conferences of PRC Reports on the Work of the Government (PRCWoGPC) | 448,111 |
7 | US State of the Union Addresses (USSoUA) | 275,018 |
8 | Press Conferences of US State of the Union Addresses (USSoUAPC) | 266,639 |
9 | US Budget Speeches (USBS) | 73,115 |
10 | Press Conferences of US Budget Speeches (USBSPC) | 328,850 |
11 | UK State Opening Addresses of Parliament (UKSOoP) | 31,006 |
12 | Debates on the UK State Opening Addresses of Parliament (UKSOoPD) | 53,941 |
13 | UK Budget Speeches (UKBS) | 469,452 |
14 | Debates on the UK Budget Speeches (UKBSD) | 376,721 |
15 | Bilateral Meetings between PRC Key Politicians and their Counterparts in US (BMPRCUS) | 70,138 |
16 | Bilateral Meetings between PRC Key Politicians and their Counterparts in UK (BMPRCUK) | 14,473 |
Total | 6,393,994 |
The following is a list of words that have specific meaning in the CEPIC.
The CEPIC is POS tagged with the assistance of Stanford CoreNLP 3.9.2 (Manning et al. 2014).
A semi-automatic process was employed to enhance the accuracy rate of machine tagging, in which all taggers were checked and revised based on subsets of manually checked testing data. Please click here(available soon) for a detailed account of the semi-automatic process employed in the POS tagging of CEPIC.
The following table provides a list of the POS taggers that appeared in the English subset of CEPIC, which is based on the Part-of-Speech Tagging Guidelines for the Penn Treebank Project (Santorini 1990, 6-7).
Table 1. POS taggers that appeared in the English subset of CEPIC (based on Santorini 1990: 6-7)
POS tagger | Description |
---|---|
/CC | Coordinating conjunction |
/CD | Cardinal number |
/DT | Determiner |
/EX | Existential there |
/FW | Foreign word |
/IN | Preposition or subordinating conjunction |
/JJ | Adjective |
/JJR | Adjective, comparative |
/JJS | Adjective, superlative |
/LRB | Open parenthesis |
/LS | List item marker |
/MD | Modal |
/NN | Noun, singular or mass |
/NNP | Noun, plural |
/NNPS | Proper noun, singular |
/NNS | Proper noun, plural |
/PDT | Predeterminer |
/POS | Possessive ending |
/PRP | Personal pronoun |
/PRP$ | Possessive pronoun |
/PU | Punctuation |
/RB | Adverb |
/RBR | Adverb, comparative |
/RBS | Adverb, superlative |
/RP | Particle |
/RRB | Close parenthesis |
/SYM | Symbol |
/TO | to |
/UH | Interjection |
/VB | Verb, base form |
/VBD | Verb, past tense |
/VBG | Verb, gerund or present participle |
/VBN | Verb, past participle |
/VBP | Verb, non-3rd person singular present |
/VBZ | Verb, 3rd person singular present |
/WDT | Wh-determiner |
/WP | Wh-pronoun |
/WP$ | Possessive wh-pronoun |
/WRB | Wh-adverb |
The following table provides a list of the POS taggers that appeared in the Chinese subset of CEPIC, which is based on the Part-Of-Speech Tagging Guidelines for the Penn Chinese Treebank (3.0) (Xia 2000: 37).
Table 2. POS taggers that appeared in the Chinese subset of CEPIC (based on Xia 2000: 37)
POS tagger | Description |
---|---|
/AD | Adverb |
/AS | Aspect Particle |
/BA | 把 in ba-construction: |
/CC | Coordinating conjunction |
/CD | Cardinal number |
/CS | Subordinating conjunction |
/DEC | 的 as a complementizer or a nominalizer |
/DEG | 的as a genitive marker and an associative marker |
/DER | Resultative得 |
/DEV | Manner地 (before VP) |
/DT | Determiner |
/ETC | For words 等, 等等 |
/FW | Foreign words |
/IJ | Interjection |
/JJ | Other noun-modifer |
/LB | 被 in long bei-construction |
/LC | Localizer |
/LRB | Open parenthesis |
/M | Measure word |
/MSP | Other particle |
/NN | Common noun |
/NR | Proper noun |
/NT | Temporal noun |
/OD | Ordinal number |
/P | Preposition |
/PN | Pronoun |
/PU | Punctuation |
/RRB | Close parenthesis |
/SB | 被 in short bei-construction |
/SP | Sentence-final particle |
/VA | Predicative adjective |
/VC | Copula |
/VE | 有 as the main verb |
/VV | Other verb |
References:
Manning, Christopher D., Mihai Surdeanu, John Bauer, Jenny Finkel, Steven J. Bethard, and David McClosky. (2014). The Stanford CoreNLP Natural Language Processing Toolkit. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 55-60.
Santorini, B. (1990). Part-Of-Speech Tagging Guidelines for the Penn Treebank Project (3rd revision, 2nd printing). Department of Linguistics, University of Pennsylvania.
Xia, Fei. (2000). The Part-Of-Speech Tagging Guidelines for the Penn Chinese Treebank (3.0). IRCS Technical Reports Series. 38.
Back to TopData of CEPIC were collected following specific steps and protocols. In particular, the “annotated” version of the CEPIC corpus was transcribed and annotated in a way that reflects features of spoken language data.
Speeches of CEPIC were manually revised or transcribed based on audios/videos with the speeches and their interpreting, if any. Apart from following a standardised process, the transcription of CEPIC aims to represent the spoken text as close as it was delivered. In addition, all Cantonese texts were transcribed in a way to capture spoken Cantonese features. Text and audio/video links were also included for those who may be interested in the sources of the speeches.
The following table shows the differences between the “raw” and “annotated” data.
Raw | Annotated | |
---|---|---|
Cantonese | 我們要重新認識八十年代關於香港發展的一些重要概念,放棄二元對立分析方法。(HKSAR Policy Address, 2008-10-15) | [...][我要]我哋要重新認識八十年代關於香港發展嘅一啲嘅重要嘅概念,係放棄二元對立嘅分析嘅方法。(HKSAR Policy Address, 2008-10-15) |
Putonghua | 在元朝有一位画家叫黄公望,他画了一幅著名的《富春江居图》。(Press Conference of PRC Report on the Work of the Government, 2010-03-14) | 在元朝 [...]有一位画家[...]叫黄公望,他画了一幅[...]著名的 [...]啊 [...]《富春江居图》。(Press Conference of PRC Report on the Work of the Government, 2010-03-14) |
English | So that is the big difference in our approach and the approach that I think might have been debated about. (Press Conference of US Budget Speech, 1997-02-06) | [er] So [that] that is the big difference [er] in our approach and the approach [er] that [er] I think [er] might have been debated about. (Press Conference of US Budget Speech, 1997-02-06) |
As can be seen from the above table, the “annotated” version features annotations of different prosodic and paralinguistic features (e.g., pauses, fillers, repetitions and self-repair, etc.) that are of concern to the study of spoken language as well as interpreting.
Please click here(available soon) for a detailed account of the steps and protocols used in the transcription and annotation of CEPIC.
This section shows the most frequently used words in CEPIC and in its subsets by language and speech type. The data were extracted with the help of the lexical analysis software WordSmith 7.0.
To accommodate a wide range of research/teaching purposes, the word frequency data were generated with no stop word list.
References:
Scott, M. (2016). WordSmith Tools 7.0. Accessed from https://lexically.net/wordsmith/ (Accessed on 10 September 2017).
N | Word | Freq. |
---|---|---|
1 | 的 | 123,613 |
2 | 和 | 32,017 |
3 | 在 | 26,392 |
4 | 我 | 26,347 |
5 | 是 | 23,998 |
6 | 一 | 20,961 |
7 | 政府 | 20,632 |
8 | 香港 | 20,575 |
9 | 有 | 19,748 |
10 | 我们 | 19,147 |
11 | 呢 | 15,344 |
12 | 嘅 | 14,758 |
13 | 发展 | 12,239 |
14 | # | 12,205 |
15 | 会 | 11,938 |
16 | 要 | 11,446 |
17 | 都 | 10,896 |
18 | 及 | 10,741 |
19 | 了 | 10,234 |
20 | 经济 | 9,863 |
21 | 个 | 9,769 |
22 | 係 | 8,965 |
23 | 不 | 8,830 |
24 | 個 | 8,740 |
25 | 元 | 8,604 |
26 | 會 | 7,742 |
27 | 方面 | 7,565 |
28 | 你 | 7,494 |
29 | 可以 | 7,437 |
30 | 也 | 7,252 |
31 | 亦 | 7,242 |
32 | 就 | 7,189 |
33 | 我們 | 7,167 |
34 | 工作 | 7,118 |
35 | 呃 | 7,020 |
36 | 提供 | 6,957 |
37 | 以 | 6,896 |
38 | 新 | 6,867 |
39 | 增加 | 6,851 |
40 | 更 | 6,819 |
41 | 措施 | 6,761 |
42 | 社会 | 6,473 |
43 | 为 | 6,206 |
44 | 多 | 6,205 |
45 | 这 | 6,146 |
46 | 市民 | 6,072 |
47 | 我哋 | 5,862 |
48 | 到 | 5,807 |
49 | 同埋 | 5,803 |
50 | 需要 | 5,615 |
51 | 發展 | 5,592 |
52 | 年 | 5,479 |
53 | 政策 | 5,345 |
54 | 各 | 5,294 |
55 | 中 | 5,182 |
56 | 金融 | 5,061 |
57 | 对 | 4,955 |
58 | 问题 | 4,950 |
59 | 教育 | 4,798 |
60 | 等 | 4,768 |
61 | 加强 | 4,686 |
62 | 服务 | 4,513 |
63 | 今年 | 4,446 |
64 | 上 | 4,438 |
65 | 大 | 4,405 |
66 | 已 | 4,405 |
67 | 經濟 | 4,368 |
68 | 為 | 4,328 |
69 | 以及 | 4,230 |
70 | 与 | 4,172 |
71 | 计划 | 4,168 |
72 | 包括 | 4,138 |
73 | 至 | 4,124 |
74 | 地 | 4,087 |
75 | 将 | 4,079 |
76 | 改革 | 4,059 |
77 | 研究 | 4,031 |
78 | 而 | 3,961 |
79 | 人 | 3,867 |
80 | 市场 | 3,860 |
81 | 啊 | 3,808 |
82 | 收入 | 3,784 |
83 | 建设 | 3,739 |
84 | 但 | 3,686 |
85 | 港 | 3,659 |
86 | 企业 | 3,641 |
87 | 做 | 3,636 |
88 | 去年 | 3,610 |
89 | 改善 | 3,605 |
90 | 合作 | 3,592 |
91 | 服務 | 3,573 |
92 | 这个 | 3,563 |
93 | 每 | 3,524 |
94 | 支持 | 3,504 |
95 | 人士 | 3,444 |
96 | 中国 | 3,444 |
97 | 土地 | 3,375 |
98 | 喺 | 3,368 |
99 | 好 | 3,365 |
100 | 同 | 3,362 |
N | Word | Freq. |
---|---|---|
1 | 的 | 35,334 |
2 | 嘅 | 14,674 |
3 | 我 | 13,775 |
4 | 呢 | 12,787 |
5 | 香港 | 10,179 |
6 | 一 | 9,683 |
7 | 有 | 9,480 |
8 | 政府 | 9,370 |
9 | 係 | 8,843 |
10 | 個 | 8,643 |
11 | 在 | 8,508 |
12 | 會 | 7,710 |
13 | 和 | 7,414 |
14 | 我們 | 7,094 |
15 | 都 | 6,989 |
16 | 是 | 6,312 |
17 | 我哋 | 5,859 |
18 | 同埋 | 5,802 |
19 | 發展 | 5,587 |
20 | 亦 | 5,100 |
21 | # | 4,812 |
22 | 及 | 4,781 |
23 | 經濟 | 4,360 |
24 | 為 | 4,322 |
25 | 就 | 4,058 |
26 | 可以 | 3,882 |
27 | 你 | 3,857 |
28 | 元 | 3,730 |
29 | 要 | 3,713 |
30 | 服務 | 3,567 |
31 | 方面 | 3,520 |
32 | 呃 | 3,515 |
33 | 年 | 3,492 |
34 | 提供 | 3,446 |
35 | 喺 | 3,354 |
36 | 市民 | 3,105 |
37 | 措施 | 3,087 |
38 | 以 | 3,077 |
39 | 更 | 2,958 |
40 | 增加 | 2,939 |
41 | 計劃 | 2,908 |
42 | 到 | 2,886 |
43 | 社會 | 2,847 |
44 | 多 | 2,797 |
45 | 同 | 2,780 |
46 | 需要 | 2,707 |
47 | 不 | 2,673 |
48 | 新 | 2,672 |
49 | 工作 | 2,525 |
50 | 至 | 2,375 |
51 | 將 | 2,342 |
52 | 了 | 2,252 |
53 | 對 | 2,228 |
54 | 哋 | 2,102 |
55 | 而 | 2,072 |
56 | 已 | 2,059 |
57 | 市場 | 2,056 |
58 | 金融 | 2,033 |
59 | 但 | 2,001 |
60 | 包括 | 1,990 |
61 | 以及 | 1,964 |
62 | 去 | 1,945 |
63 | 問題 | 1,944 |
64 | 與 | 1,918 |
65 | 已經 | 1,916 |
66 | 唔 | 1,912 |
67 | 研究 | 1,894 |
68 | 這 | 1,890 |
69 | 各 | 1,869 |
70 | 咗 | 1,815 |
71 | 政策 | 1,790 |
72 | 今年 | 1,781 |
73 | 大 | 1,777 |
74 | 教育 | 1,774 |
75 | 每 | 1,757 |
76 | 做 | 1,754 |
77 | 中 | 1,751 |
78 | 好 | 1,722 |
79 | 人士 | 1,722 |
80 | 上 | 1,720 |
81 | 港 | 1,718 |
82 | 嗰 | 1,714 |
83 | 人 | 1,672 |
84 | 收入 | 1,640 |
85 | 於 | 1,610 |
86 | 咁 | 1,593 |
87 | 由 | 1,564 |
88 | 建議 | 1,555 |
89 | 啊 | 1,550 |
90 | 土地 | 1,548 |
91 | 改善 | 1,512 |
92 | 並 | 1,495 |
93 | 項 | 1,492 |
94 | 加強 | 1,480 |
95 | 施政 | 1,467 |
96 | 去年 | 1,460 |
97 | 等 | 1,436 |
98 | 地 | 1,434 |
99 | 其實 | 1,422 |
100 | 報告 | 1,417 |
N | Word | Freq. |
---|---|---|
1 | 的 | 88,279 |
2 | 和 | 24,603 |
3 | 我们 | 19,147 |
4 | 在 | 17,884 |
5 | 是 | 17,686 |
6 | 我 | 12,572 |
7 | 发展 | 12,239 |
8 | 会 | 11,938 |
9 | 一 | 11,278 |
10 | 政府 | 11,262 |
11 | 香港 | 10,396 |
12 | 有 | 10,268 |
13 | 经济 | 9,863 |
14 | 个 | 9,769 |
15 | 了 | 7,982 |
16 | 要 | 7,733 |
17 | # | 7,393 |
18 | 社会 | 6,473 |
19 | 为 | 6,206 |
20 | 不 | 6,157 |
21 | 这 | 6,146 |
22 | 及 | 5,960 |
23 | 也 | 5,873 |
24 | 对 | 4,955 |
25 | 问题 | 4,950 |
26 | 元 | 4,874 |
27 | 加强 | 4,679 |
28 | 工作 | 4,593 |
29 | 服务 | 4,513 |
30 | 新 | 4,195 |
31 | 与 | 4,172 |
32 | 计划 | 4,168 |
33 | 将 | 4,078 |
34 | 方面 | 4,045 |
35 | 增加 | 3,912 |
36 | 都 | 3,907 |
37 | 更 | 3,861 |
38 | 市场 | 3,860 |
39 | 以 | 3,819 |
40 | 建设 | 3,739 |
41 | 措施 | 3,674 |
42 | 企业 | 3,641 |
43 | 你 | 3,637 |
44 | 改革 | 3,587 |
45 | 这个 | 3,563 |
46 | 可以 | 3,555 |
47 | 政策 | 3,555 |
48 | 提供 | 3,511 |
49 | 呃 | 3,505 |
50 | 中国 | 3,444 |
51 | 中 | 3,431 |
52 | 各 | 3,425 |
53 | 多 | 3,408 |
54 | 等 | 3,332 |
55 | 继续 | 3,324 |
56 | 两 | 3,270 |
57 | 就 | 3,131 |
58 | 金融 | 3,028 |
59 | 教育 | 3,024 |
60 | 市民 | 2,967 |
61 | 到 | 2,921 |
62 | 需要 | 2,908 |
63 | 财政 | 2,863 |
64 | 国家 | 2,819 |
65 | 上 | 2,718 |
66 | 今年 | 2,665 |
67 | 地 | 2,653 |
68 | 并 | 2,641 |
69 | 大 | 2,628 |
70 | 呢 | 2,557 |
71 | 增长 | 2,547 |
72 | 支持 | 2,531 |
73 | 国际 | 2,501 |
74 | 推动 | 2,387 |
75 | 提高 | 2,357 |
76 | 已 | 2,346 |
77 | 合作 | 2,341 |
78 | 推进 | 2,325 |
79 | 他们 | 2,309 |
80 | 以及 | 2,266 |
81 | 投资 | 2,264 |
82 | 啊 | 2,258 |
83 | 已经 | 2,254 |
84 | 环境 | 2,233 |
85 | 项 | 2,230 |
86 | 积极 | 2,219 |
87 | 就业 | 2,217 |
88 | 制度 | 2,216 |
89 | 人 | 2,195 |
90 | 去年 | 2,150 |
91 | 包括 | 2,148 |
92 | 收入 | 2,144 |
93 | 亦 | 2,142 |
94 | 研究 | 2,137 |
95 | 说 | 2,129 |
96 | 改善 | 2,093 |
97 | 促进 | 2,020 |
98 | 让 | 2,011 |
99 | 重要 | 2,010 |
100 | 地区 | 1,999 |
N | Word | Freq. |
---|---|---|
1 | THE | 240,433 |
2 | AND | 138,508 |
3 | TO | 126,832 |
4 | OF | 110,864 |
5 | IN | 80,540 |
6 | WE | 62,892 |
7 | A | 61,145 |
8 | THAT | 59,563 |
9 | FOR | 46,229 |
10 | WILL | 44,349 |
11 | IS | 40,311 |
12 | I | 33,912 |
13 | # | 33,206 |
14 | ON | 28,155 |
15 | OUR | 28,063 |
16 | THIS | 26,071 |
17 | HAVE | 25,731 |
18 | S | 25,425 |
19 | IT | 24,264 |
20 | WITH | 22,882 |
21 | BE | 22,386 |
22 | ARE | 21,821 |
23 | AS | 21,081 |
24 | GOVERNMENT | 16,094 |
25 | BY | 15,707 |
26 | YOU | 15,659 |
27 | NOT | 14,718 |
28 | YEAR | 14,322 |
29 | HAS | 14,222 |
30 | ER | 13,640 |
31 | FROM | 13,366 |
32 | PEOPLE | 13,155 |
33 | MORE | 12,984 |
34 | AT | 12,005 |
35 | SO | 11,243 |
36 | BUT | 11,050 |
37 | CAN | 10,678 |
38 | AN | 10,583 |
39 | THEIR | 10,539 |
40 | ALL | 10,466 |
41 | DEVELOPMENT | 10,430 |
42 | HONG | 10,425 |
43 | KONG | 10,339 |
44 | THEY | 10,162 |
45 | NEW | 9,828 |
46 | DO | 9,441 |
47 | ALSO | 9,271 |
48 | TAX | 9,258 |
49 | THERE | 8,827 |
50 | ABOUT | 8,747 |
51 | ECONOMIC | 8,496 |
52 | HE | 8,464 |
53 | WAS | 8,146 |
54 | WHAT | 8,041 |
55 | OR | 8,028 |
56 | YEARS | 7,938 |
57 | UP | 7,698 |
58 | WORK | 7,339 |
59 | BUDGET | 7,289 |
60 | PUBLIC | 7,118 |
61 | BEEN | 7,010 |
62 | NOW | 6,930 |
63 | WOULD | 6,915 |
64 | WHICH | 6,874 |
65 | ONE | 6,822 |
66 | ECONOMY | 6,816 |
67 | OVER | 6,752 |
68 | SHOULD | 6,704 |
69 | UM | 6,549 |
70 | PER | 6,460 |
71 | THESE | 6,407 |
72 | THAN | 5,926 |
73 | MAKE | 5,918 |
74 | MY | 5,863 |
75 | OUT | 5,759 |
76 | SUPPORT | 5,747 |
77 | LAST | 5,745 |
78 | GROWTH | 5,663 |
79 | T | 5,630 |
80 | N | 5,580 |
81 | SOME | 5,575 |
82 | CENT | 5,567 |
83 | NIL | 5,548 |
84 | WHO | 5,499 |
85 | CHINA | 5,495 |
86 | NEED | 5,481 |
87 | IF | 5,419 |
88 | OTHER | 5,371 |
89 | SERVICES | 5,354 |
90 | PRESIDENT | 5,318 |
91 | FINANCIAL | 5,207 |
92 | THOSE | 5,172 |
93 | TIME | 5,049 |
94 | US | 5,024 |
95 | SYSTEM | 4,990 |
96 | TWO | 4,941 |
97 | THINK | 4,908 |
98 | BILLION | 4,862 |
99 | WHEN | 4,847 |
100 | COUNTRY | 4,785 |
N | Word | Freq. |
---|---|---|
1 | 的 | 13,374 |
2 | 政府 | 4,661 |
3 | 嘅 | 4,588 |
4 | 香港 | 4,579 |
5 | 和 | 3,765 |
6 | 會 | 3,114 |
7 | 發展 | 3,023 |
8 | 在 | 2,973 |
9 | 我們 | 2,887 |
10 | 同埋 | 2,841 |
11 | 及 | 2,524 |
12 | 一 | 2,488 |
13 | 有 | 2,328 |
14 | 個 | 2,099 |
15 | 為 | 1,941 |
16 | 服務 | 1,838 |
17 | 我 | 1,794 |
18 | 提供 | 1,751 |
19 | 亦 | 1,683 |
20 | 我哋 | 1,655 |
21 | # | 1,621 |
22 | 都 | 1,607 |
23 | 計劃 | 1,558 |
24 | 經濟 | 1,537 |
25 | 是 | 1,529 |
26 | 以 | 1,504 |
27 | 社會 | 1,485 |
28 | 更 | 1,430 |
29 | 將 | 1,414 |
30 | 新 | 1,309 |
31 | 市民 | 1,299 |
32 | 工作 | 1,241 |
33 | 要 | 1,163 |
34 | 與 | 1,156 |
35 | 研究 | 1,090 |
36 | 教育 | 1,070 |
37 | 需要 | 1,058 |
38 | 多 | 1,054 |
39 | 已 | 1,015 |
40 | 增加 | 1,013 |
41 | 各 | 969 |
42 | 同 | 930 |
43 | 並 | 908 |
44 | 加強 | 903 |
45 | 包括 | 897 |
46 | 喺 | 891 |
47 | 方面 | 882 |
48 | 對 | 879 |
49 | 以及 | 879 |
50 | 年 | 877 |
51 | 就 | 868 |
52 | 港 | 861 |
53 | 了 | 842 |
54 | 措施 | 840 |
55 | 係 | 839 |
56 | 政策 | 837 |
57 | 土地 | 822 |
58 | 人士 | 784 |
59 | 推動 | 756 |
60 | 已經 | 743 |
61 | 支援 | 743 |
62 | 可以 | 739 |
63 | 合作 | 735 |
64 | 內地 | 733 |
65 | 改善 | 716 |
66 | 至 | 715 |
67 | 中 | 713 |
68 | 等 | 705 |
69 | 中心 | 679 |
70 | 環境 | 659 |
71 | 市場 | 653 |
72 | 也 | 650 |
73 | 元 | 649 |
74 | 項 | 641 |
75 | 問題 | 632 |
76 | 繼續 | 625 |
77 | 金融 | 624 |
78 | 文化 | 622 |
79 | 由 | 616 |
80 | 今年 | 614 |
81 | 提升 | 609 |
82 | 家庭 | 605 |
83 | 這 | 604 |
84 | 到 | 602 |
85 | 而 | 602 |
86 | 咗 | 590 |
87 | 可 | 587 |
88 | 特區 | 577 |
89 | 上 | 574 |
90 | 推行 | 572 |
91 | 建議 | 569 |
92 | 向 | 564 |
93 | 房屋 | 563 |
94 | 國際 | 557 |
95 | 委員會 | 548 |
96 | 不 | 546 |
97 | 下 | 546 |
98 | 大 | 545 |
99 | 本 | 540 |
100 | 地區 | 539 |
N | Word | Freq. |
---|---|---|
1 | 的 | 23,620 |
2 | 和 | 6,685 |
3 | 在 | 5,151 |
4 | 政府 | 4,862 |
5 | 香港 | 4,725 |
6 | 我们 | 4,698 |
7 | 会 | 4,307 |
8 | 发展 | 3,628 |
9 | 及 | 3,438 |
10 | 是 | 2,606 |
11 | 为 | 2,359 |
12 | 个 | 2,196 |
13 | 将 | 2,139 |
14 | 有 | 2,109 |
15 | 计划 | 2,090 |
16 | 一 | 2,050 |
17 | 服务 | 2,010 |
18 | 经济 | 1,986 |
19 | 社会 | 1,903 |
20 | 与 | 1,869 |
21 | 我 | 1,828 |
22 | 提供 | 1,811 |
23 | # | 1,661 |
24 | 也 | 1,604 |
25 | 以 | 1,560 |
26 | 更 | 1,540 |
27 | 了 | 1,470 |
28 | 并 | 1,454 |
29 | 市民 | 1,402 |
30 | 新 | 1,384 |
31 | 已 | 1,355 |
32 | 工作 | 1,299 |
33 | 加强 | 1,258 |
34 | 对 | 1,231 |
35 | 教育 | 1,177 |
36 | 研究 | 1,145 |
37 | 要 | 1,121 |
38 | 需要 | 1,105 |
39 | 多 | 1,067 |
40 | 增加 | 1,062 |
41 | 各 | 1,044 |
42 | 以及 | 1,020 |
43 | 港 | 999 |
44 | 内地 | 962 |
45 | 这 | 961 |
46 | 继续 | 945 |
47 | 方面 | 936 |
48 | 包括 | 926 |
49 | 支持 | 924 |
50 | 推动 | 915 |
51 | 政策 | 911 |
52 | 措施 | 871 |
53 | 人士 | 841 |
54 | 土地 | 832 |
55 | 环境 | 820 |
56 | 问题 | 817 |
57 | 等 | 816 |
58 | 市场 | 796 |
59 | 中 | 786 |
60 | 合作 | 767 |
61 | 改善 | 754 |
62 | 委员会 | 741 |
63 | 至 | 736 |
64 | 元 | 733 |
65 | 项 | 729 |
66 | 资助 | 729 |
67 | 建议 | 727 |
68 | 都 | 712 |
69 | 长者 | 712 |
70 | 国际 | 709 |
71 | 亦 | 703 |
72 | 中心 | 685 |
73 | 积极 | 684 |
74 | 就 | 680 |
75 | 不 | 671 |
76 | 可 | 668 |
77 | 特区 | 667 |
78 | 于 | 661 |
79 | 约 | 654 |
80 | 让 | 646 |
81 | 由 | 643 |
82 | 金融 | 641 |
83 | 上 | 641 |
84 | 提升 | 640 |
85 | 两 | 638 |
86 | 未来 | 638 |
87 | 可以 | 632 |
88 | 文化 | 630 |
89 | 下 | 629 |
90 | 家庭 | 624 |
91 | 地区 | 620 |
92 | 国家 | 620 |
93 | 本 | 619 |
94 | 今年 | 613 |
95 | 内 | 603 |
96 | 同时 | 601 |
97 | 推行 | 598 |
98 | 正 | 596 |
99 | 而 | 595 |
100 | 医疗 | 590 |
N | Word | Freq. |
---|---|---|
1 | THE | 37,091 |
2 | AND | 20,046 |
3 | TO | 18,499 |
4 | OF | 16,115 |
5 | IN | 11,292 |
6 | WILL | 8,362 |
7 | A | 7,790 |
8 | FOR | 7,329 |
9 | WE | 6,655 |
10 | HONG | 4,572 |
11 | OUR | 4,567 |
12 | KONG | 4,515 |
13 | GOVERNMENT | 4,053 |
14 | WITH | 4,041 |
15 | IS | 3,646 |
16 | AS | 3,517 |
17 | ON | 3,509 |
18 | # | 3,473 |
19 | DEVELOPMENT | 3,065 |
20 | HAVE | 2,993 |
21 | THAT | 2,651 |
22 | THIS | 2,447 |
23 | BE | 2,439 |
24 | BY | 2,355 |
25 | HAS | 2,347 |
26 | ARE | 2,113 |
27 | S | 2,047 |
28 | YEAR | 1,933 |
29 | PUBLIC | 1,867 |
30 | SERVICES | 1,841 |
31 | FROM | 1,764 |
32 | I | 1,764 |
33 | PEOPLE | 1,692 |
34 | MORE | 1,690 |
35 | THEIR | 1,643 |
36 | NEW | 1,631 |
37 | ALSO | 1,629 |
38 | AN | 1,620 |
39 | COMMUNITY | 1,410 |
40 | UP | 1,353 |
41 | SUPPORT | 1,322 |
42 | AT | 1,312 |
43 | PROVIDE | 1,283 |
44 | ECONOMIC | 1,260 |
45 | IT | 1,142 |
46 | EDUCATION | 1,139 |
47 | YEARS | 1,135 |
48 | MAINLAND | 1,090 |
49 | CAN | 1,040 |
50 | FINANCIAL | 1,035 |
51 | SCHEME | 1,031 |
52 | THESE | 1,027 |
53 | BEEN | 931 |
54 | WHICH | 926 |
55 | ABOUT | 896 |
56 | ALL | 879 |
57 | LAND | 874 |
58 | CARE | 872 |
59 | SOCIAL | 846 |
60 | WORK | 837 |
61 | CONTINUE | 831 |
62 | NOT | 824 |
63 | OR | 824 |
64 | MEASURES | 817 |
65 | ENHANCE | 797 |
66 | HOUSING | 796 |
67 | ELDERLY | 774 |
68 | POLICY | 771 |
69 | ITS | 770 |
70 | TWO | 765 |
71 | OVER | 763 |
72 | QUALITY | 756 |
73 | INTO | 731 |
74 | PROMOTE | 725 |
75 | MARKET | 703 |
76 | UNDER | 703 |
77 | OTHER | 701 |
78 | NEED | 679 |
79 | THROUGH | 673 |
80 | SHOULD | 671 |
81 | SERVICE | 667 |
82 | PROJECTS | 661 |
83 | SUCH | 657 |
84 | BUSINESS | 652 |
85 | HELP | 648 |
86 | FURTHER | 628 |
87 | SET | 625 |
88 | ECONOMY | 624 |
89 | SYSTEM | 622 |
90 | NEXT | 616 |
91 | LAST | 612 |
92 | USE | 598 |
93 | COUNCIL | 597 |
94 | INTERNATIONAL | 587 |
95 | IMPROVE | 577 |
96 | INDUSTRIES | 576 |
97 | ENVIRONMENT | 575 |
98 | AREAS | 570 |
99 | OPPORTUNITIES | 569 |
100 | MUST | 561 |
N | Word | Freq. |
---|---|---|
1 | 呢 | 4,001 |
2 | 的 | 3,660 |
3 | 呃 | 2,916 |
4 | 我 | 2,504 |
5 | 一 | 2,268 |
6 | 嘅 | 2,160 |
7 | 有 | 1,921 |
8 | 我哋 | 1,780 |
9 | 個 | 1,621 |
10 | 都 | 1,520 |
11 | 你 | 1,481 |
12 | 我們 | 1,365 |
13 | 是 | 1,348 |
14 | 係 | 1,119 |
15 | 要 | 944 |
16 | 香港 | 930 |
17 | 做 | 875 |
18 | 在 | 850 |
19 | 施政 | 848 |
20 | 報告 | 847 |
21 | 可以 | 769 |
22 | 會 | 703 |
23 | 問題 | 701 |
24 | 政府 | 694 |
25 | 去 | 671 |
26 | 就 | 654 |
27 | 不 | 648 |
28 | 亦 | 640 |
29 | 好 | 613 |
30 | 呢個 | 609 |
31 | 方面 | 545 |
32 | 到 | 529 |
33 | 喺 | 470 |
34 | 這 | 465 |
35 | 大家 | 445 |
36 | 其實 | 444 |
37 | 啊 | 438 |
38 | 需要 | 435 |
39 | 就係 | 429 |
40 | 同埋 | 416 |
41 | 市民 | 415 |
42 | 能夠 | 406 |
43 | 啦 | 401 |
44 | 政策 | 398 |
45 | 所以 | 365 |
46 | 和 | 353 |
47 | 發展 | 347 |
48 | 想 | 345 |
49 | 即係 | 335 |
50 | 已經 | 324 |
51 | 這個 | 323 |
52 | 工作 | 316 |
53 | 社會 | 314 |
54 | 年 | 311 |
55 | 說 | 307 |
56 | 但 | 302 |
57 | 人 | 299 |
58 | 如果 | 294 |
59 | 希望 | 292 |
60 | 上 | 291 |
61 | 了 | 286 |
62 | 多 | 267 |
63 | 宜家 | 267 |
64 | 新 | 266 |
65 | 咁 | 264 |
66 | 一啲 | 263 |
67 | 所 | 259 |
68 | 覺得 | 256 |
69 | 很 | 254 |
70 | 每 | 251 |
71 | 同 | 246 |
72 | 大 | 241 |
73 | 而 | 239 |
74 | 裏面 | 238 |
75 | 呢啲 | 231 |
76 | 提出 | 229 |
77 | 對 | 226 |
78 | 一些 | 226 |
79 | 但係 | 224 |
80 | 得 | 224 |
81 | 自己 | 224 |
82 | 最 | 221 |
83 | 重要 | 220 |
84 | 包括 | 218 |
85 | 經濟 | 217 |
86 | 房屋 | 215 |
87 | 一定 | 215 |
88 | 向 | 212 |
89 | 他們 | 204 |
90 | 這些 | 202 |
91 | 好多 | 201 |
92 | 用 | 201 |
93 | 唔 | 197 |
94 | 事 | 190 |
95 | 措施 | 189 |
96 | 講 | 185 |
97 | 金融 | 185 |
98 | 相信 | 185 |
99 | # | 182 |
100 | 或者 | 177 |
N | Word | Freq. |
---|---|---|
1 | 的 | 8,494 |
2 | 是 | 3,053 |
3 | 我们 | 2,902 |
4 | 我 | 2,220 |
5 | 有 | 1,820 |
6 | 呃 | 1,770 |
7 | 一 | 1,769 |
8 | 在 | 1,695 |
9 | 个 | 1,487 |
10 | 不 | 1,245 |
11 | 你 | 1,191 |
12 | 会 | 1,064 |
13 | 这 | 945 |
14 | 这个 | 945 |
15 | 香港 | 857 |
16 | 要 | 841 |
17 | 做 | 815 |
18 | 报告 | 809 |
19 | 问题 | 792 |
20 | 施政 | 783 |
21 | 呢 | 763 |
22 | 了 | 752 |
23 | 都 | 698 |
24 | 说 | 646 |
25 | 可以 | 640 |
26 | 政府 | 633 |
27 | 也 | 620 |
28 | 和 | 607 |
29 | 方面 | 571 |
30 | 很 | 487 |
31 | 他们 | 481 |
32 | 就是 | 472 |
33 | 一些 | 458 |
34 | 去 | 413 |
35 | 需要 | 406 |
36 | 市民 | 404 |
37 | 就 | 396 |
38 | 发展 | 388 |
39 | 政策 | 366 |
40 | 社会 | 365 |
41 | 所以 | 361 |
42 | 工作 | 336 |
43 | 很多 | 334 |
44 | 到 | 329 |
45 | 大家 | 315 |
46 | 经济 | 314 |
47 | 但 | 311 |
48 | 这些 | 302 |
49 | 人 | 299 |
50 | 已经 | 299 |
51 | 亦 | 299 |
52 | 没有 | 298 |
53 | 希望 | 298 |
54 | 对 | 297 |
55 | 现在 | 297 |
56 | 能 | 282 |
57 | 想 | 280 |
58 | 甚么 | 271 |
59 | 能够 | 267 |
60 | 多 | 263 |
61 | 如果 | 261 |
62 | 好 | 259 |
63 | 但是 | 257 |
64 | 上 | 253 |
65 | 新 | 253 |
66 | 地 | 244 |
67 | 其实 | 244 |
68 | 觉得 | 237 |
69 | 大 | 234 |
70 | 时候 | 223 |
71 | 为 | 221 |
72 | 还有 | 220 |
73 | 措施 | 208 |
74 | 时间 | 208 |
75 | 提出 | 205 |
76 | 两 | 203 |
77 | 每 | 200 |
78 | 重要 | 198 |
79 | 所 | 196 |
80 | # | 195 |
81 | 这样 | 193 |
82 | 更 | 187 |
83 | 来 | 186 |
84 | 向 | 186 |
85 | 包括 | 185 |
86 | 事 | 185 |
87 | 应该 | 184 |
88 | 房屋 | 183 |
89 | 因为 | 183 |
90 | 跟 | 181 |
91 | 里面 | 181 |
92 | 一定 | 180 |
93 | 金融 | 178 |
94 | 最 | 177 |
95 | 市场 | 172 |
96 | 其他 | 171 |
97 | 得 | 169 |
98 | 用 | 167 |
99 | 比较 | 163 |
100 | 土地 | 163 |
N | Word | Freq. |
---|---|---|
1 | THE | 4,932 |
2 | ER | 3,701 |
3 | AND | 2,383 |
4 | TO | 2,329 |
5 | WE | 1,902 |
6 | IN | 1,777 |
7 | OF | 1,752 |
8 | UM | 1,512 |
9 | THAT | 1,415 |
10 | I | 1,386 |
11 | IS | 1,174 |
12 | A | 1,172 |
13 | HAVE | 1,100 |
14 | YOU | 989 |
15 | FOR | 824 |
16 | IT | 815 |
17 | WILL | 790 |
18 | BE | 770 |
19 | ARE | 755 |
20 | THIS | 749 |
21 | ON | 612 |
22 | SO | 600 |
23 | NOT | 583 |
24 | S | 574 |
25 | POLICY | 522 |
26 | THERE | 502 |
27 | HONG | 492 |
28 | KONG | 485 |
29 | DO | 445 |
30 | AS | 444 |
31 | WITH | 418 |
32 | ADDRESS | 405 |
33 | BUT | 394 |
34 | NOW | 371 |
35 | ALSO | 365 |
36 | CAN | 360 |
37 | OUR | 352 |
38 | PEOPLE | 342 |
39 | THEY | 339 |
40 | WHAT | 324 |
41 | WOULD | 305 |
42 | FROM | 283 |
43 | T | 262 |
44 | ALL | 260 |
45 | ABOUT | 258 |
46 | MY | 258 |
47 | N | 257 |
48 | GOVERNMENT | 245 |
49 | YOUR | 232 |
50 | WELL | 225 |
51 | OR | 219 |
52 | THESE | 215 |
53 | ONE | 199 |
54 | THEN | 198 |
55 | SAID | 190 |
56 | TIME | 189 |
57 | MORE | 188 |
58 | PUBLIC | 188 |
59 | IF | 187 |
60 | SOME | 187 |
61 | VERY | 183 |
62 | BEEN | 182 |
63 | SHOULD | 180 |
64 | AT | 174 |
65 | THINK | 173 |
66 | DEVELOPMENT | 171 |
67 | BY | 170 |
68 | HOUSING | 168 |
69 | HAS | 164 |
70 | NEW | 163 |
71 | AN | 162 |
72 | YEAR | 158 |
73 | NEED | 155 |
74 | TERM | 154 |
75 | M | 150 |
76 | WHICH | 150 |
77 | LIKE | 148 |
78 | FINANCIAL | 146 |
79 | WHEN | 144 |
80 | JUST | 143 |
81 | IMPORTANT | 142 |
82 | HOW | 138 |
83 | YEARS | 138 |
84 | CHIEF | 135 |
85 | ANY | 134 |
86 | TWO | 134 |
87 | COMMUNITY | 133 |
88 | ECONOMIC | 132 |
89 | VE | 130 |
90 | EXECUTIVE | 128 |
91 | GOING | 128 |
92 | THEM | 128 |
93 | THEIR | 124 |
94 | UP | 124 |
95 | BECAUSE | 120 |
96 | MR | 120 |
97 | WORK | 117 |
98 | MARKET | 113 |
99 | WANT | 113 |
100 | MAKE | 111 |
N | Word | Freq. |
---|---|---|
1 | 的 | 14,216 |
2 | 香港 | 4,192 |
3 | 嘅 | 3,647 |
4 | 在 | 3,556 |
5 | 政府 | 3,365 |
6 | 我 | 3,305 |
7 | 和 | 3,052 |
8 | 元 | 2,857 |
9 | 我們 | 2,810 |
10 | 會 | 2,608 |
11 | 一 | 2,606 |
12 | 有 | 2,472 |
13 | # | 2,407 |
14 | 經濟 | 2,268 |
15 | 發展 | 2,097 |
16 | 同埋 | 2,093 |
17 | 及 | 2,045 |
18 | 為 | 2,020 |
19 | 個 | 1,937 |
20 | 亦 | 1,885 |
21 | 年 | 1,659 |
22 | 我哋 | 1,644 |
23 | 係 | 1,624 |
24 | 服務 | 1,584 |
25 | 提供 | 1,535 |
26 | 呢 | 1,503 |
27 | 都 | 1,495 |
28 | 增加 | 1,471 |
29 | 至 | 1,422 |
30 | 是 | 1,391 |
31 | 措施 | 1,375 |
32 | 以 | 1,334 |
33 | 更 | 1,302 |
34 | 同 | 1,240 |
35 | 計劃 | 1,150 |
36 | 市場 | 1,138 |
37 | 金融 | 1,116 |
38 | 方面 | 1,069 |
39 | 收入 | 1,046 |
40 | 開支 | 1,039 |
41 | 市民 | 1,021 |
42 | 去年 | 949 |
43 | 以及 | 945 |
44 | 可以 | 942 |
45 | 本地 | 940 |
46 | 喺 | 940 |
47 | 多 | 929 |
48 | 新 | 929 |
49 | 而 | 920 |
50 | 社會 | 916 |
51 | 每 | 912 |
52 | 財政 | 880 |
53 | 建議 | 872 |
54 | 已 | 868 |
55 | 需要 | 842 |
56 | 包括 | 814 |
57 | 到 | 799 |
58 | 增長 | 793 |
59 | 港 | 792 |
60 | 企業 | 790 |
61 | 就 | 779 |
62 | 將 | 773 |
63 | 由 | 764 |
64 | 工作 | 760 |
65 | 於 | 741 |
66 | 不 | 737 |
67 | 對 | 730 |
68 | 項 | 730 |
69 | 要 | 713 |
70 | 基金 | 712 |
71 | 各 | 702 |
72 | 人士 | 682 |
73 | 改善 | 673 |
74 | 研究 | 649 |
75 | 中 | 632 |
76 | 本 | 625 |
77 | 可 | 616 |
78 | 今年 | 615 |
79 | 已經 | 611 |
80 | 大 | 596 |
81 | 了 | 594 |
82 | 內地 | 591 |
83 | 與 | 588 |
84 | 提升 | 587 |
85 | 但 | 582 |
86 | 推出 | 577 |
87 | 向 | 565 |
88 | 預計 | 562 |
89 | 支援 | 560 |
90 | 等 | 557 |
91 | 中心 | 556 |
92 | 地 | 555 |
93 | 投資 | 551 |
94 | 這 | 548 |
95 | 下 | 547 |
96 | 年度 | 533 |
97 | 推動 | 533 |
98 | 並 | 532 |
99 | 國際 | 528 |
100 | 加強 | 527 |
N | Word | Freq. |
---|---|---|
1 | 的 | 19,306 |
2 | 在 | 4,772 |
3 | 和 | 4,736 |
4 | 会 | 3,748 |
5 | 香港 | 3,605 |
6 | 我们 | 3,490 |
7 | 政府 | 2,877 |
8 | 元 | 2,755 |
9 | 经济 | 2,715 |
10 | 我 | 2,404 |
11 | 发展 | 2,361 |
12 | 为 | 2,250 |
13 | 及 | 2,229 |
14 | 是 | 1,827 |
15 | 个 | 1,789 |
16 | # | 1,779 |
17 | 有 | 1,598 |
18 | 一 | 1,557 |
19 | 计划 | 1,430 |
20 | 服务 | 1,341 |
21 | 提供 | 1,322 |
22 | 开支 | 1,315 |
23 | 增加 | 1,268 |
24 | 市场 | 1,206 |
25 | 更 | 1,136 |
26 | 措施 | 1,127 |
27 | 以 | 1,118 |
28 | 社会 | 1,084 |
29 | 将 | 1,028 |
30 | 也 | 1,021 |
31 | 财政 | 1,005 |
32 | 金融 | 922 |
33 | 至 | 906 |
34 | 对 | 876 |
35 | 以及 | 872 |
36 | 与 | 867 |
37 | 这 | 867 |
38 | 方面 | 861 |
39 | 建议 | 819 |
40 | 去年 | 819 |
41 | 本地 | 817 |
42 | 市民 | 808 |
43 | 了 | 800 |
44 | 多 | 798 |
45 | 收入 | 787 |
46 | 并 | 782 |
47 | 企业 | 769 |
48 | 亦 | 766 |
49 | 已 | 759 |
50 | 项 | 733 |
51 | 港 | 727 |
52 | 可以 | 726 |
53 | 每 | 720 |
54 | 年 | 720 |
55 | 增长 | 717 |
56 | 包括 | 713 |
57 | 新 | 705 |
58 | 内地 | 693 |
59 | 需要 | 693 |
60 | 继续 | 677 |
61 | 约 | 666 |
62 | 预计 | 664 |
63 | 加强 | 653 |
64 | 推动 | 637 |
65 | 国际 | 616 |
66 | 工作 | 611 |
67 | 基金 | 605 |
68 | 到 | 602 |
69 | 而 | 601 |
70 | 投资 | 596 |
71 | 由 | 584 |
72 | 未来 | 573 |
73 | 就业 | 570 |
74 | 中 | 569 |
75 | 推出 | 554 |
76 | 不 | 549 |
77 | 提升 | 548 |
78 | 改善 | 547 |
79 | 研究 | 546 |
80 | 超过 | 541 |
81 | 可 | 528 |
82 | 今年 | 526 |
83 | 人士 | 526 |
84 | 下 | 523 |
85 | 资助 | 523 |
86 | 等 | 521 |
87 | 支援 | 519 |
88 | 有关 | 513 |
89 | 各 | 502 |
90 | 项目 | 487 |
91 | 地 | 486 |
92 | 中心 | 486 |
93 | 较 | 485 |
94 | 向 | 481 |
95 | 已经 | 472 |
96 | 这些 | 467 |
97 | 就 | 465 |
98 | 或 | 464 |
99 | 进一步 | 461 |
100 | 环境 | 454 |
N | Word | Freq. |
---|---|---|
1 | THE | 33,049 |
2 | OF | 16,463 |
3 | TO | 16,456 |
4 | AND | 15,673 |
5 | IN | 11,548 |
6 | # | 7,712 |
7 | A | 7,189 |
8 | FOR | 7,138 |
9 | WILL | 6,307 |
10 | WE | 4,723 |
11 | OUR | 4,097 |
12 | HONG | 3,991 |
13 | KONG | 3,960 |
14 | IS | 3,459 |
15 | WITH | 3,387 |
16 | AS | 3,231 |
17 | I | 3,148 |
18 | ON | 3,079 |
19 | THIS | 3,049 |
20 | THAT | 3,017 |
21 | GOVERNMENT | 2,852 |
22 | HAVE | 2,639 |
23 | YEAR | 2,491 |
24 | BY | 2,407 |
25 | BE | 2,351 |
26 | PER | 2,262 |
27 | CENT | 2,082 |
28 | FROM | 2,041 |
29 | DEVELOPMENT | 1,969 |
30 | S | 1,951 |
31 | HAS | 1,908 |
32 | ARE | 1,906 |
33 | AN | 1,899 |
34 | ECONOMIC | 1,803 |
35 | FINANCIAL | 1,678 |
36 | MORE | 1,601 |
37 | SERVICES | 1,538 |
38 | EXPENDITURE | 1,453 |
39 | ALSO | 1,436 |
40 | THEIR | 1,323 |
41 | AT | 1,305 |
42 | TAX | 1,293 |
43 | DOLLARS | 1,266 |
44 | BILLION | 1,258 |
45 | OVER | 1,238 |
46 | PUBLIC | 1,237 |
47 | YEARS | 1,229 |
48 | MARKET | 1,198 |
49 | NEW | 1,160 |
50 | ECONOMY | 1,132 |
51 | THESE | 1,117 |
52 | IT | 1,100 |
53 | MEASURES | 1,089 |
54 | PEOPLE | 1,064 |
55 | INCREASE | 1,055 |
56 | UP | 1,026 |
57 | LAST | 1,016 |
58 | GROWTH | 1,014 |
59 | PROVIDE | 941 |
60 | OR | 939 |
61 | MAINLAND | 924 |
62 | NOT | 923 |
63 | WHICH | 900 |
64 | BUSINESS | 897 |
65 | ABOUT | 885 |
66 | SUPPORT | 866 |
67 | COMMUNITY | 858 |
68 | REVENUE | 852 |
69 | CAN | 840 |
70 | INDUSTRY | 835 |
71 | BEEN | 819 |
72 | THAN | 778 |
73 | ITS | 765 |
74 | SCHEME | 761 |
75 | INTO | 759 |
76 | CONTINUE | 727 |
77 | FURTHER | 715 |
78 | FISCAL | 686 |
79 | MILLION | 671 |
80 | OTHER | 663 |
81 | SHALL | 661 |
82 | INTERNATIONAL | 659 |
83 | ADDITIONAL | 654 |
84 | RATE | 654 |
85 | TWO | 654 |
86 | ALL | 652 |
87 | SUCH | 636 |
88 | HELP | 626 |
89 | SHOULD | 613 |
90 | INDUSTRIES | 612 |
91 | FUND | 609 |
92 | UNDER | 598 |
93 | TOTAL | 594 |
94 | LAND | 589 |
95 | THROUGH | 572 |
96 | CARE | 562 |
97 | FIRST | 556 |
98 | ENHANCE | 552 |
99 | ENTERPRISES | 548 |
100 | PROJECTS | 545 |
N | Word | Freq. |
---|---|---|
1 | 呢 | 6,805 |
2 | 我 | 6,172 |
3 | 係 | 5,261 |
4 | 嘅 | 4,279 |
5 | 的 | 4,084 |
6 | 個 | 2,986 |
7 | 有 | 2,759 |
8 | 都 | 2,367 |
9 | 你 | 2,366 |
10 | 一 | 2,321 |
11 | 哋 | 2,095 |
12 | 是 | 2,044 |
13 | 就 | 1,757 |
14 | 嗰 | 1,623 |
15 | 唔 | 1,466 |
16 | 可以 | 1,432 |
17 | 咁 | 1,307 |
18 | 會 | 1,285 |
19 | 啲 | 1,146 |
20 | 在 | 1,129 |
21 | 啊 | 1,063 |
22 | 喺 | 1,053 |
23 | 方面 | 1,024 |
24 | 們 | 1,022 |
25 | 啦 | 967 |
26 | 其實 | 962 |
27 | 到 | 956 |
28 | 去 | 911 |
29 | 要 | 893 |
30 | 亦 | 892 |
31 | 即 | 867 |
32 | 所以 | 847 |
33 | 我哋 | 780 |
34 | 做 | 758 |
35 | 好 | 750 |
36 | 不 | 742 |
37 | 但 | 724 |
38 | 措施 | 683 |
39 | 嚟 | 666 |
40 | 政府 | 650 |
41 | 咗 | 646 |
42 | 年 | 645 |
43 | 㗎 | 608 |
44 | # | 602 |
45 | 呃 | 586 |
46 | 如果 | 569 |
47 | 多 | 547 |
48 | 一個 | 535 |
49 | 了 | 530 |
50 | 想 | 521 |
51 | 大家 | 519 |
52 | 因為 | 488 |
53 | 億 | 483 |
54 | 香港 | 478 |
55 | 同埋 | 452 |
56 | 睇 | 434 |
57 | 今年 | 428 |
58 | 人 | 421 |
59 | 喥 | 416 |
60 | 佢 | 404 |
61 | 大 | 395 |
62 | 對 | 393 |
63 | 話 | 386 |
64 | 需要 | 372 |
65 | 市民 | 370 |
66 | 同 | 364 |
67 | 問題 | 363 |
68 | 收入 | 355 |
69 | 或者 | 353 |
70 | 希望 | 352 |
71 | 得 | 346 |
72 | 宜家 | 345 |
73 | 司長 | 341 |
74 | 經濟 | 338 |
75 | 時 | 336 |
76 | 上 | 332 |
77 | 好多 | 329 |
78 | 覺得 | 322 |
79 | 預算案 | 317 |
80 | 一些 | 316 |
81 | 冇 | 315 |
82 | 而 | 311 |
83 | 增加 | 306 |
84 | 樣 | 302 |
85 | 用 | 296 |
86 | 算 | 290 |
87 | 他 | 285 |
88 | CODE | 283 |
89 | SWITCH | 283 |
90 | 很多 | 283 |
91 | 來 | 282 |
92 | 很 | 279 |
93 | 可能 | 273 |
94 | 相信 | 273 |
95 | 這 | 273 |
96 | 後 | 271 |
97 | 另外 | 267 |
98 | 這個 | 262 |
99 | 於 | 260 |
100 | 講 | 259 |
N | Word | Freq. |
---|---|---|
1 | 的 | 8,732 |
2 | 是 | 3,955 |
3 | 我们 | 3,375 |
4 | 我 | 2,224 |
5 | 有 | 2,193 |
6 | 一 | 2,064 |
7 | 在 | 1,832 |
8 | 你 | 1,784 |
9 | 个 | 1,589 |
10 | 会 | 1,570 |
11 | 不 | 1,363 |
12 | 这 | 1,289 |
13 | 呢 | 1,221 |
14 | 了 | 1,128 |
15 | 呃 | 1,088 |
16 | 这个 | 1,062 |
17 | 都 | 991 |
18 | 可以 | 960 |
19 | 方面 | 905 |
20 | 要 | 851 |
21 | 也 | 740 |
22 | 措施 | 703 |
23 | 说 | 695 |
24 | 所以 | 656 |
25 | 做 | 648 |
26 | 啊 | 642 |
27 | 预算案 | 625 |
28 | 经济 | 601 |
29 | 就 | 583 |
30 | 一些 | 525 |
31 | 很 | 523 |
32 | 政府 | 514 |
33 | 问题 | 506 |
34 | 其实 | 504 |
35 | 他们 | 488 |
36 | 很多 | 460 |
37 | 和 | 445 |
38 | 如果 | 442 |
39 | 多 | 425 |
40 | 因为 | 420 |
41 | 没有 | 406 |
42 | 香港 | 393 |
43 | 对 | 383 |
44 | 到 | 379 |
45 | 想 | 379 |
46 | # | 374 |
47 | 大 | 374 |
48 | 就是 | 372 |
49 | 亦 | 372 |
50 | 司长 | 371 |
51 | 人 | 367 |
52 | 今年 | 360 |
53 | 现在 | 357 |
54 | 但 | 350 |
55 | 去 | 350 |
56 | 财政 | 346 |
57 | 年 | 346 |
58 | 大家 | 342 |
59 | 市民 | 336 |
60 | 好 | 328 |
61 | 觉得 | 323 |
62 | 希望 | 321 |
63 | 增加 | 320 |
64 | 那 | 318 |
65 | 元 | 312 |
66 | 但是 | 308 |
67 | 已经 | 306 |
68 | 开支 | 301 |
69 | 需要 | 297 |
70 | 比较 | 286 |
71 | 收入 | 285 |
72 | 什么 | 282 |
73 | 是否 | 269 |
74 | 未来 | 257 |
75 | 次 | 256 |
76 | 这些 | 247 |
77 | 上 | 245 |
78 | 还有 | 244 |
79 | 看 | 242 |
80 | 情况 | 240 |
81 | 这样 | 240 |
82 | 时候 | 239 |
83 | 能 | 236 |
84 | 工作 | 234 |
85 | 用 | 222 |
86 | 刚才 | 220 |
87 | 来 | 220 |
88 | 嗯 | 219 |
89 | 那么 | 217 |
90 | 推出 | 217 |
91 | 没 | 207 |
92 | 过 | 206 |
93 | 还 | 205 |
94 | 再 | 198 |
95 | 可能 | 197 |
96 | 钱 | 197 |
97 | 去年 | 195 |
98 | 社会 | 189 |
99 | 相当 | 188 |
100 | 给 | 187 |
N | Word | Freq. |
---|---|---|
1 | THE | 6,593 |
2 | TO | 3,263 |
3 | UM | 3,032 |
4 | AND | 2,606 |
5 | WE | 2,277 |
6 | OF | 2,227 |
7 | THAT | 2,167 |
8 | IN | 2,108 |
9 | A | 1,841 |
10 | YOU | 1,825 |
11 | I | 1,719 |
12 | IS | 1,613 |
13 | HAVE | 1,525 |
14 | ARE | 1,155 |
15 | FOR | 1,143 |
16 | IT | 1,137 |
17 | BE | 1,086 |
18 | THIS | 1,076 |
19 | SO | 1,041 |
20 | WILL | 962 |
21 | NOT | 815 |
22 | ER | 782 |
23 | S | 675 |
24 | DO | 645 |
25 | OUR | 644 |
26 | AS | 611 |
27 | BUDGET | 599 |
28 | ON | 581 |
29 | WOULD | 542 |
30 | TAX | 540 |
31 | THERE | 536 |
32 | YEAR | 512 |
33 | BUT | 476 |
34 | # | 475 |
35 | NOW | 444 |
36 | CAN | 435 |
37 | ABOUT | 430 |
38 | AT | 425 |
39 | WITH | 422 |
40 | N | 411 |
41 | T | 411 |
42 | THINK | 401 |
43 | WELL | 393 |
44 | FROM | 392 |
45 | MEASURES | 389 |
46 | PEOPLE | 386 |
47 | DOLLARS | 380 |
48 | BILLION | 377 |
49 | WHAT | 376 |
50 | BY | 370 |
51 | THEY | 367 |
52 | ALSO | 362 |
53 | IF | 348 |
54 | OR | 342 |
55 | MORE | 332 |
56 | HONG | 324 |
57 | KONG | 323 |
58 | THEN | 323 |
59 | VERY | 320 |
60 | FINANCIAL | 310 |
61 | TIME | 304 |
62 | BECAUSE | 300 |
63 | SOME | 293 |
64 | EXPENDITURE | 286 |
65 | HAS | 283 |
66 | GOVERNMENT | 276 |
67 | SAID | 263 |
68 | YOUR | 261 |
69 | ALL | 256 |
70 | ONE | 249 |
71 | WAS | 249 |
72 | PERCENT | 243 |
73 | OUT | 242 |
74 | AN | 234 |
75 | YEARS | 229 |
76 | QUESTION | 226 |
77 | UP | 226 |
78 | THESE | 225 |
79 | ECONOMIC | 221 |
80 | WHEN | 217 |
81 | ECONOMY | 210 |
82 | HOW | 208 |
83 | SECRETARY | 206 |
84 | NEED | 202 |
85 | MY | 201 |
86 | JUST | 199 |
87 | M | 196 |
88 | SHOULD | 195 |
89 | LIKE | 188 |
90 | LAST | 187 |
91 | REVENUE | 185 |
92 | GOING | 184 |
93 | PUBLIC | 184 |
94 | WHY | 183 |
95 | BEEN | 181 |
96 | ANY | 176 |
97 | FIRST | 176 |
98 | SAY | 176 |
99 | MANY | 172 |
100 | US | 171 |
N | Word | Freq. |
---|---|---|
1 | 的 | 12,110 |
2 | 和 | 9,571 |
3 | 发展 | 4,533 |
4 | 经济 | 3,040 |
5 | 建设 | 3,014 |
6 | 要 | 2,812 |
7 | 社会 | 2,480 |
8 | # | 2,435 |
9 | 加强 | 2,425 |
10 | 改革 | 2,382 |
11 | 是 | 2,010 |
12 | 推进 | 1,887 |
13 | 企业 | 1,766 |
14 | 工作 | 1,700 |
15 | 等 | 1,692 |
16 | 在 | 1,622 |
17 | 各 | 1,507 |
18 | 提高 | 1,460 |
19 | 政策 | 1,440 |
20 | 制度 | 1,379 |
21 | 农村 | 1,349 |
22 | 加快 | 1,331 |
23 | 政府 | 1,322 |
24 | 一 | 1,308 |
25 | 新 | 1,304 |
26 | 了 | 1,294 |
27 | 继续 | 1,219 |
28 | 对 | 1,125 |
29 | 教育 | 1,116 |
30 | 市场 | 1,115 |
31 | 实施 | 1,108 |
32 | 我们 | 1,108 |
33 | 人民 | 1,102 |
34 | 促进 | 1,097 |
35 | 积极 | 1,087 |
36 | 完善 | 1,082 |
37 | 坚持 | 1,072 |
38 | 增长 | 1,072 |
39 | 管理 | 1,047 |
40 | 国家 | 1,046 |
41 | 全面 | 1,008 |
42 | 财政 | 980 |
43 | 增加 | 957 |
44 | 元 | 944 |
45 | 支持 | 934 |
46 | 扩大 | 932 |
47 | 服务 | 930 |
48 | 重点 | 916 |
49 | 为 | 914 |
50 | 基本 | 901 |
51 | 投资 | 892 |
52 | 国 | 863 |
53 | 保障 | 861 |
54 | 体制 | 857 |
55 | 结构 | 832 |
56 | 问题 | 831 |
57 | 与 | 829 |
58 | 生产 | 825 |
59 | 地区 | 817 |
60 | 产业 | 796 |
61 | 文化 | 792 |
62 | 稳定 | 791 |
63 | 就业 | 789 |
64 | 农业 | 787 |
65 | 创新 | 784 |
66 | 中 | 761 |
67 | 体系 | 739 |
68 | 金融 | 736 |
69 | 基础 | 732 |
70 | 技术 | 729 |
71 | 国际 | 726 |
72 | 群众 | 715 |
73 | 进一步 | 711 |
74 | 以 | 705 |
75 | 安全 | 697 |
76 | 中央 | 687 |
77 | 深化 | 686 |
78 | 机制 | 684 |
79 | 三 | 672 |
80 | 个 | 666 |
81 | 环境 | 663 |
82 | 今年 | 661 |
83 | 收入 | 653 |
84 | 事业 | 647 |
85 | 水平 | 646 |
86 | 重大 | 646 |
87 | 调整 | 641 |
88 | 不 | 638 |
89 | 地 | 634 |
90 | 大力 | 632 |
91 | 保护 | 631 |
92 | 有 | 617 |
93 | 科技 | 615 |
94 | 重要 | 613 |
95 | 建立 | 602 |
96 | 上 | 599 |
97 | 实现 | 597 |
98 | 能力 | 593 |
99 | 合作 | 590 |
100 | 推动 | 590 |
N | Word | Freq. |
---|---|---|
1 | AND | 31,201 |
2 | THE | 26,505 |
3 | OF | 17,418 |
4 | TO | 13,787 |
5 | WE | 11,901 |
6 | IN | 9,742 |
7 | WILL | 8,065 |
8 | FOR | 5,766 |
9 | A | 4,873 |
10 | DEVELOPMENT | 3,662 |
11 | ON | 2,878 |
12 | # | 2,747 |
13 | WITH | 2,647 |
14 | PEOPLE | 2,164 |
15 | S | 2,146 |
16 | THAT | 2,144 |
17 | SYSTEM | 2,133 |
18 | ECONOMIC | 2,060 |
19 | GOVERNMENT | 2,058 |
20 | IMPROVE | 2,024 |
21 | OUR | 1,982 |
22 | BE | 1,944 |
23 | WORK | 1,901 |
24 | REFORM | 1,890 |
25 | ALL | 1,856 |
26 | RURAL | 1,852 |
27 | SHOULD | 1,811 |
28 | IS | 1,721 |
29 | AS | 1,678 |
30 | BY | 1,635 |
31 | AREAS | 1,605 |
32 | CHINA | 1,599 |
33 | MORE | 1,572 |
34 | THEIR | 1,568 |
35 | ARE | 1,465 |
36 | NEW | 1,461 |
37 | SOCIAL | 1,418 |
38 | INCREASE | 1,275 |
39 | THIS | 1,272 |
40 | YEAR | 1,236 |
41 | ENTERPRISES | 1,218 |
42 | MUST | 1,172 |
43 | UP | 1,143 |
44 | FROM | 1,141 |
45 | AN | 1,139 |
46 | WAS | 1,116 |
47 | HAVE | 1,110 |
48 | PUBLIC | 1,096 |
49 | PROMOTE | 1,077 |
50 | STRENGTHEN | 1,052 |
51 | MAKE | 1,050 |
52 | NEED | 1,036 |
53 | AT | 1,023 |
54 | CONTINUE | 1,022 |
55 | URBAN | 991 |
56 | DEVELOP | 960 |
57 | MADE | 934 |
58 | CENTRAL | 919 |
59 | SUPPORT | 919 |
60 | ENSURE | 899 |
61 | EDUCATION | 896 |
62 | MAJOR | 893 |
63 | INVESTMENT | 871 |
64 | MARKET | 871 |
65 | EFFORTS | 858 |
66 | GROWTH | 851 |
67 | WERE | 830 |
68 | OUT | 829 |
69 | SERVICES | 811 |
70 | YUAN | 811 |
71 | NATIONAL | 784 |
72 | IMPLEMENT | 778 |
73 | FINANCIAL | 772 |
74 | BASIC | 767 |
75 | POLICY | 767 |
76 | PROJECTS | 756 |
77 | LAW | 755 |
78 | CHINESE | 744 |
79 | OTHER | 739 |
80 | POLICIES | 731 |
81 | INDUSTRIES | 730 |
82 | ECONOMY | 726 |
83 | PROGRESS | 678 |
84 | OVER | 675 |
85 | PRODUCTION | 669 |
86 | MANAGEMENT | 654 |
87 | IT | 652 |
88 | OR | 596 |
89 | STATE | 595 |
90 | MEASURES | 584 |
91 | CULTURAL | 572 |
92 | ENERGY | 572 |
93 | AGRICULTURAL | 569 |
94 | ENCOURAGE | 567 |
95 | KEY | 567 |
96 | USE | 558 |
97 | PROBLEMS | 556 |
98 | COUNTRY | 554 |
99 | ITS | 553 |
100 | HAS | 549 |
N | Word | Freq. |
---|---|---|
1 | 的 | 14,441 |
2 | 是 | 3,956 |
3 | 我 | 3,211 |
4 | 我们 | 2,972 |
5 | 在 | 2,532 |
6 | 中国 | 2,357 |
7 | 一 | 2,290 |
8 | 了 | 2,253 |
9 | 和 | 2,187 |
10 | 有 | 1,826 |
11 | 个 | 1,795 |
12 | 要 | 1,622 |
13 | 问题 | 1,619 |
14 | 不 | 1,590 |
15 | 这 | 1,504 |
16 | 啊 | 1,465 |
17 | 也 | 1,435 |
18 | 经济 | 1,163 |
19 | 会 | 1,117 |
20 | 发展 | 1,034 |
21 | 政府 | 1,030 |
22 | 两 | 985 |
23 | 您 | 965 |
24 | 对 | 928 |
25 | 这个 | 900 |
26 | # | 835 |
27 | 谢谢 | 822 |
28 | 就 | 815 |
29 | 总理 | 780 |
30 | 中 | 779 |
31 | 改革 | 748 |
32 | 还 | 684 |
33 | 都 | 664 |
34 | 说 | 647 |
35 | 你 | 643 |
36 | 想 | 643 |
37 | 关系 | 627 |
38 | 但是 | 620 |
39 | 国家 | 612 |
40 | 香港 | 611 |
41 | 人民 | 597 |
42 | 记者 | 580 |
43 | 来 | 565 |
44 | 大 | 551 |
45 | 现在 | 546 |
46 | 已经 | 544 |
47 | ER | 539 |
48 | 上 | 535 |
49 | 到 | 526 |
50 | 地 | 524 |
51 | 很 | 504 |
52 | 可以 | 502 |
53 | 台湾 | 481 |
54 | 国 | 469 |
55 | 就是 | 469 |
56 | 金融 | 441 |
57 | 能 | 441 |
58 | 社会 | 430 |
59 | 呢 | 424 |
60 | 企业 | 423 |
61 | RAISE | 411 |
62 | VOICE | 411 |
63 | 什么 | 403 |
64 | 市场 | 401 |
65 | 人 | 399 |
66 | 工作 | 390 |
67 | 今年 | 380 |
68 | 一些 | 379 |
69 | 没有 | 376 |
70 | 多 | 363 |
71 | 将 | 358 |
72 | 更 | 357 |
73 | 好 | 357 |
74 | 政策 | 351 |
75 | 世界 | 350 |
76 | 进行 | 346 |
77 | 把 | 343 |
78 | 新 | 343 |
79 | 认为 | 340 |
80 | 解决 | 337 |
81 | 方面 | 335 |
82 | 大家 | 333 |
83 | 美国 | 332 |
84 | 能够 | 326 |
85 | 去年 | 326 |
86 | 岸 | 322 |
87 | 使 | 322 |
88 | 国际 | 321 |
89 | 而且 | 320 |
90 | 种 | 320 |
91 | 让 | 314 |
92 | 最 | 314 |
93 | 从 | 309 |
94 | 推进 | 309 |
95 | 合作 | 307 |
96 | 次 | 303 |
97 | 希望 | 297 |
98 | 中央 | 290 |
99 | 还是 | 289 |
100 | 他们 | 282 |
N | Word | Freq. |
---|---|---|
1 | THE | 17,052 |
2 | AND | 9,107 |
3 | TO | 8,063 |
4 | OF | 7,848 |
5 | IN | 5,623 |
6 | WE | 4,036 |
7 | THAT | 3,795 |
8 | A | 3,774 |
9 | IS | 3,401 |
10 | I | 3,142 |
11 | WILL | 2,669 |
12 | CHINA | 2,489 |
13 | HAVE | 2,368 |
14 | THIS | 2,346 |
15 | FOR | 2,204 |
16 | YOU | 1,884 |
17 | S | 1,859 |
18 | ON | 1,716 |
19 | BE | 1,633 |
20 | ARE | 1,584 |
21 | PEOPLE | 1,563 |
22 | WITH | 1,473 |
23 | AS | 1,458 |
24 | GOVERNMENT | 1,423 |
25 | OUR | 1,413 |
26 | HAS | 1,376 |
27 | IT | 1,373 |
28 | ER | 1,251 |
29 | ALSO | 1,088 |
30 | NOT | 1,051 |
31 | YEAR | 925 |
32 | WHAT | 896 |
33 | SO | 889 |
34 | # | 876 |
35 | BY | 847 |
36 | CHINESE | 829 |
37 | CAN | 818 |
38 | AT | 814 |
39 | DEVELOPMENT | 813 |
40 | THERE | 810 |
41 | ALL | 803 |
42 | FROM | 795 |
43 | KONG | 757 |
44 | HONG | 750 |
45 | REFORM | 740 |
46 | TWO | 728 |
47 | BUT | 727 |
48 | DO | 726 |
49 | TAIWAN | 723 |
50 | ECONOMIC | 717 |
51 | ABOUT | 675 |
52 | BEEN | 663 |
53 | ONE | 663 |
54 | WOULD | 635 |
55 | MORE | 619 |
56 | MY | 618 |
57 | SOME | 613 |
58 | THEIR | 589 |
59 | VERY | 577 |
60 | NEED | 573 |
61 | YOUR | 572 |
62 | AN | 560 |
63 | BETWEEN | 558 |
64 | QUESTION | 557 |
65 | THEY | 545 |
66 | COUNTRY | 521 |
67 | YEARS | 518 |
68 | TIME | 515 |
69 | US | 511 |
70 | WORK | 510 |
71 | COUNTRIES | 509 |
72 | SHOULD | 506 |
73 | OR | 502 |
74 | THESE | 497 |
75 | ECONOMY | 481 |
76 | ITS | 475 |
77 | GROWTH | 463 |
78 | WAS | 452 |
79 | MAKE | 433 |
80 | LIKE | 432 |
81 | NEW | 427 |
82 | NOW | 426 |
83 | MARKET | 424 |
84 | LAST | 417 |
85 | FINANCIAL | 412 |
86 | MUST | 412 |
87 | SUCH | 399 |
88 | TAKE | 393 |
89 | UP | 388 |
90 | THINK | 385 |
91 | PREMIER | 384 |
92 | BELIEVE | 366 |
93 | WORLD | 363 |
94 | NIL | 362 |
95 | THANK | 358 |
96 | ME | 357 |
97 | IF | 355 |
98 | SYSTEM | 354 |
99 | OTHER | 353 |
100 | STILL | 351 |
Table 8. The 100 Most Frequent Words in the Subset of US State of the Union Addresses
N | Word | Freq. |
---|---|---|
1 | THE | 12,441 |
2 | AND | 10,176 |
3 | TO | 9,625 |
4 | OF | 7,142 |
5 | WE | 5,903 |
6 | A | 5,136 |
7 | IN | 4,832 |
8 | OUR | 4,717 |
9 | THAT | 4,337 |
10 | FOR | 2,873 |
11 | I | 2,692 |
12 | IS | 2,679 |
13 | WILL | 2,293 |
14 | S | 2,159 |
15 | THIS | 2,077 |
16 | IT | 1,965 |
17 | HAVE | 1,875 |
18 | APPLAUSE | 1,861 |
19 | ARE | 1,727 |
20 | ON | 1,658 |
21 | WITH | 1,627 |
22 | YOU | 1,584 |
23 | INTERRUPTION | 1,410 |
24 | AMERICA | 1,392 |
25 | MORE | 1,355 |
26 | CHEERS | 1,339 |
27 | NOT | 1,332 |
28 | THEIR | 1,264 |
29 | THEY | 1,202 |
30 | ALL | 1,190 |
31 | BY | 1,187 |
32 | BE | 1,144 |
33 | AS | 1,130 |
34 | CAN | 1,117 |
35 | BUT | 1,102 |
36 | FROM | 1,038 |
37 | PEOPLE | 1,034 |
38 | NEW | 1,031 |
39 | DO | 1,015 |
40 | # | 990 |
41 | WHO | 939 |
42 | MUST | 920 |
43 | SO | 919 |
44 | US | 885 |
45 | OR | 852 |
46 | NOW | 850 |
47 | HAS | 832 |
48 | AMERICAN | 786 |
49 | AT | 760 |
50 | YEARS | 753 |
51 | EVERY | 739 |
52 | WORLD | 736 |
53 | AMERICANS | 725 |
54 | N | 709 |
55 | T | 697 |
56 | YEAR | 682 |
57 | MAKE | 667 |
58 | WORK | 665 |
59 | AN | 651 |
60 | ONE | 635 |
61 | THAN | 635 |
62 | THEM | 623 |
63 | THESE | 622 |
64 | SHOULD | 607 |
65 | HELP | 592 |
66 | CONGRESS | 583 |
67 | COUNTRY | 575 |
68 | TONIGHT | 572 |
69 | VE | 563 |
70 | WHAT | 560 |
71 | WHEN | 555 |
72 | IF | 529 |
73 | NATION | 505 |
74 | JOBS | 499 |
75 | MY | 496 |
76 | NO | 492 |
77 | TIME | 490 |
78 | LET | 486 |
79 | KNOW | 485 |
80 | BECAUSE | 481 |
81 | NEED | 478 |
82 | CHILDREN | 470 |
83 | SECURITY | 470 |
84 | ECONOMY | 463 |
85 | ALSO | 457 |
86 | RE | 453 |
87 | UP | 444 |
88 | LAST | 438 |
89 | WAS | 426 |
90 | JUST | 417 |
91 | TAX | 413 |
92 | THERE | 411 |
93 | GOVERNMENT | 398 |
94 | THOSE | 398 |
95 | LIKE | 388 |
96 | FIRST | 386 |
97 | HEALTH | 386 |
98 | OVER | 382 |
99 | CARE | 378 |
100 | ASK | 376 |
Table 9. The 100 Most Frequent Words in the Subset of Press Conferences of US State of the Union Addresses
N | Word | Freq. |
---|---|---|
1 | THE | 15,801 |
2 | TO | 9,141 |
3 | THAT | 9,102 |
4 | AND | 7,122 |
5 | OF | 6,288 |
6 | A | 4,740 |
7 | I | 4,619 |
8 | IN | 4,610 |
9 | IS | 3,838 |
10 | S | 3,683 |
11 | ER | 3,654 |
12 | WE | 3,543 |
13 | IT | 3,290 |
14 | YOU | 3,145 |
15 | PRESIDENT | 2,926 |
16 | ON | 2,601 |
17 | THIS | 2,325 |
18 | HAVE | 2,169 |
19 | HE | 1,956 |
20 | FOR | 1,913 |
21 | ARE | 1,653 |
22 | WITH | 1,647 |
23 | BE | 1,626 |
24 | ABOUT | 1,604 |
25 | THINK | 1,596 |
26 | DO | 1,551 |
27 | AS | 1,498 |
28 | WHAT | 1,495 |
29 | NOT | 1,467 |
30 | N | 1,436 |
31 | WILL | 1,427 |
32 | T | 1,422 |
33 | BUT | 1,388 |
34 | THERE | 1,331 |
35 | THEY | 1,310 |
36 | WAS | 1,227 |
37 | HAS | 1,190 |
38 | SO | 1,107 |
39 | WOULD | 1,060 |
40 | AT | 991 |
41 | GOING | 987 |
42 | OUR | 981 |
43 | OR | 968 |
44 | CAN | 952 |
45 | RE | 944 |
46 | WELL | 883 |
47 | AN | 866 |
48 | HIS | 774 |
49 | PEOPLE | 768 |
50 | IF | 721 |
51 | KNOW | 711 |
52 | UM | 699 |
53 | ONE | 677 |
54 | SOME | 676 |
55 | BEEN | 673 |
56 | JUST | 652 |
57 | FROM | 646 |
58 | THEIR | 634 |
59 | OUT | 622 |
60 | ALL | 620 |
61 | MORE | 619 |
62 | BY | 611 |
63 | THOSE | 601 |
64 | SAID | 591 |
65 | ANY | 541 |
66 | THESE | 535 |
67 | NO | 528 |
68 | WHO | 505 |
69 | WHEN | 489 |
70 | GET | 487 |
71 | VE | 487 |
72 | HOW | 481 |
73 | MAKE | 478 |
74 | BECAUSE | 473 |
75 | M | 469 |
76 | DOES | 468 |
77 | WERE | 467 |
78 | VERY | 460 |
79 | ALSO | 449 |
80 | AGAIN | 443 |
81 | OTHER | 441 |
82 | # | 440 |
83 | UP | 440 |
84 | NOW | 436 |
85 | HOUSE | 423 |
86 | HAD | 422 |
87 | LL | 409 |
88 | LIKE | 406 |
89 | SAY | 399 |
90 | QUESTION | 389 |
91 | WAY | 385 |
92 | DID | 380 |
93 | LAST | 380 |
94 | STATES | 378 |
95 | GO | 377 |
96 | TAKE | 371 |
97 | IMPORTANT | 370 |
98 | FORWARD | 362 |
99 | CONGRESS | 358 |
100 | THEM | 358 |
Table 10. The 100 Most Frequent Words in the Subset of US Budget Speeches
N | Word | Freq. |
---|---|---|
1 | THE | 3,454 |
2 | TO | 2,996 |
3 | AND | 2,254 |
4 | THAT | 1,703 |
5 | WE | 1,686 |
6 | OF | 1,669 |
7 | A | 1,418 |
8 | IN | 1,313 |
9 | I | 1,101 |
10 | IT | 962 |
11 | IS | 881 |
12 | OUR | 881 |
13 | S | 867 |
14 | FOR | 808 |
15 | YOU | 715 |
16 | THIS | 535 |
17 | BUDGET | 520 |
18 | ARE | 493 |
19 | HAVE | 491 |
20 | ON | 464 |
21 | DO | 434 |
22 | AS | 405 |
23 | RE | 399 |
24 | BE | 380 |
25 | WILL | 368 |
26 | WITH | 354 |
27 | THEY | 342 |
28 | NOT | 325 |
29 | N | 322 |
30 | T | 322 |
31 | CAN | 319 |
32 | BUT | 306 |
33 | SO | 304 |
34 | # | 303 |
35 | VE | 298 |
36 | BY | 297 |
37 | MORE | 292 |
38 | PEOPLE | 288 |
39 | MAKE | 286 |
40 | ABOUT | 280 |
41 | WHAT | 266 |
42 | GOING | 253 |
43 | AMERICA | 245 |
44 | IF | 240 |
45 | AT | 238 |
46 | ALL | 228 |
47 | THEIR | 228 |
48 | WHO | 218 |
49 | THERE | 217 |
50 | US | 205 |
51 | NEW | 200 |
52 | NOW | 198 |
53 | ECONOMY | 194 |
54 | WHEN | 190 |
55 | AN | 188 |
56 | FROM | 188 |
57 | ONE | 185 |
58 | APPLAUSE | 182 |
59 | WANT | 182 |
60 | OR | 173 |
61 | SECURITY | 173 |
62 | NEED | 172 |
63 | TAX | 172 |
64 | CONGRESS | 170 |
65 | GOT | 170 |
66 | YEARS | 168 |
67 | HERE | 166 |
68 | ALSO | 162 |
69 | MY | 162 |
70 | UP | 162 |
71 | HAS | 160 |
72 | M | 155 |
73 | THANK | 153 |
74 | SURE | 152 |
75 | THEM | 152 |
76 | YOUR | 150 |
77 | GET | 144 |
78 | ER | 143 |
79 | AMERICAN | 142 |
80 | BEEN | 142 |
81 | SOME | 141 |
82 | WORK | 140 |
83 | JOBS | 138 |
84 | SPENDING | 138 |
85 | WHY | 137 |
86 | THESE | 136 |
87 | MONEY | 134 |
88 | COUNTRY | 132 |
89 | KEEP | 132 |
90 | WAY | 132 |
91 | KNOW | 131 |
92 | JUST | 130 |
93 | LIKE | 130 |
94 | WELL | 130 |
95 | OUT | 129 |
96 | BECAUSE | 127 |
97 | GOVERNMENT | 126 |
98 | WAS | 126 |
99 | TIME | 124 |
100 | WHICH | 122 |
Table 11. The 100 Most Frequent Words in the Subset of Press Conferences of US Budget Speeches
N | Word | Freq. |
---|---|---|
1 | THE | 19,885 |
2 | THAT | 10,029 |
3 | TO | 9,722 |
4 | AND | 8,374 |
5 | OF | 8,277 |
6 | IN | 7,435 |
7 | WE | 6,505 |
8 | A | 6,276 |
9 | IS | 5,065 |
10 | YOU | 4,049 |
11 | IT | 4,019 |
12 | S | 3,732 |
13 | I | 3,582 |
14 | FOR | 3,319 |
15 | ON | 2,868 |
16 | THIS | 2,690 |
17 | ARE | 2,628 |
18 | ER | 2,623 |
19 | BUDGET | 2,518 |
20 | # | 2,334 |
21 | BE | 2,297 |
22 | HAVE | 2,277 |
23 | AS | 2,074 |
24 | SO | 1,850 |
25 | DO | 1,749 |
26 | OUR | 1,748 |
27 | WHAT | 1,736 |
28 | WITH | 1,691 |
29 | THERE | 1,672 |
30 | NOT | 1,610 |
31 | BUT | 1,563 |
32 | WILL | 1,475 |
33 | RE | 1,424 |
34 | ABOUT | 1,423 |
35 | PRESIDENT | 1,417 |
36 | THINK | 1,415 |
37 | AT | 1,408 |
38 | TAX | 1,359 |
39 | WOULD | 1,326 |
40 | YEAR | 1,316 |
41 | PERCENT | 1,253 |
42 | T | 1,229 |
43 | N | 1,215 |
44 | WAS | 1,104 |
45 | IF | 1,088 |
46 | UM | 1,083 |
47 | HAS | 1,045 |
48 | FROM | 1,024 |
49 | THEY | 1,024 |
50 | OR | 1,013 |
51 | BY | 1,006 |
52 | MORE | 992 |
53 | CAN | 973 |
54 | WHICH | 894 |
55 | SPENDING | 889 |
56 | THOSE | 889 |
57 | JUST | 881 |
58 | ONE | 870 |
59 | OVER | 858 |
60 | YEARS | 843 |
61 | AN | 834 |
62 | SOME | 830 |
63 | DEFICIT | 829 |
64 | ALL | 793 |
65 | VE | 782 |
66 | VERY | 777 |
67 | GROWTH | 773 |
68 | GOING | 765 |
69 | BEEN | 715 |
70 | DOLLARS | 709 |
71 | ALSO | 695 |
72 | OUT | 662 |
73 | LAST | 660 |
74 | WELL | 654 |
75 | ECONOMY | 643 |
76 | HOW | 611 |
77 | GET | 604 |
78 | THAN | 596 |
79 | BILLION | 592 |
80 | KNOW | 590 |
81 | NOW | 577 |
82 | UP | 577 |
83 | WHEN | 577 |
84 | PEOPLE | 575 |
85 | CONGRESS | 567 |
86 | BECAUSE | 564 |
87 | SECURITY | 557 |
88 | THESE | 554 |
89 | WERE | 540 |
90 | HE | 521 |
91 | WHO | 517 |
92 | OTHER | 512 |
93 | PROGRAMS | 506 |
94 | TIME | 504 |
95 | DOES | 502 |
96 | MAKE | 495 |
97 | FIRST | 490 |
98 | WHERE | 476 |
99 | ECONOMIC | 470 |
100 | SEE | 451 |
Table 12. The 100 Most Frequent Words in the Subset of UK State Opening Addresses of Parliament
N | Word | Freq. |
---|---|---|
1 | THE | 1,964 |
2 | TO | 1,865 |
3 | AND | 1,393 |
4 | WILL | 1,368 |
5 | OF | 1,197 |
6 | MY | 776 |
7 | GOVERNMENT | 704 |
8 | A | 628 |
9 | BE | 595 |
10 | IN | 536 |
11 | FOR | 454 |
12 | LEGISLATION | 281 |
13 | FORWARD | 280 |
14 | BILL | 231 |
15 | ON | 228 |
16 | CONTINUE | 221 |
17 | WORK | 214 |
18 | INTRODUCED | 210 |
19 | S | 173 |
20 | WITH | 170 |
21 | THAT | 169 |
22 | REFORM | 166 |
23 | HOUSE | 150 |
24 | PUBLIC | 147 |
25 | NEW | 142 |
26 | MEMBERS | 133 |
27 | BROUGHT | 127 |
28 | COMMONS | 127 |
29 | MEASURES | 121 |
30 | PEOPLE | 121 |
31 | ENSURE | 117 |
32 | UNITED | 113 |
33 | IMPROVE | 111 |
34 | ALSO | 107 |
35 | NATIONAL | 106 |
36 | BY | 104 |
37 | MORE | 104 |
38 | LORDS | 100 |
39 | SERVICES | 100 |
40 | SUPPORT | 100 |
41 | IS | 99 |
42 | SECURITY | 91 |
43 | INTRODUCE | 89 |
44 | SYSTEM | 89 |
45 | THEY | 82 |
46 | KINGDOM | 81 |
47 | ECONOMIC | 79 |
48 | THEIR | 79 |
49 | BRING | 72 |
50 | HELP | 71 |
51 | HEALTH | 70 |
52 | IT | 70 |
53 | PROVIDE | 69 |
54 | ITS | 68 |
55 | REDUCE | 68 |
56 | PROMOTE | 67 |
57 | # | 64 |
58 | BEFORE | 64 |
59 | FROM | 64 |
60 | INCLUDING | 63 |
61 | OUR | 63 |
62 | ARE | 61 |
63 | EUROPEAN | 61 |
64 | I | 61 |
65 | SERVICE | 61 |
66 | LAID | 60 |
67 | YOU | 60 |
68 | POWERS | 59 |
69 | THIS | 59 |
70 | AN | 58 |
71 | AS | 58 |
72 | INTERNATIONAL | 57 |
73 | STATE | 57 |
74 | TACKLE | 57 |
75 | AT | 56 |
76 | MAKE | 54 |
77 | MINISTERS | 54 |
78 | UNION | 54 |
79 | CRIME | 53 |
80 | ALL | 52 |
81 | DRAFT | 52 |
82 | FURTHER | 52 |
83 | OTHER | 52 |
84 | ECONOMY | 51 |
85 | LOOK | 51 |
86 | PROPOSALS | 51 |
87 | STRENGTHEN | 51 |
88 | TAKE | 51 |
89 | COMMITTED | 50 |
90 | LAW | 50 |
91 | SECURE | 50 |
92 | CREATE | 47 |
93 | WALES | 45 |
94 | CHILDREN | 44 |
95 | EDUCATION | 44 |
96 | VISIT | 44 |
97 | DEVELOPMENT | 42 |
98 | FINANCIAL | 42 |
99 | GREATER | 41 |
100 | PUBLISHED | 41 |
Table 13. The 100 Most Frequent Words in the Subset of Debates on UK State Opening Addresses of Parliament
N | Word | Freq. |
---|---|---|
1 | THE | 3,332 |
2 | TO | 1,754 |
3 | OF | 1,660 |
4 | AND | 1,468 |
5 | IN | 1,124 |
6 | I | 1,095 |
7 | A | 1,043 |
8 | THAT | 1,010 |
9 | IS | 668 |
10 | FOR | 594 |
11 | IT | 483 |
12 | WE | 481 |
13 | MY | 473 |
14 | WAS | 427 |
15 | AS | 423 |
16 | ON | 410 |
17 | BE | 409 |
18 | THIS | 396 |
19 | HAVE | 374 |
20 | OUR | 340 |
21 | NOT | 333 |
22 | ARE | 317 |
23 | WITH | 294 |
24 | BUT | 284 |
25 | HE | 277 |
26 | S | 266 |
27 | # | 246 |
28 | WILL | 246 |
29 | HAS | 242 |
30 | HOUSE | 240 |
31 | ALL | 218 |
32 | WHO | 214 |
33 | SPEECH | 205 |
34 | INTERRUPTION | 201 |
35 | WHICH | 200 |
36 | AT | 198 |
37 | BY | 198 |
38 | YOUR | 198 |
39 | AN | 195 |
40 | ONE | 188 |
41 | GRACIOUS | 185 |
42 | MAJESTY | 185 |
43 | MOST | 183 |
44 | THEY | 175 |
45 | FROM | 165 |
46 | HIS | 165 |
47 | SO | 164 |
48 | HEAR | 161 |
49 | THERE | 161 |
50 | CAN | 157 |
51 | MORE | 155 |
52 | PARLIAMENT | 149 |
53 | HER | 144 |
54 | BEEN | 143 |
55 | WHEN | 141 |
56 | PEOPLE | 140 |
57 | DO | 139 |
58 | WHAT | 139 |
59 | ME | 135 |
60 | GOVERNMENT | 132 |
61 | YOU | 132 |
62 | AM | 125 |
63 | LORDS | 125 |
64 | THEIR | 125 |
65 | HAD | 117 |
66 | NOBLE | 115 |
67 | YEARS | 115 |
68 | FIRST | 113 |
69 | ADDRESS | 112 |
70 | ABOUT | 109 |
71 | US | 108 |
72 | THOSE | 105 |
73 | NOW | 104 |
74 | OR | 104 |
75 | GREAT | 100 |
76 | FRIEND | 99 |
77 | MAY | 99 |
78 | TIME | 99 |
79 | WOULD | 99 |
80 | ONLY | 95 |
81 | LAUGH | 94 |
82 | NO | 93 |
83 | LORD | 91 |
84 | BOTH | 90 |
85 | SHOULD | 88 |
86 | KNOW | 87 |
87 | SHE | 87 |
88 | SAY | 85 |
89 | MR | 83 |
90 | BEG | 80 |
91 | MANY | 79 |
92 | RIGHT | 79 |
93 | IF | 78 |
94 | LOYAL | 78 |
95 | OTHER | 78 |
96 | VERY | 76 |
97 | MUST | 75 |
98 | THEM | 75 |
99 | LIKE | 74 |
100 | MEMBER | 74 |
Table 14. The 100 Most Frequent Words in the Subset of UK Budget Speeches
N | Word | Freq. |
---|---|---|
1 | THE | 27,247 |
2 | TO | 16,377 |
3 | AND | 14,716 |
4 | OF | 11,902 |
5 | IN | 10,286 |
6 | # | 9,572 |
7 | A | 7,959 |
8 | WE | 7,824 |
9 | FOR | 7,080 |
10 | THAT | 6,826 |
11 | WILL | 6,752 |
12 | IS | 4,877 |
13 | I | 4,840 |
14 | OUR | 4,291 |
15 | THIS | 4,203 |
16 | ON | 3,472 |
17 | YEAR | 3,266 |
18 | HAVE | 3,065 |
19 | BE | 3,040 |
20 | IT | 2,946 |
21 | BY | 2,818 |
22 | TAX | 2,763 |
23 | ARE | 2,751 |
24 | PER | 2,728 |
25 | WITH | 2,615 |
26 | CENT | 2,545 |
27 | FROM | 2,516 |
28 | Â | 2,423 |
29 | AS | 2,247 |
30 | NEW | 2,105 |
31 | HEAR | 2,033 |
32 | S | 2,008 |
33 | CAN | 2,000 |
34 | TODAY | 1,985 |
35 | MORE | 1,977 |
36 | BUT | 1,863 |
37 | AT | 1,836 |
38 | SO | 1,770 |
39 | POUNDS | 1,628 |
40 | NOT | 1,571 |
41 | NOW | 1,527 |
42 | BRITAIN | 1,478 |
43 | NEXT | 1,464 |
44 | HAS | 1,458 |
45 | THEIR | 1,454 |
46 | INTERRUPTION | 1,430 |
47 | MR | 1,405 |
48 | ALSO | 1,395 |
49 | SPEAKER | 1,359 |
50 | THAN | 1,354 |
51 | PEOPLE | 1,353 |
52 | ALL | 1,349 |
53 | GOVERNMENT | 1,329 |
54 | YEARS | 1,237 |
55 | ECONOMY | 1,225 |
56 | OVER | 1,217 |
57 | BUDGET | 1,189 |
58 | BILLION | 1,163 |
59 | HELP | 1,154 |
60 | THEY | 1,129 |
61 | AN | 1,123 |
62 | WORK | 1,038 |
63 | SUPPORT | 1,035 |
64 | INVESTMENT | 1,025 |
65 | DEPUTY | 1,022 |
66 | COUNTRY | 994 |
67 | WHICH | 963 |
68 | PUBLIC | 928 |
69 | THOSE | 909 |
70 | UP | 907 |
71 | GROWTH | 905 |
72 | MILLION | 894 |
73 | WHO | 880 |
74 | SPENDING | 878 |
75 | FIRST | 874 |
76 | ONE | 872 |
77 | BUSINESS | 862 |
78 | DO | 845 |
79 | LAST | 831 |
80 | RATE | 828 |
81 | WORLD | 800 |
82 | EVERY | 790 |
83 | FURTHER | 777 |
84 | OUT | 776 |
85 | BEEN | 762 |
86 | PAY | 755 |
87 | THERE | 746 |
88 | NATIONAL | 745 |
89 | MAKE | 724 |
90 | ECONOMIC | 719 |
91 | AM | 707 |
92 | INCOME | 703 |
93 | FORECAST | 702 |
94 | TIME | 682 |
95 | WOULD | 679 |
96 | FUTURE | 664 |
97 | THESE | 664 |
98 | BUSINESSES | 663 |
99 | DEBT | 661 |
100 | WAS | 647 |
Table 15. The 100 Most Frequent Words in the Subset of Debates on US Budget Speeches
N | Word | Freq. |
---|---|---|
1 | THE | 27,280 |
2 | TO | 10,580 |
3 | THAT | 10,089 |
4 | OF | 9,164 |
5 | AND | 8,830 |
6 | IN | 7,124 |
7 | IS | 6,525 |
8 | A | 6,298 |
9 | HE | 4,719 |
10 | FOR | 4,276 |
11 | WE | 4,087 |
12 | IT | 3,895 |
13 | I | 3,824 |
14 | # | 3,697 |
15 | ON | 3,505 |
16 | NOT | 3,219 |
17 | HAVE | 3,107 |
18 | CHANCELLOR | 3,059 |
19 | WILL | 2,896 |
20 | ARE | 2,797 |
21 | S | 2,628 |
22 | BE | 2,469 |
23 | HAS | 2,453 |
24 | THIS | 2,446 |
25 | THEY | 2,149 |
26 | AS | 1,903 |
27 | BUT | 1,846 |
28 | TAX | 1,784 |
29 | GOVERNMENT | 1,757 |
30 | WITH | 1,701 |
31 | HIS | 1,693 |
32 | BY | 1,540 |
33 | WHAT | 1,482 |
34 | WAS | 1,455 |
35 | PEOPLE | 1,438 |
36 | MORE | 1,287 |
37 | AT | 1,280 |
38 | ABOUT | 1,269 |
39 | YEAR | 1,250 |
40 | HON | 1,223 |
41 | OUR | 1,213 |
42 | FROM | 1,162 |
43 | WHICH | 1,152 |
44 | WOULD | 1,147 |
45 | ALL | 1,108 |
46 | DO | 1,095 |
47 | CAN | 1,087 |
48 | BUDGET | 1,086 |
49 | THERE | 1,079 |
50 | BEEN | 1,034 |
51 | TODAY | 1,001 |
52 | SO | 998 |
53 | RIGHT | 992 |
54 | NOW | 977 |
55 | INTERRUPTION | 970 |
56 | THEIR | 937 |
57 | Â | 932 |
58 | WHO | 932 |
59 | THAN | 930 |
60 | MY | 913 |
61 | IF | 904 |
62 | US | 900 |
63 | AN | 877 |
64 | WHEN | 872 |
65 | ER | 857 |
66 | PER | 856 |
67 | YEARS | 836 |
68 | PUBLIC | 831 |
69 | HEAR | 817 |
70 | ONE | 795 |
71 | SAID | 791 |
72 | CENT | 788 |
73 | GROWTH | 786 |
74 | UP | 772 |
75 | MR | 770 |
76 | ECONOMY | 755 |
77 | OUT | 754 |
78 | SHOULD | 745 |
79 | THOSE | 726 |
80 | NO | 708 |
81 | COUNTRY | 701 |
82 | OR | 683 |
83 | BECAUSE | 676 |
84 | JUST | 645 |
85 | LABOUR | 634 |
86 | HAD | 622 |
87 | SPENDING | 619 |
88 | GENTLEMAN | 614 |
89 | OVER | 611 |
90 | DOES | 606 |
91 | SPEAKER | 600 |
92 | LAST | 599 |
93 | MINISTER | 576 |
94 | T | 570 |
95 | DID | 566 |
96 | THEM | 565 |
97 | FRIEND | 561 |
98 | MONEY | 561 |
99 | BILLION | 553 |
100 | WHY | 552 |
N | Word | Freq. |
---|---|---|
1 | 的 | 1,201 |
2 | 我们 | 476 |
3 | 我 | 294 |
4 | 和 | 283 |
5 | 是 | 217 |
6 | 在 | 214 |
7 | 个 | 201 |
8 | 了 | 198 |
9 | 这 | 196 |
10 | 中国 | 186 |
11 | 一 | 181 |
12 | 两 | 165 |
13 | 中 | 164 |
14 | 国 | 155 |
15 | 美 | 140 |
16 | 合作 | 138 |
17 | 美国 | 133 |
18 | 问题 | 112 |
19 | 关系 | 109 |
20 | 人民 | 106 |
21 | 也 | 89 |
22 | 就 | 86 |
23 | 不 | 84 |
24 | 对 | 81 |
25 | 有 | 80 |
26 | 国家 | 75 |
27 | 发展 | 73 |
28 | 共同 | 67 |
29 | 都 | 65 |
30 | 为 | 63 |
31 | 双方 | 61 |
32 | 进行 | 58 |
33 | 和平 | 56 |
34 | 要 | 54 |
35 | 总统 | 54 |
36 | 上 | 53 |
37 | 世界 | 53 |
38 | 地 | 52 |
39 | 努力 | 52 |
40 | 啊 | 50 |
41 | 会 | 48 |
42 | 将 | 47 |
43 | 更 | 46 |
44 | 与 | 46 |
45 | 大 | 45 |
46 | 而 | 44 |
47 | 能够 | 43 |
48 | 重要 | 43 |
49 | 主席 | 43 |
50 | 达成 | 42 |
51 | 到 | 41 |
52 | 说 | 41 |
53 | 国际 | 40 |
54 | 奥巴马 | 39 |
55 | 今天 | 39 |
56 | 领域 | 39 |
57 | 呢 | 39 |
58 | # | 38 |
59 | 并 | 38 |
60 | 解决 | 38 |
61 | 习 | 38 |
62 | 坚持 | 37 |
63 | 来 | 37 |
64 | 同意 | 37 |
65 | 这些 | 37 |
66 | 次 | 36 |
67 | 取得 | 36 |
68 | 认为 | 36 |
69 | 新 | 36 |
70 | 继续 | 35 |
71 | 应该 | 35 |
72 | 还 | 34 |
73 | 加强 | 34 |
74 | 通过 | 34 |
75 | 同 | 34 |
76 | 以及 | 34 |
77 | 访问 | 33 |
78 | 以 | 33 |
79 | 多 | 32 |
80 | 各 | 31 |
81 | 欢迎 | 31 |
82 | 年 | 31 |
83 | 网络 | 31 |
84 | 向 | 31 |
85 | 好 | 30 |
86 | 实现 | 30 |
87 | 讨论 | 30 |
88 | 一些 | 30 |
89 | 让 | 29 |
90 | 稳定 | 29 |
91 | 之间 | 29 |
92 | 变化 | 28 |
93 | 他们 | 28 |
94 | 已经 | 28 |
95 | 很 | 27 |
96 | 气候 | 27 |
97 | 相互 | 27 |
98 | 地区 | 26 |
99 | 分歧 | 26 |
100 | 夫人 | 26 |
N | Word | Freq. |
---|---|---|
1 | THE | 3,244 |
2 | AND | 2,697 |
3 | TO | 2,081 |
4 | OF | 1,496 |
5 | WE | 1,162 |
6 | THAT | 1,033 |
7 | IN | 1,014 |
8 | A | 837 |
9 | I | 666 |
10 | CHINA | 637 |
11 | OUR | 636 |
12 | S | 595 |
13 | IS | 585 |
14 | HAVE | 537 |
15 | ON | 520 |
16 | FOR | 410 |
17 | IT | 371 |
18 | ARE | 362 |
19 | WITH | 351 |
20 | AS | 346 |
21 | WILL | 331 |
22 | YOU | 330 |
23 | THIS | 329 |
24 | UNITED | 323 |
25 | STATES | 299 |
26 | PRESIDENT | 259 |
27 | TWO | 236 |
28 | COOPERATION | 232 |
29 | COUNTRIES | 213 |
30 | NOT | 208 |
31 | OTHER | 194 |
32 | WORK | 189 |
33 | CAN | 186 |
34 | ALL | 184 |
35 | PEOPLE | 183 |
36 | BE | 179 |
37 | SO | 173 |
38 | AT | 171 |
39 | WORLD | 167 |
40 | HAS | 163 |
41 | MORE | 161 |
42 | THERE | 160 |
43 | TOGETHER | 153 |
44 | OR | 152 |
45 | # | 151 |
46 | BETWEEN | 151 |
47 | BY | 144 |
48 | AN | 143 |
49 | ER | 140 |
50 | CHINESE | 139 |
51 | BUT | 138 |
52 | WHAT | 138 |
53 | ALSO | 136 |
54 | XI | 133 |
55 | ISSUES | 130 |
56 | NEW | 128 |
57 | FROM | 123 |
58 | RELATIONS | 121 |
59 | DO | 119 |
60 | THANK | 117 |
61 | ISSUE | 114 |
62 | SECURITY | 113 |
63 | U | 111 |
64 | BOTH | 109 |
65 | NUCLEAR | 109 |
66 | VE | 109 |
67 | THEY | 108 |
68 | IMPORTANT | 106 |
69 | BEEN | 105 |
70 | MAKE | 105 |
71 | INTERNATIONAL | 104 |
72 | MY | 104 |
73 | RELATIONSHIP | 103 |
74 | ABOUT | 102 |
75 | CHINA-U | 100 |
76 | THINK | 98 |
77 | VERY | 98 |
78 | RE | 94 |
79 | CONTINUE | 93 |
80 | DEVELOPMENT | 92 |
81 | PROGRESS | 92 |
82 | NATIONS | 90 |
83 | ONE | 90 |
84 | WHICH | 90 |
85 | ME | 89 |
86 | SHOULD | 89 |
87 | IF | 88 |
88 | SIDES | 88 |
89 | SOME | 87 |
90 | WHEN | 87 |
91 | AGREEMENT | 85 |
92 | GLOBAL | 85 |
93 | N | 85 |
94 | TODAY | 85 |
95 | NOW | 84 |
96 | AMERICAN | 83 |
97 | WANT | 82 |
98 | HAD | 81 |
99 | SAID | 81 |
100 | AGREED | 80 |
N | Word | Freq. |
---|---|---|
1 | 的 | 375 |
2 | 我们 | 126 |
3 | 国 | 118 |
4 | 两 | 116 |
5 | 中国 | 100 |
6 | 英国 | 92 |
7 | 和 | 89 |
8 | 了 | 87 |
9 | 关系 | 83 |
10 | 中 | 78 |
11 | # | 76 |
12 | 英 | 75 |
13 | 我 | 70 |
14 | 在 | 66 |
15 | 是 | 62 |
16 | 一 | 59 |
17 | 也 | 48 |
18 | 合作 | 47 |
19 | 个 | 46 |
20 | 为 | 42 |
21 | 发展 | 40 |
22 | 这 | 39 |
23 | 世界 | 35 |
24 | 对 | 34 |
25 | 更 | 31 |
26 | 共同 | 31 |
27 | 将 | 30 |
28 | 人民 | 30 |
29 | 新 | 29 |
30 | 访问 | 26 |
31 | 年 | 26 |
32 | 有 | 25 |
33 | 经济 | 24 |
34 | 今天 | 23 |
35 | 上 | 23 |
36 | 与 | 23 |
37 | 都 | 22 |
38 | 国家 | 22 |
39 | 贸易 | 21 |
40 | 问题 | 21 |
41 | 中英 | 21 |
42 | 次 | 20 |
43 | 大 | 20 |
44 | 来 | 20 |
45 | 要 | 20 |
46 | 作为 | 20 |
47 | 好 | 19 |
48 | 增长 | 19 |
49 | 双方 | 18 |
50 | 不 | 17 |
51 | 最 | 17 |
52 | 国际 | 16 |
53 | 进行 | 16 |
54 | 女王 | 16 |
55 | 首相 | 16 |
56 | 讨论 | 16 |
57 | 投资 | 16 |
58 | 重要 | 16 |
59 | 使 | 15 |
60 | 成为 | 14 |
61 | 伙伴 | 14 |
62 | 机遇 | 14 |
63 | 就 | 14 |
64 | 卡梅伦 | 14 |
65 | 政府 | 14 |
66 | 总理 | 14 |
67 | 第二 | 13 |
68 | 多 | 13 |
69 | 很 | 13 |
70 | 会 | 13 |
71 | 建立 | 13 |
72 | 能够 | 13 |
73 | 陛下 | 12 |
74 | 到 | 12 |
75 | 还 | 12 |
76 | 欢迎 | 12 |
77 | 可以 | 12 |
78 | 不仅 | 11 |
79 | 达成 | 11 |
80 | 各位 | 11 |
81 | 过去 | 11 |
82 | 联合国 | 11 |
83 | 认为 | 11 |
84 | 以来 | 11 |
85 | 之间 | 11 |
86 | 表示 | 10 |
87 | 地 | 10 |
88 | 方面 | 10 |
89 | 国事 | 10 |
90 | 及 | 10 |
91 | 今年 | 10 |
92 | 朋友们 | 10 |
93 | 全球 | 10 |
94 | 相互 | 10 |
95 | 已经 | 10 |
96 | 战略 | 10 |
97 | 着 | 10 |
98 | 尊敬 | 10 |
99 | 安理会 | 9 |
100 | 并 | 9 |
N | Word | Freq. |
---|---|---|
1 | THE | 563 |
2 | AND | 462 |
3 | TO | 294 |
4 | OF | 246 |
5 | WE | 182 |
6 | IN | 176 |
7 | A | 171 |
8 | CHINA | 145 |
9 | OUR | 138 |
10 | FOR | 98 |
11 | I | 87 |
12 | IS | 80 |
13 | # | 76 |
14 | THAT | 76 |
15 | HAVE | 72 |
16 | THIS | 72 |
17 | UK | 72 |
18 | S | 62 |
19 | ALSO | 60 |
20 | AS | 58 |
21 | ARE | 54 |
22 | ON | 54 |
23 | BETWEEN | 53 |
24 | COUNTRIES | 50 |
25 | RELATIONSHIP | 47 |
26 | CHINESE | 46 |
27 | IT | 44 |
28 | PEOPLE | 44 |
29 | WITH | 44 |
30 | MORE | 40 |
31 | BOTH | 38 |
32 | VISIT | 38 |
33 | WILL | 38 |
34 | WHICH | 36 |
35 | AN | 34 |
36 | BUT | 34 |
37 | WORLD | 34 |
38 | COOPERATION | 32 |
39 | NOT | 32 |
40 | TODAY | 32 |
41 | TWO | 32 |
42 | BRITAIN | 30 |
43 | ECONOMIC | 30 |
44 | VE | 30 |
45 | CAN | 28 |
46 | HAS | 28 |
47 | TRADE | 28 |
48 | YEARS | 28 |
49 | DEVELOPMENT | 26 |
50 | FIRST | 26 |
51 | SHOULD | 26 |
52 | SO | 26 |
53 | UNITED | 26 |
54 | WAS | 26 |
55 | NEW | 25 |
56 | BE | 24 |
57 | BRITISH | 24 |
58 | FROM | 24 |
59 | GLOBAL | 24 |
60 | INVESTMENT | 24 |
61 | PARTNERSHIP | 24 |
62 | TOGETHER | 24 |
63 | PRESIDENT | 22 |
64 | TIME | 22 |
65 | YEAR | 22 |
66 | KINGDOM | 20 |
67 | ONE | 20 |
68 | RELATIONS | 20 |
69 | ALL | 18 |
70 | BILATERAL | 18 |
71 | BILLION | 18 |
72 | BY | 18 |
73 | GROWTH | 18 |
74 | ISSUES | 18 |
75 | SEIZE | 18 |
76 | THEY | 18 |
77 | VERY | 18 |
78 | WELCOME | 18 |
79 | WELL | 18 |
80 | WORK | 18 |
81 | COUNTRY | 16 |
82 | FUTURE | 16 |
83 | GOOD | 16 |
84 | LAST | 16 |
85 | LIKE | 16 |
86 | M | 16 |
87 | MARKS | 16 |
88 | MR | 16 |
89 | MY | 16 |
90 | NOW | 16 |
91 | PRIME | 16 |
92 | PRINCE | 16 |
93 | QUEEN | 16 |
94 | SECURITY | 16 |
95 | TIES | 16 |
96 | YOU | 16 |
97 | US | 15 |
98 | AT | 14 |
99 | DISCUSSED | 14 |
100 | HE | 14 |
This part includes a list of publications on and related to CEPIC. You can also find some useful links to corpora as well as conferences, seminars and workshops relevant to political discourse and its translation/interpreting. The information of this page will be updated periodically.
You can access more updates of the CEPIC from https://sites.google.com/a/hkbu.edu.hk/cepic-the-chinese-english-political-interpreting-corpus/, or through following our Facebook / Twitter account.
We would appreciate it if you could send us information of your publications or works based on or related to CEPIC via this link: https://hkbuhk.ca1.qualtrics.com/jfe/form/SV_a97lXo4AKTh0hbT. Selected list of publications and links will be included on this webpage.
Pan, J. (forthcoming). The pragmatics of political discourse: An analytical framework and a comparative study of policy speeches in the United Kingdom and Hong Kong. Bandung: Journal of the Global South.
Pan, J., & Wong, T. M. (forthcoming). Developing Pragmatic Competence in Chinese–English Political Retour Interpreting: A Corpus-Driven Exploratory Study of Pragmatic Markers., inTRAlinea.
Pan, J., & Wong, T. M. (2018). A corpus-driven study of contrastive markers in Cantonese‒English political interpreting. BRAIN – Broad Research in Artificial Intelligence and Neuroscience, pp. 168-176.
Pan, J., & Wang, H. H. (2008). Communication between the speaker and the interpreter. Journal of Jiangsu University (Social Science Edition), 10(4), 77–80.
Pan, J. (2007). Two styles of interpretation: Reflection on the influence of oriental and western thought patterns on the relationship between the speaker and the interpreter. Foreign Language and Culture Studies, 6, 677–688.
Pan, J., (2018, 12-14 September). The use of contrastive markers in English policy speeches: A corpus-based cross-modality comparison of but and however in interpreted and non-interpreted language. Paper presented at the fifth edition of the Using Corpora in Contrastive and Translation Studies conference (UCCTS 2018), Université catholique de Louvain, Belgium. In Sylviane Granger, Marie-Aude Lefer and Laura Aguiar de Souza Penha Marion (eds), Book of Abstracts, pp. 138-139.
Pan, J., & Wong, T. M. (2018, 3-6 July). Pragmatic strategies in political interpreting: A study of pragmatic markers in interpreted political speeches. Paper presented at the IATIS (International Association for Translation and Intercultural Studies) 6th International Conference, Hong Kong Baptist University, Hong Kong.
Pan, J., (2018, 22-24 June). A corpus-based study of the rendition of contrastive markers in Chinese‒English political interpreting. Paper presented at the Corpora and Discourse International Conference, Lancaster University, UK.
Pan, J., (2018, 18-20 June). Pragmatic strategies applied in institutional translation: A case study of the translation of two contrastive markers in Hong Kong’s policy addresses. Poster presentation at the TRANSIUS Conference, University of Geneva, Switzerland.
Pan, J., (2017, December). Developing pragmatic competence in Chinese-English political interpreting. Paper presented at the First National Forum on Diplomatic Discourse and Translation, Henan, PRC.
Pan, J., & Wong, T. M. (2017, September). Developing pragmatic competence in political retour interpreting: A corpus-driven study on the use of pragmatic markers. Paper presented at Teaching Translation and Interpreting 5, University of Łódź, Łódź, Poland.
Pan, J., & Wong, T. M. (2017, September). A Corpus-driven Study of Contrastive Markers in Cantonese‒English Political Interpreting. Paper presented at SMART 2017 – Scientific Methods in Academic Research and Teaching, Timisoara, Romania.
Pan, J., & Wong, T. M. (2015, December). Pragmatic markers in interpreted political discourse: A corpus-driven study. Paper presented at the International Conference on Corpus Linguistics and Technology Advancement (CoLTA), Hong Kong.
Pan, J., & Wong, T. M. (2015, September). Investigating pragmatic markers in interpreted political speeches from Chinese to English. Paper presented at the International Conference “Found in translation – translations are the children of their times”, Bucharest, Romania.
The Corpus of Political Speeches
The Digital Corpus of the European Parliament
The European Comparable and Parallel Corpora
The European Parliament Interpreting Corpus
European Parliament Translation and Interpreting Corpus
Translating and Interpreting Political Discourse (TIPD 2019) (19-20 June 2019)
Principal Investigator: Dr. Jun PAN (Associate Professor, Translation Programme, Hong Kong Baptist University)
Co-Investigator & Special Consultant to the CEPIC: Dr. Billy Tak Ming WONG (Research Coordinator, University Research Centre, Open University of Hong Kong)
Senior Adviser to the CEPIC: Ms. Rebekah WONG (Head of Digital and Multimedia Services Section, Library, Hong Kong Baptist University)
We would like to thank the following research assistants and student helpers for their contribution to the data preparation and library staff for providing support to the technical aspects of the CEPIC.
Research Assistants:
Mr. Fernando GABARRON BARRIOS
Mr. Steven Haoshen HE
Miss Chris Chencheng KUANG
Miss Hannah Qiuhan LIN
Mr. William Dongpeng PAN
Miss Jennifer Lok Man WONG
Miss Grace Jing ZHANG
Student Helpers:
Mr. Antonio Yijiao GUO
Mr. Hank Lin HAN
Miss Rigel Chung Ting PAK
Miss Gladys Hiu Man SHIU
Miss Jess Lin Wing SZE
Miss Tammy Cho Ying TANG
Miss Janny Chi Wai WONG
Miss Alice Yuxin YANG
Mr. Niko Donghuan ZHANG
Library Colleagues:
Mr. Wing Chung YIP
Mr. Timothy Sit YEUNG
Miss Sharon Suk Man YU
Miss Katie Kee Yee CHENG
From left to right: Tammy TANG, Janny WONG, Dr. Jun PAN, Antonio GUO, Fernando GABARRON BARRIOS & William PAN
From left to right, front row: Fernando GABARRON BARRIOS, Dr. Jun PAN & William PAN
From left to right, back row: Janny WONG, Alice YANG, Rigel PAK, Tammy TANG & Gladys SHIU
Meeting with research asssistants and student helpers
Meeting with research asssistants and student helpers
The CEPIC is developed with the funding and support from:
We would like to express our gratitude to the funding bodies for making the work on this corpus possible.
Our appreciation also goes to the Hong Kong Baptist University Library for providing advice on data structure, uploading the corpora, and helping in designing the website and related search functions.