[1] Koehn P.Europarl: A parallel corpus for statistical machine translation[C]//Proceedings of Machine Translation Summit X. Phuket: Asia-Pacific Association for Machine Translation, 2005:79-86.
[2] Dandapat S,Morrissey S, Kumar N, et al. Statistically motivated example-based machine translation using translation memory[C]//Sharma D, Sangal R, Sarkav S. Proceedings of the 8th International Conference on Natural Language Processing. Kharagpur: Macmillan Publishers, 2010: 168-177.
[3] 王克非,熊文新. 汉英对应语料库的检索及应用[J]. 外语电化教学, 2011(6):31-36.
[4] Renouf A,Kehoe A, Banerjee J. WebCorp:An integrated system for Web text search[C]//Nesselhauf C, Hundt M, Biewer C. Corpus Linguistics and the Web. Amsterdam: Rodopi, 2007:47-68.
[5] Baroni M,Bernardini S. BootCaT: Bootstrapping corpora and terms from the Web[C]//Teresa M, Maria L, Xavier F, et al. Proceedings of 4th International Conference on Language Resources and Evaluation. Paris: European Language Resources Association, 2004: 1313-1316.
[6] Kucera H,Francis W, Carroll J. Computational Analysis of Present Day American English[M].Providence: Brown University Press, 1967.
[7] Sharoff S.Creating general-purpose corpora using automated search engine queries[C]//Baroni M, Bernardini S. WaCky! Working papers on the Web as Corpus. Bologna: Gedit, 2006:63-98.
[8] Chang B.Chinese-English Parallel Corpus Construction and its Application[C]//Masuichi H, Ohkuma T, Ishikawa K, et al. Proceedings of 18th Pacific Conference on Language, Information and Communication. Tokyo: The Logico-Linguistic Society of Japan, 2004:283-290.
[9] Eisele A,Chen Y. MultiUN: A multilingual corpus from United Nation documents[C]//Calzolari N, Choukri K, Maegaard B, et al. Proceedings of the 7th International Conference on Language Resources and Evaluation. Paris: European Language Resources Association, 2010:2868-2872.
[10] William G,Church K. A program for aligning sentences in bilingual corpora[C]//Appelt D. Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 1991:177-184.
[11] Wu D. Aligninga parallel English-Chinese corpus statistically with lexical criteria[C]//Pustejovsky J. Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics. Stroudsburg:Association for Computational Linguistics, 1994:80-87.
[12] Nie J, Simard M,Isabelle P, et al. Cross-language information retrieval based on parallel texts and automatic mining of parallel texts from the Web[C]//Gey F, Hearst M, Tong R. Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval. New York: ACM, 1999:74-81.
[13] Ma X, Liberman M.Bits: A method for bilingual text search over the Web[C]//Proceedings of Machine Translation Summit VII. Singapore: Asia-Pacific Association for Machine Translation, 1999: 538-542.
[14] Resnik P, Smith N.The Web as a parallel corpus[J]. Computational Linguistics, 2003, 29(3):349-380.
[15] Chen J, Chau R, Yeh C.Discovering parallel text from the World Wide Web[C]//Purvis M. The Australasian Workshop on Data Mining and Web Intelligence. Sydney: Australian Computer Society, 2004:157-161.
[16] Zhang Y, Wu K, Gao J,et al. Automatic acquisition of Chinese-English parallel corpus from the Web[C]//Lalmas M, MacFarlane A, Rüger S. Proceedings of 28th European Conference on Information Retrieval. Heidelberg: Springer-Verlag, 2006:420-431.
[17] Harris Z.Discourse and sublanguage[C]//Kittredge R, Lehrberger J. Sublanguage: Studies of Language in Restricted Semantic Domains. Berlin: Walter de Gruyter, 1989:231-236.
[18] O’Brien S.Controlling controlled english: An analysis of several controlled language rule sets[C]//Proceedings of the Conference Controlled Language Translation.Allschwil:The European Association for Machine Translation, 2003:105-114.
[19] 俞士汶,朱学锋. 受限汉语研究的必要性[C]//王均. 语文现代化论丛第三辑. 北京: 语文出版社, 1997:150-160.
[20] 俞士汶. 关于受限的规则汉语的设想[C]//王均. 语文现代化论丛. 济南: 山东教育出版社, 1995:193-205.
[21] 王克非, 熊文新. 用于翻译教学与研究的英汉对应语料库加工处理[J]. 外语电化教学, 2009(6):3-9.
[22] Manning C,Schutze H. Foundations of statistical natural language processing[M]. Cambridge: MIT Press, 1999.
[23] Fletcher W.Implementing a BNC-compareable Web corpus[C]//Fairon C, Naets H, Kilgarriff A. Proceedings of 3rd Web as Corpus Workshop. Louvain-la-Neuve: Presses universitaires de Louvain, 2007:43-56.
[24] 叶莎妮, 吕雅娟, 黄赟. 基于Web的双语平行句对自动获取[J]. 中文信息学报, 2008, 22 (5):67-73.
[25] Cai D, Yu S,Wen J, et al. Extracting content structure for Web pages based on visual representation[C]//Zhou X, Zhang Y, Orlowska M. Proceedings of the 5th Asian-Pacific Web Conference. Heidelberg: Springer-Verlag, 2003: 406-417.
[26] Shi L, Niu C,Zhou M, et al. A DOM tree alignment model for mining parallel data from the Web[C]//Calzolari N, Cardie C,Isabelle P. Proceedings of 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2006: 489-496.
[27] Weibel S.The Dublin Core: A simple content description model for electronic resources[J]. Bulletin of the American Society for Information Science and Technology,1997, 24(1): 9-11. |