XML Rule Customization Method of LanguageTool Chinese Grammar Proof-Reading

  • Jiang Ying ,
  • Zeng Jie ,
  • Lin Qihong ,
  • Guo Yingshan ,
  • Liao Wensheng
Expand
  • School of Management, Beijing Normal University Zhuhai Campus, Zhuhai 519087

Received date: 2013-12-24

  Revised date: 2014-01-12

  Online published: 2014-03-05

Abstract

It provides a technical solution of XML rule customization based on Chinese grammar proof-reading, which can establish and extend Chinese grammar proof-reading function.The technology has been implemented in the open source proof-reading tool of LanguageTool, with the corresponding XML rule database of Chinese grammar proof-reading.The evaluation results on multiple Chinese corpuses have shown its feasibility and practicability.

Cite this article

Jiang Ying , Zeng Jie , Lin Qihong , Guo Yingshan , Liao Wensheng . XML Rule Customization Method of LanguageTool Chinese Grammar Proof-Reading[J]. Library and Information Service, 2014 , 58(05) : 86 -92 . DOI: 10.13266/j.issn.0252-3116.2014.05.015

References

[1] 杰子.帮你改正错误——用好WPS Office的“中文校对”功能[J].软件, 2002, 27(3):31-32.
[2] 吴明.最新版黑马校对软件在新闻出版单位使用[EB/OL].[2013-10-03].http://data.chinaxwcb.com/epaper/2011/2011-06-20/11589.html.
[3] Mi?kowski M.Developing an open-source, rule-based proofreading tool[J].Software - Practice and Experience, 2010, 40(7):543-566.
[4] LanguageTool中文支持[EB/OL].[2013-10-09].http://www.languagetool.org/zh/.
[5] Deal Proof[EB/OL].[2013-10-03].http://info.legalsolutions.thomsonreuters.com/business-law/pdf/L-360455_US.pdf.
[6] 张仰森, 俞士汶.文本自动校对技术研究综述[J].计算机应用研究, 2006, 37(6):8-12.
[7] Gill M S, Lehal G S.A grammar checking system for Punjabi[C]// Scott D,Uszkoreit H.Proceedings of the 22nd International Conference on Computational Linguistics: Demonstration Papers.Stroudsburg: Association for Computational Linguistics, 2008: 149-152.
[8] Shaalan K F.Arabic GramCheck: A grammar checker for Arabic[J].Software: Practice and Experience, 2005, 35(7): 643-665.
[9] Domeij R, Knutsson O, Carlberger J, et al.Granska-an efficient hybrid system for Swedish grammar checking[C] //Nordgrd T.Proceedings of the 12th Nordic Conference on Computational Linguistics.Trondheim:University of Trondheim, 2000: 82-85.
[10] Gauthier M.Anglophone high school boys' engagement and achievement in editing their French writing using the BonPatronPro[J].Journal of Classroom Research in Literacy, 2013, 6(1): 24-35.
[11] Rozovskaya A, Roth D.Algorithm selection and model adaptation for ESL correction tasks[C] //Yuji M, Rada M.Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1 (HLT '11).Stroudsburg: Association for Computational Linguistics, 2011:924-933.
[12] Dahlmeier D, Ng H T.A beam-search decoder for grammatical error correction[C] //Junichi Tsujii.Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning.Stroudsburg: Association for Computational Linguistics, 2012: 568-578.
[13] Imamura K, Saito K, Sadamitsu K, et al.Grammar error correction using pseudo-error sentences and domain adaptation[C] //Li Haizhou.Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2 (ACL '12).Stroudsburg: Association for Computational Linguistics,2012: 388-392.
[14] Chen Hao-Jan Howard.Evaluating two Web-based grammar checkers-Microsoft ESL assistant and NTNU statistical grammar checker[J].International Journal of Computational Linguistics & Chinese Language Processing,2009, 14(2): 161-180.
[15] Zhang Lei, Zhou Ming, Huang Changning.Multifeature-based approach to automatic error detection and correction of Chinese text[J].Microsoft Research China Paper Collection, 2000, 15(4): 193-197.
[16] Li Jianhua, Wang Xiaolong, Wang Ping.Research about the algorithm of multifeature Chinese text proofreading[J].Computer Engineering and Science, 2001, 22(3):93-95.
[17] 程显毅,孙萍, 朱倩.基于HNC的中文文本校对系统模型的研究[J].微电子学与计算机, 2009(10): 49-52.
[18] 于勐, 姚天顺.一种混合的中文文本校对方法[J].中文信息学报, 1998, 28(2): 50-54.
[19] 吴林,张仰森.基于知识库的多层级中文文本查错推理模型[J].计算机工程, 2012, 38(20):21-25.
[20] 郇政永.基于OCR的中文文本校对研究[D].北京: 北方工业大学,2011.
[21] ictclas4j中文分词系统[EB/OL].[2013-10-03].http://code.google.com/p/ictclas4j/.
[22] 刘月华, 潘文娱, 故韦华.实用现代汉语语法[M].增订本.北京:商务印书馆, 2001:23-34.
[23] 陆俭明.面临新世纪挑战的现代汉语语法研究[M].济南:山东教育出版社, 2000:12-17.
[24] XML规则在线编辑工具[EB/OL].[2014-01-06].http://community.languagetool.org/ruleEditor/index.
[25] Mikowski M.Automating rule generation for grammar checkers[C] // Proceedings of Explorations Across Languages and Corpora (PALC 2009), Frankfurt am Main: Peter Lang, 2011: 123-133.
[26] 电驴网[EB/OL].[2014-01-06].http://www.verycd.com.
[27] 维基百科dump[EB/OL].[2014-01-06].http://dumps.wikimedia.org/zhwiki/latest/.
[28] 数据堂[EB/OL].[2014-01-06].http://www.datatang.com.
[29] LanguageTool中文语法校对XML规则库[EB/OL].[2013-10-09].https://github.com/languagetool-org/languagetool/blob/master/languagetool-language-modules/zh/src/main/resources/org/languagetool/rules/zh/grammar.xml.

Outlines

/