政大機構典藏-National Chengchi University Institutional Repository(NCCUR):Item 140.119/32700
English  |  正體中文  |  简体中文  |  Post-Print筆數 : 27 |  Items with full text/Total items : 113656/144643 (79%)
Visitors : 51740789      Online Users : 582
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    Please use this identifier to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/32700


    Title: 電腦輔助漢英與英漢翻譯例句搜尋服務
    A Computer Assisted Environment for Searching Related Translations between Chinese and English
    Authors: 賴敏華
    Lai, Min Hua
    Contributors: 劉昭麟
    Liu, Chao Lin
    賴敏華
    Lai, Min Hua
    Keywords: 電腦輔助語文教學
    句型搜尋
    例句式教學
    電腦輔助翻譯
    Date: 2008
    Issue Date: 2009-09-17 14:05:13 (UTC+8)
    Abstract: 本論文為提供一個能輔助學生學習英漢翻譯與漢英翻譯時,搜尋翻譯例句的環境。我們的平行語料是利用網路上可取得的文件,如:英語教學網站、學習單等,利用人工擷取中英文對照的句子。標記化語料庫中記錄了中文句、英文句、中文句斷詞後的結果、中文句的詞性標記、中文句結構樹以及英文結構樹等資訊。
    使用者輸入的查詢句可包含中文句、英文句及中英文混合句。我們的系統會依據不同的搜尋功能,針對查詢句作前處理,如:斷詞、詞性擷取、結構樹分析、詞性還原、查詢句的詞彙擴展等,再與標記化語料庫作比對,最後提供與查詢句相似的中英文對照句子給使用者,讓使用者在學習翻譯時,有更多類似句可以參考。
    我們的系統不容易使用正規的方式評估;為了評估系統的效能,我們記錄各個搜尋功能,在不同門檻值下所得到的類似句句數,並利用NIST及BLEU來評估本系統所提供的類似句品質;另外我們透過問卷調查請受試者勾選本系統所提供類似句。問卷調查結果顯示受試者對於本系統所提供的類似句共識度並不高;本系統在提供10句類似句中,僅有1.6句的類似句是受試者認為有幫助的。
    I present an environment for searching related translations between Chinese and English. A parallel and tagged corpus was constructed based on the text material obtained from the Internet, including English teaching websites and public learn¬ing sheets. The corpus contains both English and Chinese sentences, the infor¬mation about how the Chinese strings were segmented, the POS tags of the Chinese words, and the syntactic structures of the English and Chinese sentences.
    The user can use our system to do some queries by entering a Chinese sentence, an English sentence, or any pattern with mixed Chinese and English. The query sentence will be preprocessed according to the search function which the user selects, and the results of preprocessing will be used to search in the tagged corpus. The search results will be the reference sentences that are related to the query sentence.
    A formal evaluation of our system is not easy. I evaluated the system by entering a set of selected queries. For those tests, I recorded and compared the amount of reference sentences the system returned, and evaluated the quality of the reference sentences with their BLEU and NIST scores with some standard translations. In addition, I evaluated my system with the help of human subjects. Human subjects were asked to choose useful sentences from the reference sentences returned by my system. Experimental results indicated that the agreements between human subjects were not high, and the human subjects found that only about 1.6 sentences were useful from 10 reference sentences.
    Reference: [1] Eric Brill, A Simple Rule-Based Part of Speech Tagger. Proceedings of the Third Conference on Applied Natural Language Processing, 152-155, 1992.
    http://bulba.sdsu.edu/jeanette/thesis/PennTags.html [Last visited on 18 September 2008]
    [2] Y.-F. Chang and D. L. Schallert, The Design for a Collaborative System of English as Foreign Language Composition Writing of Senior High School Students in Taiwan. Proceedings of the Fifth IEEE International Conference on Advance Learning Technologies, 774-775, 2005.
    [3] G. Doddington, Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. Proceedings of the Second international Conference on Human Language Technology Research, 138-145, 2002.
    [4] Z. Dong and Q. Dong, HowNet, 2000. http://www.keenage.com [Last visited on 26 June 2008]
    [5] The Stanford Parser: A statistical parser (version 1.6),
    http://www-nlp.stanford.edu/software/lex-parser.shtml [Last visited on 26 June 2008]
    [6] J. Kakegawa, H. Kanda, E. Fujioka, M. Itami and K. Itoh, Diagnostic Processing of Japanese for Computer-Assisted Second Language Learning. Proceedings of the Thirty Eighth Annual Meeting on Association for Computational Linguistics, 537-546, 2000.
    [7] O. Knutsson, T. C. Pargman and K. S. Eklundh, Transforming Grammar Checking Technology into a Learning Environment for Second Language Writing. Proceedings of the HLT-NAACL 2003 Workshop on Building Educational Applications Using Natural Language Processing, Volume 2, 38-45, 2003.
    [8] C.-L. Liu, C.-H. Wang, and Z.-M. Gao, Using Lexical Constraints to Enhance the Quality of Computer-Generated Multiple-Choice Cloze Items. International Journal of Computational Linguistics and Chinese Language Processing, Volume 10, Number 3, 303-328, 2005.
    [9] C. D. Manning and H. Schütze, Foundations of Statistical Natural Language Processing, the MIT Press, 1999.
    [10] G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross and K. Miller, Introduction to WordNet: An On-line Lexical Database. International Journal of Lexicography, Volume 3, Number 4, 235-244, 1990. http://wordnet.princeton.edu/doc/ [Last visited on 26 June 2008]
    [11] R. Mitkov and L. A. Ha, Computer-Aided Generation of Multiple-Choice Tests. Proceedings of the HLT-NAACL 2003 Workshop on Building Educational Applications Using Natural Language Processing, Volume2, 17-22, 2003.
    [12] S. B. Needleman and C. D. Wunsh, A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins. Journal of Molecular Biology, Volume 48, Number 3, 443-453, 1970.
    [13] L. Nygaard and J. B. Johannessen, SearchTree – A User-friendly Treebank Search Interface. Proceedings of the 3rd Workshop on Treebanks and Linguistic Theories, 183-189, 2004.
    [14] K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu, BLEU: A Method for Automatic Evaluation of Macine Translation. Proceedings of the Fortieth Annual Meeting of the Association for Computational Linguistics, 311-318, 2002.
    [15] S. Sato, CTM: an example-based translation aid system. Proceedings of the fourteenth Conference on Computational Linguistics, Volume 4, 1259-1268, 1992.
    [16] M. Shimohata, E. Sumita, and Y. Matsumoto, Retrieving meaning-equivalent sentences for example-based rough translation. Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond, Volume 3, 50-56, 2003.
    [17] M. Volk, J. Lundborg, and M. Mettler, A Search Tool for Parallel Treebanks, Proceedings of the Linguistic Annotation Workshop (LAW) on Association for Computational Linguistics, 85-92, 2007.
    [18] G.R.S. Weir and G. Lepouras, English Assistant: A Support Strategy for On-Line Second Language Learning. Proceedings of the Second IEEE International Conference on Advance Learning Technologies, 125-126, 2001.
    [19] X.-H. Zhou, Application of English Cohesion Theory in the Teaching of Writing to Chinese Graduate Students. Journal of US-China Education Review, Volume 4, Number 7, 31-37, 2007.
    [20] 中研院中文句結構樹資料庫檢索系統,http://turing.iis.sinica.edu.tw/treesearch/ [Last visited on 24 June 2008]
    [21] 中研院中文斷詞系統,http://ckipsvr.iis.sinica.edu.tw/ [Last visited on 24 June 2008]
    [22] 中研院平衡語料庫詞類標記集,http://ckipsvr.iis.sinica.edu.tw/category_list.doc [Last visited on 24 June 2008]
    [23] 中研院現代漢語語料庫一詞泛讀,http://140.109.150.65/cwordframe.html [Last visited on 24 June 2008]
    [24] 呂明欣,國小國語科測驗卷電腦輔助出題系統,碩士論文,國立政治大學。台灣,台北,2007。
    [25] 林仁祥及劉昭麟。國小國語科測驗卷出題輔助系統,2007台灣網際網路研討會論文集,論文光碟。台灣,台北,2007。
    [26] 唐建輝,大滿貫複習講義 英語(全),翰林出版事業股份有限公司。2008。
    [27] 旋元佑文法,http://tw.myblog.yahoo.com/jw!GFGhGimWHxN4wRWXG1UDIL_XSA--/ [Last visited on 24 June 2008]
    [28] 基礎英文1200句,http://hk.geocities.com/cnlyhhp/eng.htm [Last visited on 24 June 2008]
    [29] 國民中學學習資源網,http://140.111.34.172/teacool/new_page_2.htm [Last visited on 24 June 2008]
    [30] 教育部委託宜蘭縣發展九年一貫課程建置語文學習領域(英語)國中教科書補充資料暨題庫建置計畫,http://140.111.66.37/english/ [Last visited on 24 June 2008]
    [31] 教育部國民教育司,http://www.edu.tw/EJE [Last visited on 24 June 20088]
    [32] 陳佳吟、柯明憲、吳紫葦及張俊盛,電腦輔助英文文法出題系統,第十七屆自然語言與語音處理研討會論文集。台灣,台南,2005。
    [33] 劉吉軒、洪培鈞及李金瑛,以英語寫作輔助為目的之語料庫語句檢索方法,第十九屆自然語言與語音處理研討會論文集,5-19。台灣,台北,2007。
    [34] 賴世雄,文法從頭學,長春藤有聲出版有限公司。2007。
    Description: 碩士
    國立政治大學
    資訊科學學系
    95753023
    97
    Source URI: http://thesis.lib.nccu.edu.tw/record/#G0095753023
    Data Type: thesis
    Appears in Collections:[Department of Computer Science ] Theses

    Files in This Item:

    File Description SizeFormat
    302301.pdf60KbAdobe PDF2819View/Open
    302302.pdf165KbAdobe PDF2779View/Open
    302303.pdf134KbAdobe PDF2890View/Open
    302304.pdf151KbAdobe PDF21027View/Open
    302305.pdf219KbAdobe PDF2806View/Open
    302306.pdf371KbAdobe PDF2871View/Open
    302307.pdf334KbAdobe PDF21014View/Open
    302308.pdf415KbAdobe PDF2995View/Open
    302309.pdf115KbAdobe PDF2780View/Open
    302310.pdf1189KbAdobe PDF2891View/Open
    302311.pdf224KbAdobe PDF2825View/Open
    302312.pdf159KbAdobe PDF21069View/Open
    302313.pdf399KbAdobe PDF2838View/Open


    All items in 政大典藏 are protected by copyright, with all rights reserved.


    社群 sharing

    著作權政策宣告 Copyright Announcement
    1.本網站之數位內容為國立政治大學所收錄之機構典藏,無償提供學術研究與公眾教育等公益性使用,惟仍請適度,合理使用本網站之內容,以尊重著作權人之權益。商業上之利用,則請先取得著作權人之授權。
    The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

    2.本網站之製作,已盡力防止侵害著作權人之權益,如仍發現本網站之數位內容有侵害著作權人權益情事者,請權利人通知本網站維護人員(nccur@nccu.edu.tw),維護人員將立即採取移除該數位著作等補救措施。
    NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback