National Chengchi University Institutional Repository (NCCUR): Item 140.119/129361


    Please use this permanent URL to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/129361


    Title: 關聯式文本探勘資訊探索實驗平台設計─以「二二八事件臺灣本地新聞史料彙編」為例
    Designing an Experiment Platform for Information Exploration with Relational Text Mining: A Case Study with the Taiwan 228 - event News Archive
    Authors: 劉吉軒 (Liu, Jyi-shane)
    甯格致
    薛化元
    蔡銘峰
    Contributor: Department of Computer Science
    Keywords: Historical Text Mining; Social Network Analysis; Information Retrieval; Critical Discourse Analysis
    Date: 2014-12
    Uploaded: 2020-04-22 15:39:17 (UTC+8)
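    Abstract: For humanities researchers, building a meaningful idea or argument often involves combing through large volumes of text, filtering out the information relevant to the research goal, and accumulating many clues into a more clearly outlined context. This process also often leads the research question itself to be redefined, refocused, and deepened. Digital humanities research uses the data processing and computing power of computers to help researchers seek answers from data in entirely new ways. It works like a movable, adjustable lens that allows flexible inspection at the micro level, the macro level, or from other viewing angles, in order to analyze large amounts of humanities data, explore humanities issues, and interpret humanities phenomena [1]. Much current digital humanities research that applies information technology to historical texts explores information by first fixing on the topical concepts or phenomena that specific words represent, then searching for and matching those words in the texts, and finally using statistical quantitative analysis to observe, in terms of counts and proportions, how salient or how different those concepts or phenomena are. The result verifies some hypotheses or yields fragmentary new information [2][3]. This approach has been described as a laborious, single-target "hunt and peck" search, or as "slicing": cutting cross-sections and comparing them in sequence to find trends or anomalies [4][5].
    Large text collections often conceal topic information that is rich in meaning, with intricate relationships and overlapping layers of cause and effect. Revealing it calls for advances in information technology that mine associations effectively and present their context concretely, so that humanities researchers can interpret the material and make discoveries. Mining phenomena from a single viewpoint can highlight the significance of specific topic information, but it usually overlooks how critical relational information and contextual structure are. In view of this, this study adopts the spirit of qualitative analysis from earlier social and psychological research, which values the relative significance of each individual entity representing a person, event, or object. It goes further by considering the local situation formed by each entity's associations with the entities around it, and treats the associations among entities as a manifestation of social position. It then compares the similarities and differences among entities of the same type from a horizontal perspective, or compares the contextual patterns that emerge under different environments, times, and places from a vertical perspective. The idea is to first discover the more meaningful key entities, then observe the associations each one holds, and then apply an information-consolidating view that merges horizontally and traverses vertically. The aim is to perform deeper information extraction and association exploration from a multi-level view of the data dimensions, helping researchers obtain findings and results of deeper significance.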
    This study integrates methods from computer science and the social sciences and, from a historian's perspective, views historical text as embedding a miniature social system. The task involves extracting relations among entities from text and performing structural analysis of the resulting entity-relationship network. One primary goal is to find the key-role actor and reveal its social position, which may be defined by certain incidents, words, and behaviors. A further goal is to find other actors with similar social positions and identify the underlying community. Finally, an abstract social role can be characterized to provide insight into the social system constructed from the text. We develop an experimental platform, PARTEX, which provides text analytic tools and allows exploratory observation of the relational structure among entities. Working on preprocessed and imported document collections, and using key conceptual words supplied by historians as focal issues, the platform has been used to identify entity relations and construct the embedded social system. The discourse perspectives of position, demand, emotion, and action are investigated with the contextual parameters of boundary, association type, and relational strength. Both a visual representation of the discourse-oriented social system and quantitative measures are presented for analytic comparison. This study hopes to provide an effective text analytic tool and to contribute to discovering historical implications. We intend to further improve the platform through repeated use and testing, and to validate the approach by fostering fruitful research results.
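
The pipeline the abstract outlines (extract relations, build an entity network, find key actors, compare social positions) can be illustrated with a minimal sketch. This is not PARTEX's implementation, which this record does not describe. It assumes that a relation is plain co-occurrence of two entity names within one passage, that relational strength is the co-occurrence count, that key actors are ranked by weighted degree, and that positional similarity is the Jaccard overlap of neighbour sets. The passages, entity names, and function names are all hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def build_cooccurrence_network(passages, entities, min_strength=1):
    """Link two entities whenever both occur in the same passage.

    Assumption: the passage acts as the boundary, co-occurrence as the
    association type, and the co-occurrence count as relational strength.
    Matching is naive lowercase substring search.
    """
    pair_counts = Counter()
    for passage in passages:
        text = passage.lower()
        present = sorted(e for e in entities if e in text)
        pair_counts.update(combinations(present, 2))

    network = defaultdict(dict)
    for (a, b), count in pair_counts.items():
        if count >= min_strength:
            network[a][b] = count
            network[b][a] = count
    return dict(network)


def key_actors(network):
    """Rank entities by weighted degree, breaking ties alphabetically."""
    return sorted(network, key=lambda e: (-sum(network[e].values()), e))


def position_similarity(network, a, b):
    """Jaccard overlap of the neighbour sets of a and b, excluding each other."""
    na = set(network.get(a, {})) - {b}
    nb = set(network.get(b, {})) - {a}
    union = na | nb
    return len(na & nb) / len(union) if union else 0.0


# Hypothetical toy data, not taken from the news archive.
passages = [
    "The governor met the council to discuss the protest.",
    "Students petitioned the council for reform.",
    "The governor ordered the police to restore order.",
    "Students clashed with the police near the station.",
    "The council and the governor issued a joint statement.",
]
entities = ["governor", "council", "students", "police"]

network = build_cooccurrence_network(passages, entities)
print(key_actors(network))  # ['council', 'governor', 'police', 'students']
print(position_similarity(network, "governor", "students"))  # 1.0
```

In this toy network the governor and the students are tied to the same neighbours (council and police), so they occupy the same structural position even though they never co-occur. That is the kind of similarity the abstract refers to when it speaks of actors with similar social positions.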
    Relation: Symposiums of the Fifth International Conference of Digital Archives and Digital Humanities, pp. 533-540, Academia Sinica, TAIWAN
    Data type: conference
    Appears in collections: [Department of Computer Science] Conference Papers

    Files in this item:

    File       Description   Size     Format      Views
    397.pdf                  1438Kb   Adobe PDF   2295


    All items in NCCUR are protected by their original copyright.


