Loading...
|
Please use this identifier to cite or link to this item:
https://nccur.lib.nccu.edu.tw/handle/140.119/146297
|
Title: | 大英線上圖書館與倫敦大學體系線上圖書館上架編碼的數位考古 The digit archeology about Listing code of online British Library and University of London |
Authors: | 陳以洵 Chen, Yi-Hsun |
Contributors: | 曾正男 Tzeng,Jeng-Nan 陳以洵 Chen, Yi-Hsun |
Keywords: | 論文比對 網路爬蟲 錯排問題 隨機抽樣 排序一致性 Z分數 箱型圖法 Thesis Comparison Web Scraping Derangements Random Sampling Sequential Consistency Z-Score Box Plot Method |
Date: | 2023 |
Issue Date: | 2023-08-02 13:01:59 (UTC+8) |
Abstract: | 由於申請國外學位論文證明的時間成本較高,本論文目標為利用公開網路資訊,來建制出一套學位論文離群程度的初階篩選,我們以Python Selenium及BeautifulSoup針對大英線上圖書館(British Library EThOS)與倫敦大學體系下的倫敦政經學院(LSE)線上圖書館的論文資料為例,論證在這兩邊線上圖書館論文上架編碼的排序方式是否具有一定程度的一致性,共同作為學位論文離群程度檢核的一種參考。
考古是為了還原過去的歷史真相,利用網路公開資訊還原真相的過程,我們稱為數位考古。本論文定義一個同序矩陣,建立評量函數,透過排序的差異度來評斷論文上架時間的離群程度。藉此指標,若驗證學位時發現有嚴重離群此指標平均的論文,我們才需特別用正式管道申請的方式來驗證。 Given the high time cost of applying for foreign degree thesis certification, the aim of this paper is to use publicly available online information to establish a preliminary screening system for the degree of outlier in theses. We use Python Selenium and BeautifulSoup to examine thesis data from the British Library EThOS and the online library of the London School of Economics (LSE) under the University of London system. We argue whether the sorting methods of thesis coding on these two online libraries have a certain degree of consistency, both serving as a reference for checking the degree of outlier in theses.
Archaeology is for the purpose of restoring the historical truth of the past, and the process of using publicly available online information to restore the truth, we call it digital archaeology. This paper defines a permutation matrix and establishes an evaluation function. The degree of deviation in sorting is used to judge the outlier degree of thesis shelf time. With this index, if a severe outlier is found during degree verification, we only need to verify it by applying through formal channels. |
Reference: | [1] 蔡壁如論文遭指「不當引用」 德明科大證實:啟動審理 (https://news.tvbs.com.tw/politics/1877096)
[2] 蔡壁如為論文驟然告別立院 4個考量設下停損點 (https://vip.udn.com/vip/story/122367/6688730)
[3] 林智堅「論文門」懶人包不斷更新:兩派到底吵什麼?後續有何發展?論文爭議始末一次看 (https://ynews.page.link/9Pb8)
[4] 台大認定林智堅論文抄襲撤銷碩士學位 教育部暫未收到訴願申請 (https://ynews.page.link/gpXM)
[5] 快訊》林智堅將主動退選!鄭運鵬接棒選桃園市長 (https://ctsnews.page.link/3jHaq)
[6] 週刊爆博士論文涉抄襲,高虹安公布辛辛那提大學校方聲明強調無版權問題,「我不是林智堅」(https://www.thenewslens.com/article/173039)
[7] 快訊/博士論文突遭母校下架?高虹安回應了 (https://ynews.page.link/CnkFp)
[8] D. M. Thomas and S. Mathur, "Data Analysis by Web Scraping using Python," 2019 3rd International conference on Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, India, 2019, pp. 450-454, doi: 10.1109/ICECA.2019.8822022.
[9] Boeing, G.; Waddell, P. (2017). New Insights into Rental Housing Markets across the United States: Web Scraping and Analyzing Craigslist Rental Listings. Journal of Planning Education and Research, 37(4), 457–476.
[10] IDRIS, Aizal Yusrina; BAMOALLEM, Razan; MOHAMAD HATTA, Mohamad Harith Azfar. Web Scraping and Regression Analysis based on Machine Learning for COVID-19 with Rapid Software Platform. Mathematical Sciences and Informatics Journal, [S.l.], v. 3, n. 1, p. 75-85, may 2022. ISSN 2735-0703.
[11] 錯排問題 (https://peienwu.com/derangement/)
[12] Hassani, Mehdi. "Derangements and applications.." Journal of Integer Sequences [ electronic only ] 6.1 (2003): Art. 03.1.2, 8 p., electronic only-Art. 03.1.2, 8 p., electronic only. <http://eudml.org/doc/51444>.
[13] Sloane, N.J.A. (編). Sequence A000166 (Subfactorial or rencontres numbers, or derangements: number of permutations of n elements with no fixed points.). The On-Line Encyclopedia of Integer Sequences. OEIS Foundation
[14] Ismail, M.E.H., Simeonov, P. Asymptotics of generalized derangements. Adv Comput Math 39, 101–127 (2013). https://doi.org/10.1007/s10444-011-9271-7 |
Description: | 碩士 國立政治大學 應用數學系 106751016 |
Source URI: | http://thesis.lib.nccu.edu.tw/record/#G0106751016 |
Data Type: | thesis |
Appears in Collections: | [Department of Mathematical Sciences] Theses
|
Files in This Item:
File |
Description |
Size | Format | |
101601.pdf | | 1468Kb | Adobe PDF2 | 150 | View/Open |
|
All items in 政大典藏 are protected by copyright, with all rights reserved.
|