|
English
|
正體中文
|
简体中文
|
Post-Print筆數 : 27 |
Items with full text/Total items : 113303/144284 (79%)
Visitors : 50803190
Online Users : 809
|
|
|
Loading...
|
Please use this identifier to cite or link to this item:
https://nccur.lib.nccu.edu.tw/handle/140.119/109731
|
Title: | 充分維度縮減在基因集分析上的運用 |
Authors: | 薛慧敏 |
Contributors: | 統計系 |
Keywords: | 基因集分析;差異共變;充分維度縮減;非線性相關 Gene set analysis;Differential coexpression;Sufficient dimension reduction;Non-linear associations |
Date: | 2016 |
Issue Date: | 2017-05-17 16:31:06 (UTC+8) |
Abstract: | 在基因微陣列(microarray)實驗中,基因集分析(gene-set analysis, GSA)的目的為檢定多個基因所形成的集合與外顯表現變數(phenotype)的相關顯著性。目前已有多個公開資料庫提供基因組相關資訊。例如分子特徵資料庫(MSigDB)中包含數個系列,其中包括彙整其他基因資料庫以及生物醫學相關學術期刊的結果所定義之基因庫。這些基因集合依據基因之生物功能將基因歸類。當外顯表現變數為二元或類別型態時,文獻上已發表的基因集分析方法多數是偵測基因表現量的平均差異。Cook與Weisberg (1991)曾提出的「切片平均變異法」(sliced average variance estimation)來估計基因資料的充分維度縮減(sufficient dimension reduction)中央子空間(central subspace),若基因集與外顯變數無關,則該空間的維度應當為零。所以我們提出以檢定”中央子空間維度為零”的假設以評估該基因集的顯著性。本方法將可掘取及運用資料中更豐富的資訊,而且本方法將適用於類別、量化的外顯變數資料。運用電腦模擬,我們驗證本方法的有效性。 Gene set analysis (GSA) aims to evaluate the association between the expression of biological pathways, or a priori defined gene sets, and a particular phenotype. Numerous GSA methods have been proposed to assess the enrichment of sets of genes. However, most methods are developed with respect to a specific alternative scenario, such as a differential mean pattern or a differential coexpression. Moreover, a very limited number of methods can handle either binary, categorical, or continuous phenotypes. In this paper, we develop two novel GSA tests, called SDRs, based on the sufficient dimension reduction technique, which aims to capture sufficient information about the relationship between genes and the phenotype. The advantages of our proposed methods are that they allow for categorical and continuous phenotypes, and they are also able to identify a variety of enriched gene sets. Through simulation studies, we compared the type I error and power of SDRs with existing GSA methods for binary, triple, and continuous phenotypes. We found that SDR methods adequately control the type I error rate at the pre-specified nominal level, and they have a satisfactory power to detect gene sets with differential coexpression and to test non-linear associations between gene sets and a continuous phenotype. In addition, the SDR methods were compared with seven widely-used GSA methods using two real microarray datasets for illustration. We concluded that the SDR methods outperform the others because of their flexibility with regard to handling different kinds of phenotypes and their power to detect a wide range of alternative scenarios. Our real data analysis highlights the differences between GSA methods for detecting enriched gene sets. |
Relation: | MOST 104-2118-M-004-002 |
Data Type: | report |
Appears in Collections: | [統計學系] 國科會研究計畫
|
Files in This Item:
File |
Description |
Size | Format | |
104-2118-M-004-002.pdf | | 1691Kb | Adobe PDF2 | 409 | View/Open |
|
All items in 政大典藏 are protected by copyright, with all rights reserved.
|
著作權政策宣告 Copyright Announcement1.本網站之數位內容為國立政治大學所收錄之機構典藏,無償提供學術研究與公眾教育等公益性使用,惟仍請適度,合理使用本網站之內容,以尊重著作權人之權益。商業上之利用,則請先取得著作權人之授權。
The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.
2.本網站之製作,已盡力防止侵害著作權人之權益,如仍發現本網站之數位內容有侵害著作權人權益情事者,請權利人通知本網站維護人員(
nccur@nccu.edu.tw),維護人員將立即採取移除該數位著作等補救措施。
NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(
nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.