Please use this identifier to cite or link to this item:
https://nccur.lib.nccu.edu.tw/handle/140.119/94723
|
Title: | 結合家庭、病例及病例-對照分析中疾病遺傳訊息的統計方法 Statistical Methods for Combining Genetic Association Information from Family, Case-Only and Case-Control Analyses |
Authors: | 林惠文 Lin, Hui Wen |
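Contributors: | 劉惠美 程毅豪 林惠文 Lin, Hui Wen |
Keywords: | case-control study; case-only study; case-parent study; gene-environment interaction; TDT; combining family data with unrelated control data; combining case-control and case-only analyses; combining case-parent and case-only analyses |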
Date: | 2008 |
Issue Date: | 2016-05-09 11:37:54 (UTC+8) |
Abstract: | In recent years, association analysis of genes and disease has drawn increasing attention, because traditional linkage methods are poorly suited to studying complex diseases and their susceptibility genes. This thesis examines the relative strengths and weaknesses of family-based and population-based data in association analysis, and proposes new methods that combine the two kinds of data to improve the efficiency of estimation and the power of tests. We also take environmental factors into account and study gene-environment interactions. The thesis contains three parts.
The first part examines when and how two sources of information, one from the case-parent/case-sibling analysis and the other from the case-control analysis of affected subjects and unrelated controls, can be integrated to enhance statistical power. We propose a weighted least-squares approach that linearly and optimally combines the separate estimators of the association parameters from the case-parent/case-sibling analysis and from the logistic regression (case-control) analysis.
The second part addresses gene-environment interaction. We propose a two-stage design: case data are collected in the first stage, and the corresponding control data are collected in the second stage. We propose a regression estimator that combines the case-only analysis from the first stage with the case-control analysis from the second stage. The proposed estimator yields correct statistical inference even when the genetic and environmental factors are not independent, a condition that the case-only analysis requires.
The third part addresses gene-environment interaction in the presence of population stratification. We propose a two-stage design in which case data form the first stage, and parental data for a random subsample of the cases form the second stage. We propose a regression estimator that combines the estimators from the case-only analysis and the case-parent analysis, and that still yields valid statistical inference under population stratification. |
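As a rough illustration of the weighted least-squares idea named in the abstract, the sketch below combines two estimates of the same scalar association parameter with the minimum-variance linear weights. It is a generic textbook construction, not the thesis's own estimator: the function name `combine_two_estimates`, the scalar setting, and all numbers are assumptions made for illustration, and the covariance between the two estimators is taken as known.

```python
def combine_two_estimates(b1, v1, b2, v2, cov=0.0):
    """Minimum-variance linear combination of two estimates of one parameter.

    b1, b2 : the two estimates (e.g. log odds ratios; hypothetical here)
    v1, v2 : their variances
    cov    : their covariance (0.0 if the estimators are independent)
    Returns (combined estimate, its variance, (w1, w2)).
    """
    denom = v1 + v2 - 2.0 * cov
    if denom <= 0:
        raise ValueError("v1 + v2 - 2*cov must be positive")
    # Weights sum to 1; with cov = 0 this is inverse-variance weighting.
    w1 = (v2 - cov) / denom
    w2 = (v1 - cov) / denom
    combined = w1 * b1 + w2 * b2
    variance = (v1 * v2 - cov ** 2) / denom
    return combined, variance, (w1, w2)


# Hypothetical numbers: a family-based estimate and a case-control estimate.
est, var, weights = combine_two_estimates(b1=0.40, v1=0.04, b2=0.60, v2=0.01)
print(est, var, weights)
```

With independent estimators the more precise one receives the larger weight, and the combined variance is no larger than either input variance. When the two analyses share the same cases, the estimators are generally correlated, so the `cov` argument would need a real estimate rather than the default of zero.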
Description: | Doctoral thesis; National Chengchi University; Department of Statistics; 92354505 |
Source URI: | http://thesis.lib.nccu.edu.tw/record/#G0923545051 |
Data Type: | thesis |
Appears in Collections: | [Department of Statistics] Theses and Dissertations
|
All items in 政大典藏 are protected by copyright, with all rights reserved.
|