Identifying genetic variation that underlies the etiology of common and complex disease is one of important interests in current molecular medicine and pharmacogenomics Out of all the genetic variations, the SNPs are known to contribute to 90% of them...
Identifying genetic variation that underlies the etiology of common and complex disease is one of important interests in current molecular medicine and pharmacogenomics Out of all the genetic variations, the SNPs are known to contribute to 90% of them with being almost uniformly distributed across the genome. In addition, the presence of some SNPs can be involved in increasing the risk of complex disease, although most SNPs are not responsible for causing any disease phenotype. Ranking and finding single nucleotide polymorphism (SNPs) that are involved in human diseases, such as cancer, is a one of primary challenge in current disease association studies. Knowledge of such SNPs is expected to enable timely diagnosis, effective treatment and prevention of human disease. Further , many attentions are paid to understand the biological meaning of the identified associated genes of such SNPs. In Genome Wide Association Study (GWAS), SNPs are being genotyped for case-control studies, where the association of SNPs with phenotype is studied. Case and control studies compare frequency of SNP allele in two well defined groups’ individuals.
In GWAS study or disease association study, all the SNPs in given dataset are firstly ordered according to their statistical significance for the further analysis. However, performing ranking SNPs based on only genotype call data impose certain restrictions.
In this thesis, we propose an integrative ranking method, SNPRank, which improves the prioritization of SNPs in disease association study by using Linkage Disequilibrium network. When the dependence between SNP is high, the two SNPs are considered to be in a state of high LD. Proposed SNPRank method allows us to combine linkage disequilibrium connectivity and conventional rank statistics to produce more robust SNP markers in disease association study, compared with traditional methods only based SNP genotype frequency.
For evaluation, we applied this method to identify SNP markers associated with prostate cancer in Castro et al. (2009).