Skip to main navigation Skip to search Skip to main content

A proximity-based method to identify genomic regions correlated with a continuously varying eenvironmental variable

  • Cornelia Di Gaetano
  • , Ggiuseppe Matullo
  • , Alberto Piazza
  • , Moreno Ursino
  • , Mauro Ggasparini

Research output: Contribution to journalArticlepeer-review

Abstract

Knowledge of markers in the human genome which show spatial patterns and display extreme correlation with different environmental determinants play an important role in understanding the factors which affect the biological evolution of our species. We used the genotype data of more than half a million single nucleotide polymorphisms (SNPs) from the data set Human Genome Diversity Panel (HGDP-CEPH -CEPH) and we calculated Spearman's correlation between absolute latitude and one of the two allele frequencies of each SNP. We selected SNPs with a correlation coefficient within the upper 1% tail of the distribution. We then used a criterion of proximity between significant variants to focus on DNA regions showing a continuous signal over a portion of the genome. Based on external information and genome annotations, we demonstrated that most regions with the strongest signals also have biological relevance. We believe this proximity requirement adds an edge to our novel method compared to the existing literature, highlighting several genes (for example DTNB, DOT1L, TPCN2, RELN, MSRA, NRG3) related to body size or shape, human height, hair color, and schizophrenia. Our approach can be applied generally to any measure of association between polymorphic frequencies and continuously varying environmental variables.

Original languageEnglish
Pages (from-to)29-42
Number of pages14
JournalEvolutionary Bioinformatics
Volume2013
Issue number9
DOIs
Publication statusPublished - 2013
Externally publishedYes

Keywords

  • Adaptations
  • Latitude
  • Outlier approach
  • Point processes
  • Spatial patterns

Fingerprint

Dive into the research topics of 'A proximity-based method to identify genomic regions correlated with a continuously varying eenvironmental variable'. Together they form a unique fingerprint.

Cite this