TY - JOUR
T1 - Efficient network-guided multi-locus association mapping with graph cuts
AU - Azencott, Chloé Agathe
AU - Grimm, Dominik
AU - Sugiyama, Mahito
AU - Kawahara, Yoshinobu
AU - Borgwardt, Karsten M.
N1 - Funding Information:
Funding: C.A. is funded by an Alexander von Humboldt fellowship. This work was partially funded by the DFG project Kernels for Large, Labeled Graphs (LaLa).
PY - 2013/7/1
Y1 - 2013/7/1
N2 - Motivation: As an increasing number of genome-wide association studies reveal the limitations of the attempt to explain phenotypic heritability by single genetic loci, there is a recent focus on associating complex phenotypes with sets of genetic loci. Although several methods for multi-locus mapping have been proposed, it is often unclear how to relate the detected loci to the growing knowledge about gene pathways and networks. The few methods that take biological pathways or networks into account are either restricted to investigating a limited number of predetermined sets of loci or do not scale to genome-wide settings.Results: We present SConES, a new efficient method to discover sets of genetic loci that are maximally associated with a phenotype while being connected in an underlying network. Our approach is based on a minimum cut reformulation of the problem of selecting features under sparsity and connectivity constraints, which can be solved exactly and rapidly.SConES outperforms state-of-the-art competitors in terms of runtime, scales to hundreds of thousands of genetic loci and exhibits higher power in detecting causal SNPs in simulation studies than other methods. On flowering time phenotypes and genotypes from Arabidopsis thaliana, SConES detects loci that enable accurate phenotype prediction and that are supported by the literature.
AB - Motivation: As an increasing number of genome-wide association studies reveal the limitations of the attempt to explain phenotypic heritability by single genetic loci, there is a recent focus on associating complex phenotypes with sets of genetic loci. Although several methods for multi-locus mapping have been proposed, it is often unclear how to relate the detected loci to the growing knowledge about gene pathways and networks. The few methods that take biological pathways or networks into account are either restricted to investigating a limited number of predetermined sets of loci or do not scale to genome-wide settings.Results: We present SConES, a new efficient method to discover sets of genetic loci that are maximally associated with a phenotype while being connected in an underlying network. Our approach is based on a minimum cut reformulation of the problem of selecting features under sparsity and connectivity constraints, which can be solved exactly and rapidly.SConES outperforms state-of-the-art competitors in terms of runtime, scales to hundreds of thousands of genetic loci and exhibits higher power in detecting causal SNPs in simulation studies than other methods. On flowering time phenotypes and genotypes from Arabidopsis thaliana, SConES detects loci that enable accurate phenotype prediction and that are supported by the literature.
UR - http://www.scopus.com/inward/record.url?scp=84879900677&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84879900677&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btt238
DO - 10.1093/bioinformatics/btt238
M3 - Article
C2 - 23812981
AN - SCOPUS:84879900677
SN - 1367-4803
VL - 29
SP - i171-i179
JO - Bioinformatics
JF - Bioinformatics
IS - 13
ER -