An integrative approach to predicting the functional effects of non-coding and coding sequence variation

HA Shihab, MF Rogers, J Gough, M Mort… - …, 2015 - academic.oup.com
Bioinformatics, 2015academic.oup.com
Motivation: Technological advances have enabled the identification of an increasingly large
spectrum of single nucleotide variants within the human genome, many of which may be
associated with monogenic disease or complex traits. Here, we propose an integrative
approach, named FATHMM-MKL, to predict the functional consequences of both coding and
non-coding sequence variants. Our method utilizes various genomic annotations, which
have recently become available, and learns to weight the significance of each component …
Abstract
Motivation: Technological advances have enabled the identification of an increasingly large spectrum of single nucleotide variants within the human genome, many of which may be associated with monogenic disease or complex traits. Here, we propose an integrative approach, named FATHMM-MKL, to predict the functional consequences of both coding and non-coding sequence variants. Our method utilizes various genomic annotations, which have recently become available, and learns to weight the significance of each component annotation source.
Results: We show that our method outperforms current state-of-the-art algorithms, CADD and GWAVA, when predicting the functional consequences of non-coding variants. In addition, FATHMM-MKL is comparable to the best of these algorithms when predicting the impact of coding variants. The method includes a confidence measure to rank order predictions.
Availability and implementation: The FATHMM-MKL webserver is available at: http://fathmm.biocompute.org.uk
Contact:  H.Shihab@bristol.ac.uk   or  Mark.Rogers@bristol.ac.uk  or  C.Campbell@bristol.ac.uk
Supplementary information:  Supplementary data are available at Bioinformatics online.
Oxford University Press