Bayesian Model Selection for Interacting QTL with Many Effects | Papers Genetics

DOI: 10.1534/genetics.107.071365

An Efficient Bayesian Model Selection Approach for Interacting Quantitative Trait Loci Models With Many Effects

Nengjun Yi,* Daniel Shriner,* Samprit Banerjee,* Tapan Mehta,* Daniel Pomp† and Brian S. Yandell‡

*Department of Biostatistics, Section on Statistical Genetics, University of Alabama, Birmingham, Alabama 35294, †Departments of Nutrition, Cell and Molecular Physiology, University of North Carolina, Chapel Hill, North Carolina 27599 and ‡Departments of Statistics, Horticulture and Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin 53706

Manuscript received January 24, 2007

Accepted for publication April 23, 2007

ABSTRACT

We extend our Bayesian model selection framework for mapping epistatic QTL in experimental crosses to include environmental effects and gene–environment interactions. We propose a new, fast Markov chain Monte Carlo algorithm to explore the posterior distribution of unknowns. In addition, we take advantage of any prior knowledge about genetic architecture to increase posterior probability on more probable models. These enhancements have significant computational advantages in models with many effects. We illustrate the proposed method by detecting new epistatic and gene–sex interactions for obesity-related traits in two real data sets of mice. Our method has been implemented in the freely available package R/qtlbim (http://www.qtlbim.org) to facilitate the general usage of the Bayesian methodology for genomewide interacting QTL analysis.

Mapping quantitative trait loci (QTL) involves inferring the genetic architecture of complex traits in terms of genomic regions, gene effect, gene action, and possible interactions, given observed phenotype and marker genotype data (Lynch and Walsh 1998). The variation of most complex traits results from interacting networks of multiple QTL and environmental factors (Reifsnyder et al. 2000; Carlborg and Haley 2004; Moore 2005; Stylianou et al. 2006; Valdar et al. 2006; Wang et al. 2006). Inclusion of gene–gene interactions (epistasis) and gene–environment interactions in mapping QTL is expected to aid the discovery of more QTL, improve the accuracy and precision of estimates of their genomic positions and genetic effects, and enhance our ability to understand the genetic basis of complex traits (Jansen 2003; Carlborg and Haley 2004).

Identification of genomewide interacting QTL has been a formidable challenge for geneticists and statisticians, mainly due to numerous possible variables associated with hundreds or thousands of genomic loci (markers and/or loci within marker intervals) that lead to a huge number of possible models (e.g., Yi et al. 2005). The problem is further complicated by the facts that the genomic loci on the same chromosome are highly correlated and the genotypes at many loci are unobservable. Traditional QTL mapping methods utilize prespecified simple statistical models, which fit the effects of only one or two QTL whose putative positions are scanned across the genome (e.g., Lander and Botstein 1989; Haley and Knott 1992; Jansen and Stam 1994; Zeng 1994). Although successful in many applications, such approaches require prohibitive corrections for multiple testing and ignore the nature of complex traits in statistical modeling.
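To give a rough sense of how quickly the model space grows, the following minimal sketch counts candidate models under a deliberately simplified assumption that is not taken from the paper: each locus contributes a single main effect and each pair of loci a single interaction term, and a model is any subset of those effects. The function names `count_effects` and `count_models` are hypothetical and exist only for this illustration.

```python
from math import comb


def count_effects(n_loci: int) -> int:
    """Candidate effects under the simplifying assumption of one main
    effect per locus plus one interaction per pair of loci."""
    return n_loci + comb(n_loci, 2)


def count_models(n_loci: int) -> int:
    """Number of distinct subsets of the candidate effects."""
    return 2 ** count_effects(n_loci)


for k in (5, 10, 100):
    effects = count_effects(k)
    # Report the order of magnitude rather than the full integer.
    magnitude = len(str(count_models(k))) - 1
    print(f"{k} loci: {effects} candidate effects, about 10^{magnitude} models")
```

Real crosses usually carry more than one effect per locus (for example, additive and dominance effects), so this count should be read as a lower-end illustration rather than the figure for any particular design.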

Multiple-QTL mapping has been viewed as a model selection issue (Broman and Speed 2002; Sillanpää and Corander 2002; Yi 2004). Rather than fitting prespecified models to the observed data, model selection approaches proceed by identifying the QTL models from a set of potential QTL models that are best supported by the data. Various model selection methods have been recently proposed for genomewide multiple-QTL mapping from both frequentist and Bayesian perspectives. Frequentist approaches sequentially add or delete QTL using forward and backward or stepwise selection procedures and apply criteria such as P-values or a modified Bayesian information criterion (BIC) to identify the “best multiple-QTL model” (Kao et al. 1999; Carlborg et al. 2000; Reifsnyder et al. 2000; Bogdan et al. 2004; Baierl et al. 2006). Such methods usually pick a single “good” (and maybe useful) model, ignoring the uncertainty about the model itself in the final inference (Raftery et al. 1997; George 2000; Kadane and Lazar 2004).
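The frequentist strategy described above can be sketched as a greedy forward search scored by BIC. This is an illustration under stated assumptions, not any of the cited methods: it uses the ordinary BIC rather than a modified BIC, simulated and unlinked ±1 marker codes, main effects only, and hypothetical helper names (`bic`, `forward_select`). It assumes NumPy is available.

```python
import numpy as np


def bic(y, X):
    """Ordinary BIC of a least-squares fit of y on X (X holds the intercept)."""
    n = len(y)
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    rss = float(np.sum((y - X @ beta) ** 2))
    return n * np.log(rss / n) + X.shape[1] * np.log(n)


def forward_select(y, markers):
    """Greedily add the marker that lowers BIC most; stop when none does."""
    n, p = markers.shape
    chosen = []
    design = np.ones((n, 1))  # intercept-only starting model
    best = bic(y, design)
    while len(chosen) < p:
        scores = {
            j: bic(y, np.column_stack([design, markers[:, j]]))
            for j in range(p)
            if j not in chosen
        }
        j_best = min(scores, key=scores.get)
        if scores[j_best] >= best:
            break  # no remaining marker improves the criterion
        chosen.append(j_best)
        design = np.column_stack([design, markers[:, j_best]])
        best = scores[j_best]
    return chosen, best


# Simulated data (assumption): 200 individuals, 20 independent markers,
# with true effects at marker indices 3 and 11.
rng = np.random.default_rng(0)
markers = rng.choice([-1.0, 1.0], size=(200, 20))
y = 1.0 * markers[:, 3] - 0.8 * markers[:, 11] + rng.normal(scale=0.5, size=200)

chosen, score = forward_select(y, markers)
print("selected markers:", sorted(chosen))
```

The search returns one model and discards every alternative it visited, which is the limitation noted above: the final inference carries no measure of how much support competing models had.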

Several Bayesian model selection approaches for mapping multiple QTL have been developed over the past decade (Satagopan and Yandell 1996; Satagopan et al. 1996; Heath 1997; Sillanpää and Arjas 1998; Stephens and Fisch 1998; Gaffney 2001; Hoeschele 2001; Sen and Churchill 2001; Xu 2003; Wang et al. 2005; Zhang

Corresponding author: Department of Biostatistics, University of Alabama, Birmingham, AL 35294-0022. E-mail: [email protected]

Genetics 176: 1865–1877 (July 2007)

