Heritability Estimates in Twin Studies: Comparing NACE and Falconer's Formula | Assignments Mathematical Modeling and Simulation

ISYE 6644: SIMULATION: EVALUATING CONVERGENCE OF HERITABILTIY ESTIMATES

ACROSS TRAITS OF INTEREST VIA GENETIC MODELS AND SIMULATION

Georgia Institute of Technology, USA

ABSTRACT

Assessing the genetic impact of an individual is of crucial im-

portance across a variety of domains in human health, disease,

and function. Inferring this impact necessitates the utilization

of twin data and genetic modelling techniques. Such genetic

models make inherent assumptions about the data, and it is

of interest to investigate the model performance across dif-

ferent conditions using simulation. For two models, Normal

ACE model and Falconer’s method, input data simulated from

bivariate Lagrangian Poisson (blgp), multivariate t, and bi-

variate normal and relevant modulations. It was found that

an increase in sample size resulted in a decrease in the av-

erage and true standard error of genetic estimates. When the

data was normally distributed, the standard error (average and

true) converged. The greater the normality of the simulated

twin data, the greater performance both models had across

coverage rates and average and true standard errors. When the

model assumptions for ACE were violated (non-normality of

data, unequal variances across MZ and DZ groups) the model

resulted in biased estimates of heritability and associated cov-

erage rates. Lastly, the coverage rates for Falconer’s model

appears to contradict that found in [1], but otherwise the re-

sults strongly converge to that of [1].

Index Terms—Twin study, heritability, simulation

1. BACKGROUND & DESCRIPTION OF PROBLEM

Assessing the enviornmental and genetic impact of an indi-

vidual is of crucial importance across a variety of domains in

human health, disease, and function. Inferring these factors

necessitates either strict genetic data (one’s genome) or, more

often, the utilization of twin data [1]. Twins can be separated

into two types, monozygotic (MZ) or dizygotic (DZ) [1].

What defines a twin in general is that they are born at the

same time but in which two distinct processes had occurred

even so. Specifically, MZ twins share 100% of their genomic

information with one another, and DZ twins share 50% on

average [1]. The similarity in MZ twins, and the fact that

they were born at the same time point (essentially), allows for

any differences in the twins to be more indicative of environ-

mental influences than that of genetic influences (since, by

definition, their genomes are identical). However, this attri-

bution of genetic influences can also be decomposed further

into additive genetics and dominance (the second of which is

not an interest in this study) [1]. Specifically, it is possible,

using both MZ and DZ twins to evaluate heritability which

is a function of the genetic components (defined differently

per model) of the trait of interest. Two models widely used

in the twin study literature (and genetics literature) are Fal-

coner’s Formula and the Normal ACE (NACE) model, both

of which make implict assumptions about the input data from

MZ and DZ twins, and both of which have their own benefits

and disadvantages. Hence, it is important to talk about the

mathematical properties of both models.

The NACE model utilizes MZ and DZ twin data in order

to model the heritability of a given trait of interest. In doing

so, it effectively breaks down the trait covariance of each

MZ or DZ twin into additive genetics (A), common shared

environment (C), and nonshared environment (E) variance

components [1]. This approach is done via structural equa-

tion modelling (SEM). For the NACE model, notably, there

are key assumptions which allow it to achieve the results it

does. Firstly, any given trait of interest (which is what would

be investigated using the data of the MZ and DZ twin pairs

that one has access to) is normally distributed [1]. Secondly,

the NACE model assumes that the ACE variance parameters

are equal for both MZ twin pairs and DZ twin pairs [1].

In contrast, another model, Falconer’s Formula, which is

a distribution free method of moment estimators [1] makes no

assumptions regarding the variance across MZ twin pairs and

DZ twin pairs (i.e. they are allowed to be unequal), with the

additional assumption that the proportion of the total variance

(for genetics or environmental effects) are the same [1].

Therefore there are key differences in assumptions be-

tween the NACE model and Falconer’s formula. Further-

more, given the wide use of these models across the literature

in looking at differently distributed traits (which are of wide

interest in a variety of fields as they relate to genetics, human

behavior, disease, and biomarkers), there is a great need to

evaluate the effect of these assumptions that these models

hold on the resultant parameter estimates in these scenarios.

In doing so, this work can be considered for future traits of

interest when utilizing models, the approach can be consid-

ered for other models of interest, and modulations therein can

be expanded upon based on the implementation herein [1].

Heritability Estimates in Twin Studies: Comparing NACE and Falconer's Formula, Assignments of Mathematical Modeling and Simulation

Related documents

Partial preview of the text

Download Heritability Estimates in Twin Studies: Comparing NACE and Falconer's Formula and more Assignments Mathematical Modeling and Simulation in PDF only on Docsity!

ISYE 6644: SIMULATION: EVALUATING CONVERGENCE OF HERITABILTIY ESTIMATES

ACROSS TRAITS OF INTEREST VIA GENETIC MODELS AND SIMULATION

Georgia Institute of Technology, USA

ABSTRACT

1. BACKGROUND & DESCRIPTION OF PROBLEM

2. THEORY

I =

[

]

, J =

[

]

[

]

[

]

[

7. REFERENCES