TY - JOUR
T1 - The Effect of Gene Conversion on the Divergence between Duplicated Genes
AU - Teshima, Kosuke M.
AU - Innan, Hideki
PY - 2004/3
Y1 - 2004/3
N2 - Nonindependent evolution of duplicated genes is called concerted evolution. In this article, we study the evolutionary process of duplicated regions that involves concerted evolution. The model incorporates mutation and gene conversion: the former increases d, the divergence between two duplicated regions, while the latter decreases d. It is demonstrated that the process consists of three phases. Phase I is the time until d reaches its equilibrium value, d6. In phase II d fluctuates around d0, and d increases again in phase III. Our simulation results demonstrate that the length of concerted evolution (i.e., phase II) is highly variable, while the lengths of the other two phases are relatively constant. It is also demonstrated that the length of phase II approximately follows an exponential distribution with mean τ, which is a function of many parameters including gene conversion rate and the length of gene conversion tract. On the basis of these findings, we obtain the probability distribution of the level of divergence between a pair of duplicated regions as a function of time, mutation rate, and T. Finally, we discuss potential problems in genomic data analysis of duplicated genes when it is based on the molecular clock but concerted evolution is common.
AB - Nonindependent evolution of duplicated genes is called concerted evolution. In this article, we study the evolutionary process of duplicated regions that involves concerted evolution. The model incorporates mutation and gene conversion: the former increases d, the divergence between two duplicated regions, while the latter decreases d. It is demonstrated that the process consists of three phases. Phase I is the time until d reaches its equilibrium value, d6. In phase II d fluctuates around d0, and d increases again in phase III. Our simulation results demonstrate that the length of concerted evolution (i.e., phase II) is highly variable, while the lengths of the other two phases are relatively constant. It is also demonstrated that the length of phase II approximately follows an exponential distribution with mean τ, which is a function of many parameters including gene conversion rate and the length of gene conversion tract. On the basis of these findings, we obtain the probability distribution of the level of divergence between a pair of duplicated regions as a function of time, mutation rate, and T. Finally, we discuss potential problems in genomic data analysis of duplicated genes when it is based on the molecular clock but concerted evolution is common.
UR - http://www.scopus.com/inward/record.url?scp=1942485886&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=1942485886&partnerID=8YFLogxK
U2 - 10.1534/genetics.166.3.1553
DO - 10.1534/genetics.166.3.1553
M3 - Article
C2 - 15082568
AN - SCOPUS:1942485886
SN - 0016-6731
VL - 166
SP - 1553
EP - 1560
JO - Genetics
JF - Genetics
IS - 3
ER -