TY - JOUR
T1 - Structural and phylogenetic comparison of napsin genes
T2 - The duplication, loss of function and human-specific pseudogenization of napsin B
AU - Nishishita, Kazuhisa
AU - Sakai, Eiko
AU - Okamoto, Kuniaki
AU - Tsukuba, Takayuki
N1 - Funding Information:
This study was supported in part through a Grant-in-Aid for Scientific Research from the Ministry of Education, Culture, Sports, Science and Technology of Japan ( 20592175 and 24592836 to KN).
PY - 2013/4/1
Y1 - 2013/4/1
N2 - Aspartic proteinases form a widely distributed protein superfamily, including cathepsin D, cathepsin E, pepsins, renin, BACE and napsin. Human napsin genes are located on human chromosome 19q13, which comprises napsin A and napsin B. Napsin B has been annotated as a pseudogene because it lacks an in-frame stop codon; its nascent chains are cotranslationally degraded. Until recently, there have been no studies concerning the molecular evolution of the napsin protein family in the human genome. In the present study, we investigated the evolution and gene organization of the napsin protein family. Napsin B orthologs are primarily distributed in primates, while napsin A orthologs are the only napsin genes in other species. The corresponding regions of napsin B in the available sequences from primate species contain an in-frame stop codon at a position equivalent to that of human napsin A. In addition, a rare single-nucleotide polymorphism (SNP) that creates a proper stop codon in human napsin B was identified using HapMap populations. Recombinant protein expression and three-dimensional comparative modeling revealed that napsin B exhibits residual activity toward synthetic aspartic protease substrates compared with napsin A, presumably through a napsin B-specific Arg287 residue. Thus, napsin B was duplicated from napsin A during the early stages of primate evolution, and the subsequent loss of napsin B function during primate evolution reflected ongoing human-specific napsin B pseudogenization.
AB - Aspartic proteinases form a widely distributed protein superfamily, including cathepsin D, cathepsin E, pepsins, renin, BACE and napsin. Human napsin genes are located on human chromosome 19q13, which comprises napsin A and napsin B. Napsin B has been annotated as a pseudogene because it lacks an in-frame stop codon; its nascent chains are cotranslationally degraded. Until recently, there have been no studies concerning the molecular evolution of the napsin protein family in the human genome. In the present study, we investigated the evolution and gene organization of the napsin protein family. Napsin B orthologs are primarily distributed in primates, while napsin A orthologs are the only napsin genes in other species. The corresponding regions of napsin B in the available sequences from primate species contain an in-frame stop codon at a position equivalent to that of human napsin A. In addition, a rare single-nucleotide polymorphism (SNP) that creates a proper stop codon in human napsin B was identified using HapMap populations. Recombinant protein expression and three-dimensional comparative modeling revealed that napsin B exhibits residual activity toward synthetic aspartic protease substrates compared with napsin A, presumably through a napsin B-specific Arg287 residue. Thus, napsin B was duplicated from napsin A during the early stages of primate evolution, and the subsequent loss of napsin B function during primate evolution reflected ongoing human-specific napsin B pseudogenization.
KW - Gene duplication
KW - Napsin
KW - Phylogeny
KW - Pseudogenization
KW - SNP
KW - Synteny
UR - http://www.scopus.com/inward/record.url?scp=84873746742&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84873746742&partnerID=8YFLogxK
U2 - 10.1016/j.gene.2013.01.013
DO - 10.1016/j.gene.2013.01.013
M3 - Article
C2 - 23333608
AN - SCOPUS:84873746742
SN - 0378-1119
VL - 517
SP - 147
EP - 157
JO - Gene
JF - Gene
IS - 2
ER -