Abstract
Sepiapterin reductase (SR) catalyzes the final steps of BH4 biosynthesis. Previously, a gene encoding SR has been cloned and characterized from a Drosophila cDNA library in vitro. The present study reports the identification of another SR gene in the Drosophila genome and the structural characteristics and differences of the two Drosophila SRs, using homology modeling analysis. Homology modeling of SRs for protein structure and function prediction showed that the two SRs have different surface electrostatic distributions and different shapes of the substrate (sepiapterin)-binding sites. These results provide valuable insight into the possibility of diverse functions of Drosophila SRs in vivo.
Introduction
Sepiapterin reductase (SR) is an important enzyme that catalyzes the final step of tetrahydrobiopterin (BH4) biosynthesis [1, 2], i.e., NADP(H)-dependent conversion of 6-pyruvoyl tetrahydrobiopterin to BH4. BH4 is found ubiquitously in higher animals and is a multifunctional cofactor for various physiological processes, including the biosynthesis of monoamine neurotransmitters and nitric oxide, as well as the proliferation of erythroid cells [3–5]. BH4 also acts as an essential cofactor for the oxidative cleavage of ether lipid [6–8]. BH4 deficiency in humans can lead to severe neurological disorders, such as hyperphenylalaninemia, phenylketonuria, Parkinson’s disease, Alzheimer’s disease, and depression [2]. SR has been identified in human, mouse, and rat genomes, and in-depth comparison between all SRs submitted to protein sequence databases has revealed high sequence homology between these SRs [9–11]. Furthermore, X-ray crystallographic analyses of SRs in other organisms have revealed that SRs share well-conserved amino acid residues in their active sites [12, 13].
In Drosophila, SR has been cloned, sequenced, and characterized from the Drosophila adult head cDNA library, and the sequence identity of Drosophila SR compared to mammalian SRs was approximately 29%. In vitro studies show that recombinant SR, obtained from Drosophila cDNA clones, has SR enzymatic activity [14, 15]. Recently, this SR gene (CG12117) was named Sptr. We found another SR gene (CG12116) in the Drosophila genome, near the sptr gene, that has not yet been identified. Because it is not yet known which of the two Drosophila SRs is the main enzyme in the BH4 biosynthetic pathway, or what the functional difference between these two enzymes is, we first determined the three-dimensional (3D) structure of the Drosophila SRs using homology modeling and identified the structural differences in the substrate-binding sites involved in the catalytic reaction of each Drosophila SR.
Materials and methods
Multiple alignments and phylogenetic analysis
Homologous sequences were searched against SR proteins using the BLAST program, available from the National Center for Biotechnology Information (NCBI). The sequence alignment of SR proteins from different organisms was carried out using the ClustalW2 program (EMBL-EBI, Hinxton, Cambridgeshire, UK) and the MUSCLE software version 3.8.31 (EMBL-EBI, Hinxton, Cambridgeshire, UK) [16]. Phylogenic analysis was performed using the ClustalW2 program and the Molecular Evolutionary Genetics Analysis (MEGA, v. 6.0) software program [17]. The validity of the various branches of phylogenetic trees was tested by bootstrap analysis, using 1000 replications.
Homology modeling
Three-dimensional structure predictions of Drosophila SRs and zebrafish SR were generated by the I-TASSER server for protein structure and function prediction based on a threading alignment algorithm [18, 19]. All structural images were prepared with the Molegro Molecular Viewer software v. 2.5.0 (Molegro ApS, Aarhus C, Denmark).
Evaluation of model
The best structural model of Drosophila SR was chosen on the basis of the stereochemistry quality report, generated using the RAMPAGE server (Crystallography and Bioinformatics group at the University of Cambridge, Old Addenbrooke’s, Cambridge, UK).
Results and discussion
Identification of Drosophila SRs
A homology search against the Drosophila database yielded two highly similar genes: CG12117 and CG12116. Two different SR genes are located on chromosome X of Drosophila at the 7E11 (CG12117) and 7F1 (CG12116) loci; these genes do not contain introns. The molecular weight of CG12117 and CG12116, predicted from the deduced amino acid sequences, is 29.2 and 31.0 kDa, respectively. Recently, the CG12117 gene was named Sptr. Although both genes were predicted to have SR activity, only CG12117 showed SR enzymatic activity in vitro [14]. The deduced amino acid sequences were compared with the SR sequences of other organisms [9–11] (Figure 1A). Drosophila CG12117 and CG12116 respectively exhibited approx. 31% and 23% sequence similarities to the SR proteins of other organisms (mouse, rat, human, zebrafish); in addition, 67 and 114 residues were fully conserved and less conserved in the two Drosophila SRs, respectively. Furthermore, both Drosophila SRs were predicted to share the same folds, including eight α-helices and seven β-sheets (Figure 1A). Most SRs, in other organisms, have a Rossmann fold: a -G-X-X-G-X-G- sequence for NADP(H) binding of the N-terminal cofactor, a strictly conserved serine, and a -Y-X-X-X-K- sequence motif (a Ser-Tyr-Lys triad), which is implicated in the substrate-binding site [20, 21]. In the Drosophila CG12117 and CG12116 sequence alignments, CG12117 was similar to other SRs, sharing a common -G-X-X-G-X-G- sequence in the NADP(H) cofactor-binding site and a highly conserved Ser-Tyr-Lys triad motif in the substrate-binding site. However, the amino acid residues of CG12116, corresponding to glycine in the Rossmann fold and to tyrosine in the Ser-Tyr-Lys triad motif of other SRs, are replaced with proline and leucine residues, respectively. These residues, proposed to be important for substrate binding, are relatively less conserved in CG12116 (Figure 1A, black triangles and asterisks).

Multiple sequence alignment of SR homologues. (A) The amino acid sequence alignment of SR homologues from five species was performed using the ClustalW2 program. Strictly conserved residues are shaded in black, and similar residues are colored gray. The NADP(H)-binding site in the N-terminus and the catalytic site in the C-terminus are represented by black triangles (▴) and asterisks (*), respectively. The representative position of the α-helix (curved lines) and β-sheet (black arrow) is shown above the alignment. The amino acid sequences are mouse SR (Mus muculus, NP_035597.2), rat SR (Rattus norvegicus, NP_062054.1), human SR (Homo sapiens, NP_003115.1), zebrafish SR (Danio rerio, NP_001019601.2), fly CG12117 (Drosophila melanogaster, NP_727265.1), and fly CG12116 (Drosophila melanogaster, NP_572481.1). The figure was drawn using the BoxShade server. (B) Phylogenetic analysis of six SRs. Protein sequences were aligned using ClustalW2, and the phylogenic tree was constructed by the MEGA 6.0 program, using the maximum-likelihood method. The genetic distances are displayed on each internal branch.
The phylogenetic relationships between SRs from various organisms were examined by constructing a rooted phylogenetic tree based on the SR protein sequences, using the maximum-likelihood method. The phylogenetic tree shows similarities between the SR protein sequences of the rat, mouse, and human. However, Drosophila CG12116 showed marked differences from other SRs, suggesting a different method of evolution for CG12116 (Figure 1B).
Comparative structural modeling of SRs from various organisms
Although there have been reports on 3D structure analyses of SR from various organisms, i.e., mouse, human, and bacteria, using X-ray crystallography [12, 13], the 3D structure of Drosophila SRs has not been reported yet. To investigate the structural differences and characteristics of Drosophila SRs, structural models of Drosophila SRs were generated, based on the structure of human SR [18, 19, 22]. Five models of CG12117 and four models of CG12116 were generated using the I-TASSER server (Table 1). The most relevant score for models predicted by I-TASSER is the confidence score (C score), with a range of −5 to +2. A model with a C score >−1.5 is considered to be a correct fold and represents higher confidence in the model [18]. Model 1 of both CG12117 and CG12116 with higher C scores was selected as the best structural model and was used for the analyses (Table 1). The model predictions were evaluated using the template modeling score (TM score) and the root mean squared difference (RMSD). The TM score is a measure of the structural similarity between the predicted model and the native structure. A TM score >0.5 indicates a model with correct topology [18]. The TM score of the respective model 1 of CG12117 and CG12116 was 0.75±0.10 and 0.78±0.10, respectively. The expected RMSD was 5.3±3.4 and 5.0±3.2 Å, respectively. Furthermore, the stereochemical quality of model 1 for Drosophila SRs was validated using the RAMPAGE server [23]. A high-quality model would be expected to have >90% of amino acid residues located in the favorable region. RAMPAGE analyses revealed that 90.3% and 86.7% of the residues were located in the favorable region, respectively, and that 6.6% and 9.2% of the residues were present in the allowed region, respectively. Furthermore, only 3.1% and 4.1% of the residues were located in the disallowed region of the Ramachandran plots of the models for CG12117 and CG12116, respectively (Figure 2). These results suggest that our predicted structural models for Drosophila SRs are reliable and of high quality.

Homology model validation using the RAMPAGE server. (A) Ramachandran plot for Model 1 of CG12117. (B) Ramachandran plot for Model 1 of CG12116. RAMPAGE analyses showed that 90.3% and 86.7% of the residues are located in the favorable region and that 6.6% and 9.2% of the residues are present in the allowed region, respectively. The analyses also showed that 3.1% and 4.1% of the residues are located in the disallowed region of the Ramachandran plots of the models for CG12117 and CG12116, respectively.
C scores of the Drosophila SR models predicted by the I-TASSER server.
| CG12117 | CG12116 | ||
|---|---|---|---|
| Structure model | C score | Structure model | C score |
| Model 1 | 0.27 | Model 1 | 0.47 |
| Model 2 | −2.01 | Model 2 | −2.05 |
| Model 3 | −3.05 | Model 3 | 0.26 |
| Model 4 | −1.51 | Model 4 | 0.46 |
| Model 5 | −1.38 | ||
A seven-stranded parallel β-sheet in the center of the SR is sandwiched by two arrays of three α-helices, and these structural characteristics are found in other short-chain dehydrogenase/reductases, including mouse SR, rat SR, human SR, and zebrafish SR [12, 24]. Although the number of stranded parallel β-sheet in the center of the molecule is small (one or two), the overall folding of Drosophila SRs CG12117 and CG12116 is similar to those of other SRs, sharing a common tertiary structure with a cofactor-binding site. The entrance to the NADP(H)-binding sites of the SRs from mouse, rat, human, and zebrafish was shaped like a narrow and deep tunnel (Figure 3A–D). However, the cofactor-binding sites of Drosophila SRs were widely shaped, with open crafts (Figure 3E and F).

Potential electrostatic surfaces of NADP(H)-binding sites. The structures shown are from (A) mouse, (B) rat, (C) human, (D) zebrafish, (E) fly CG12117, and (F) fly CG12116. NADP+ is shown as a stick model, with the carbon, oxygen, nitrogen, and phosphate atoms indicated in gray, red, blue, and orange, respectively.
Different structural characteristics of Drosophila SRs
To find the different structural characteristics of Drosophila SRs, we analyzed the surface electrostatic distributions of the cofactor-binding sites in both Drosophila SRs and compared them to the SRs of other organisms. Interestingly, despite the high structural similarity between CG12117 and CG12116, we found some differences in the NADP(H)-binding site. Consistent with SRs from the other organisms, positively charged amino acid residues in the cofactor-binding sites were observed in CG12117 (Figure 3A–E), whereas only negatively charged amino acid residues were found in CG12116 (Figure 3F). The NADP(H)-binding site of CG12117 contains Gly13, Ser15, Arg16, Gly17, Ile18, Gly41, Arg42, Leu70, Glu71, Thr72, Asn99, Ser125, Leu151, Tyr166, Lys170, Pro196, Gly197, Val198, and Ile199. The NADP(H)-binding site of CG12116 contains Gly18, Ser20, Asn21, Ile22, Gly23, Leu45, Asp46, Leu75, Asp76, Asn110, Glu111, Gly112, Val162, Thr163, Ser164, Leu177, Lys181, Pro207, Gly208, Met210, Thr212, and Cys217. The structural model of Drosophila SRs in the ternary complex (with NADP+ and sepiapterin) revealed the binding site of sepiapterin to the active site of CG12117, but not to that of CG12116 (Figure 4A and B). The entrance and the substrate-binding pocket of CG12116 (Figure 4B) were smaller and narrower than those of CG12117 SR (Figure 4A, white circle). The structure makes it difficult for the substrate, sepiapterin, to gain access to the appropriate location in the substrate-binding pocket of CG12116. This result is also consistent with the sequence alignment data that showed less conserved active site residues in CG12116. The substrate-binding site of CG12117 is formed by the hydrophobic and polar residues Thr153, Leu154, Met163, Tyr166, Ala195, Gly197, and Val198. The residues that play a role in substrate and cofactor binding were confirmed by I-TASSER using the predicted structure of the ternary complex of CG12117 with NADP+ and sepiapterin (Figure 4C). Hydrogen bonds and electrostatic and steric interactions play a crucial role in protein-ligand interaction by increasing the binding affinity between these molecules. Therefore, the putative interactions between sepiapterin and NADP+, and the substrate-binding site of CG12117 were analyzed. Thr153, Leu154, and Val198 were observed to form hydrogen bonds and steric interactions with sepiapterin (Figure 4D). In addition, Thr72, Thr102, Tyr166, and Lys170 were observed to form hydrogen bonds (of lengths ranging from 2.00 to 3.43 Å) with NADP+. Arg16, Ile18, Leu47, Thr102, Thr153, Tyr166, Lys170, Gly97, and Arg42 formed steric and electrostatic interactions (of lengths ranging from 2.00 to 4.25 Å) with NADP+ (Figure 4E). These predicted structural differences could explain why one of the two Drosophila SRs, CG12117, is the only functional sepiapterin reductase in vivo. However, the enzymatic activities of the two SRs must be further investigated and compared. These results suggest the possibility that CG12116 is a pseudogene that exhibits a different substrate selectivity and a different function.

Structural comparison of substrate-binding sites in Drosophila SRs. (A and B) Potential surface electrostatic distributions of the substrate-binding site of two Drosophila SRs: CG12117 (A) and CG12116 (B). Positively charged surface distributions are colored in blue and the negatively charged distributions are colored in red. NADP+ and biopterin are shown as stick models, with the carbon, oxygen, nitrogen, and phosphate atoms indicated in gray, red, blue, and orange, respectively. The substrate-binding pocket is indicated by the white dotted circle. (C) Model displaying the binding of the substrate, NADP+, and biopterin at the substrate-binding site of CG12117. Blue dashed lines indicate the hydrogen bonds. (D and E) Schematic 2D diagram of the predicted interactions between biopterin and NADP+, and of the substrate-binding site of CG12117. Blue, green, and red dashed lines indicate the hydrogen bonds and the electrostatic and steric interactions, respectively. The corresponding bond distances (Å) are displayed according to the color coding of the interactions. The figures were generated using the Molegro Molecular Viewer program.
Conclusions
We describe here the 3D structure of two Drosophila SRs, CG12117 and CG12116, via homology modeling analysis. We showed that the electrostatic surface distributions in the NADP(H)-binding sites of the two SRs are different. Because the structure of the entrance and space of the sepiapterin-binding pocket of CG12116 was smaller and narrower than that of CG12117, sepiapterin cannot bind to the active site of CG12116. These results suggest the possibility that CG12116 has a different substrate selectivity and a different function compared with the other SRs in Drosophila. We will further identify the other substrates for CG12116 to characterize its biochemical properties. Our findings could contribute to a better understanding of the physiological function of Drosophila SRs.
Acknowledgments
This work was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Science, ICT and Future Planning (MSIP) of Korea (2014R1A1A2005183), and by the Soonchunhyang University Research Fund.
References
1. Kaufman S. New tetrahydrobiopterin-dependent systems. Annu Rev Nutr 1993;13:261–86.10.1146/annurev.nu.13.070193.001401Search in Google Scholar PubMed
2. Thony B, Auerbach G, Blau N. Tetrahydrobiopterin biosynthesis, regeneration and functions. Biochem J 2000;347 Pt 1:1–16.10.1042/bj3470001Search in Google Scholar
3. Tanaka K, Kaufman S, Milstien S. Tetrahydrobiopterin, the cofactor for aromatic amino acid hydroxylases, is synthesized by and regulates proliferation of erythroid cells. Proc Natl Acad Sci USA 1989;86:5864–7.10.1073/pnas.86.15.5864Search in Google Scholar PubMed PubMed Central
4. Lowenstein CJ, Snyder SH. Nitric oxide, a novel biologic messenger. Cell 1992;70:705–7.10.1016/0092-8674(92)90301-RSearch in Google Scholar PubMed
5. Marletta MA. Nitric oxide synthase: aspects concerning structure and catalysis. Cell 1994;78:927–30.10.1016/0092-8674(94)90268-2Search in Google Scholar PubMed
6. Watschinger K, Keller MA, McNeill E, Alam MT, Lai S, Sailer S, et al. Tetrahydrobiopterin and alkylglycerol monooxygenase substantially alter the murine macrophage lipidome. Proc Natl Acad Sci USA 2015;112:2431–6.10.1073/pnas.1414887112Search in Google Scholar PubMed PubMed Central
7. Watschinger K, Keller MA, Hermetter A, Golderer G, Werner-Felmayer G, Werner ER. Glyceryl ether monooxygenase resembles aromatic amino acid hydroxylases in metal ion and tetrahydrobiopterin dependence. Biol Chem 2009;390:3–10.10.1515/BC.2009.010Search in Google Scholar PubMed PubMed Central
8. Werner ER, Blau N, Thony B. Tetrahydrobiopterin: biochemistry and pathophysiology. Biochem J 2011;438:397–414.10.1042/BJ20110293Search in Google Scholar PubMed
9. Citron BA, Milstien S, Gutierrez JC, Levine RA, Yanak BL, Kaufman S. Isolation and expression of rat liver sepiapterin reductase cDNA. Proc Natl Acad Sci USA 1990;87:6436–40.10.1073/pnas.87.16.6436Search in Google Scholar PubMed PubMed Central
10. Ota A, Ichinose H, Nagatsu T. Mouse sepiapterin reductase: an enzyme involved in the final step of tetrahydrobiopterin biosynthesis. Primary structure deduced from the cDNA sequence. Biochim Biophys Acta 1995;1260:320–2.10.1016/0167-4781(94)00225-RSearch in Google Scholar PubMed
11. Ichinose H, Katoh S, Sueoka T, Titani K, Fujita K, Nagatsu T. Cloning and sequencing of cDNA encoding human sepiapterin reductase – an enzyme involved in tetrahydrobiopterin biosynthesis. Biochem Biophys Res Commun 1991;179:183–9.10.1016/0006-291X(91)91352-DSearch in Google Scholar
12. Auerbach G, Herrmann A, Gutlich M, Fischer M, Jacob U, Bacher A, et al. The 1.25 A crystal structure of sepiapterin reductase reveals its binding mode to pterins and brain neurotransmitters. EMBO J 1997;16:7219–30.10.1093/emboj/16.24.7219Search in Google Scholar PubMed PubMed Central
13. Supangat S, Seo KH, Choi YK, Park YS, Son D, Han CD, et al. Structure of Chlorobium tepidum sepiapterin reductase complex reveals the novel substrate binding mode for stereospecific production of l-threo-tetrahydrobiopterin. J Biol Chem 2006;281:2249–56.10.1074/jbc.M509343200Search in Google Scholar PubMed
14. Seong C, Kim YA, Chung HJ, Park D, Yim J, Baek K, et al. Isolation and characterization of the Drosophila melanogaster cDNA encoding the sepiapterin reductase. Biochim Biophys Acta 1998;1443:239–44.10.1016/S0167-4781(98)00222-XSearch in Google Scholar
15. Seong C, Baek K, Yoon J. Structure, chromosomal localization, and expression of the Drosophila melanogaster gene encoding sepiapterin reductase. Gene 2000;255:357–61.10.1016/S0378-1119(00)00314-0Search in Google Scholar
16. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 2004;32:1792–7.10.1093/nar/gkh340Search in Google Scholar PubMed PubMed Central
17. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol Biol Evol 2013;30:2725–9.10.1093/molbev/mst197Search in Google Scholar PubMed PubMed Central
18. Roy A, Kucukural A, Zhang Y. I-TASSER: a unified platform for automated protein structure and function prediction. Nat Protoc 2010;5:725–38.10.1038/nprot.2010.5Search in Google Scholar PubMed PubMed Central
19. Zhang Y. I-TASSER server for protein 3D structure prediction. BMC Bioinform 2008;9:40.10.1186/1471-2105-9-40Search in Google Scholar PubMed PubMed Central
20. Fujimoto K, Hara M, Yamada H, Sakurai M, Inaba A, Tomomura A, et al. Role of the conserved Ser-Tyr-Lys triad of the SDR family in sepiapterin reductase. Chem Biol Interact 2001;130-132:825–32.10.1016/S0009-2797(00)00238-6Search in Google Scholar PubMed
21. Jornvall H, Persson B, Krook M, Atrian S, Gonzalez-Duarte R, Jeffery J, et al. Short-chain dehydrogenases/reductases (SDR). Biochemistry 1995;34:6003–13.10.1021/bi00018a001Search in Google Scholar PubMed
22. Roy A, Yang J, Zhang Y. COFACTOR: an accurate comparative algorithm for structure-based protein function annotation. Nucl Acids Res 2012;40:W471–7.10.1093/nar/gks372Search in Google Scholar PubMed PubMed Central
23. Lovell SC, Davis IW, Arendall WB, III, de Bakker PI, Word JM, Prisant MG, et al. Structure validation by Calpha geometry: phi, psi and Cbeta deviation. Proteins 2003;50:437–50.10.1002/prot.10286Search in Google Scholar PubMed
24. Ghosh D, Wawrzak Z, Weeks CM, Duax WL, Erman M. The refined three-dimensional structure of 3 alpha,20 beta-hydroxysteroid dehydrogenase and possible roles of the residues conserved in short-chain dehydrogenases. Structure 1994;2:629–40.10.1016/S0969-2126(00)00064-2Search in Google Scholar
©2015 by De Gruyter
This article is distributed under the terms of the Creative Commons Attribution Non-Commercial License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Articles in the same Issue
- Frontmatter
- Review
- Novel insights in folate receptors and transporters: implications for disease and treatment of immune diseases and cancer
- Original articles
- A comparative structural analysis of sepiapterin reductase from Drosophila by homology modeling
- Functional analysis of sepiapterin reductase in Drosophila melanogaster
- Clinical usefulness of serum neopterin in children with juvenile idiopathic arthritis
- Human breast milk concentration of neopterin at various stages of lactation and during a single feeding
- In psychiatrically healthy individuals, overweight women but not men have lower tryptophan levels
Articles in the same Issue
- Frontmatter
- Review
- Novel insights in folate receptors and transporters: implications for disease and treatment of immune diseases and cancer
- Original articles
- A comparative structural analysis of sepiapterin reductase from Drosophila by homology modeling
- Functional analysis of sepiapterin reductase in Drosophila melanogaster
- Clinical usefulness of serum neopterin in children with juvenile idiopathic arthritis
- Human breast milk concentration of neopterin at various stages of lactation and during a single feeding
- In psychiatrically healthy individuals, overweight women but not men have lower tryptophan levels