
Sequence-dependent cluster analysis of biomineralization peptides

Jose Isagani B. Janairo, Frumencio Co, Jose Santos Carandang and Divina M. Amalin
Published/Copyright: August 11, 2015

Abstract

A reliable and statistically valid classification of biomineralization peptides is presented herein. Twenty-seven biomineralization peptides (BMPeps) were randomly selected as representative samples to establish the classification system using the k-means method. These peptides were discovered either through isolation from various organisms or via phage display. Our findings show that there are two types of biomineralization peptides, distinguished by their length, molecular weight, heterogeneity, and proportion of aliphatic residues. Type-1 BMPeps are more common and exhibit higher values for these significant clustering variables. In contrast, type-2 BMPeps have lower values for these parameters and are less common. Our clustering analysis enables a more efficient and systematic approach to BMPep selection, since previous methods of BMPep classification are unreliable.

1 Introduction

Biomineralization is a nature-inspired method of inorganic nanomaterial synthesis that relies on peptides [1]. The process is based on the inherent ability of most organisms to biosynthesize inorganic nanostructures that aid them in carrying out vital functions [2]. For instance, magnetotactic bacteria produce magnetite nanoparticles, which serve sensing functions [3]. Recently, biomineralization has been widely used to create functional nanomaterials because of the several advantages this biomimetic strategy offers [4]. These advantages include ambient synthetic conditions and the diversity of available metal and peptide combinations, among others. Biomineralization peptides (BMPeps) occupy a central role in this process since they are responsible for guiding the growth of the inorganic nanostructure. BMPeps accomplish this through different mechanisms, such as capping [5] and the regulation of nucleation and reduction [6]. As a result, current research focusses on understanding, controlling and improving this process either through the BMPep [7] or through the environment [8]. Selecting an appropriate BMPep is thus the critical first step in achieving these goals. In the absence of an organized and reliable method of grouping BMPeps, the abundance of known BMPep sequences can make the selection process difficult and confusing. Currently, BMPeps are conveniently classified according to their inorganic substrate. However, this practice is unreliable since most BMPeps can bind to numerous inorganic substrates. A good example is the R5 peptide, which was isolated from Cylindrotheca fusiformis [9]. This 19-residue peptide natively binds to silica, but reports have shown that it can be used to form nanostructures of Ti [10], Pd [11] and Au [12]. Therefore, creating credible and statistically sound clusters will help researchers choose which BMPep is suitable for their system based on a chosen clustering variable.
This will facilitate a more efficient and systematic approach to the selection of the BMPep. In addition, establishing relationships among BMPeps on the basis of common and dissimilar features will aid in the development of a general understanding of the properties of BMPeps. In this paper, we established a 2-cluster solution for 27 reported BMPeps for different metals. The clustering variables were based on the primary structure, since the sequence influences numerous properties of the peptides.

2 Methods

The names and sequences of the 27 BMPeps used in this study are shown in Table 1. The clustering variables are based on the amino acid composition of the BMPep: length, molecular weight (MW), isoelectric point (pI), % heterogeneity, % aliphatic residues, % aromatic residues, % polar residues, % acidic residues, % basic residues, and % sulfur-containing residues. The MW and pI were calculated using ExPASy (http://web.expasy.org/compute_pi/), whereas the other variables were calculated by counting the number of amino acid residues in each category and dividing by the length of the BMPep. The % heterogeneity corresponds to the number of distinct amino acids within the BMPep sequence, that is, without counting repetitions. Using these variables, a two-cluster solution was calculated with the k-means method in Statistica. K-means clustering is a nonhierarchical grouping method that starts with a predefined number of clusters [30]. Two clusters were chosen in this study for simplicity. The significant clustering variables were identified through analysis of variance (ANOVA). At each step, the variable with the highest p-value above the significance level was discarded, and clustering was repeated using the remaining variables. This iterative procedure was repeated until all remaining variables had p-values lower than 0.05. All statistical analyses, including ANOVA and the t-test, were conducted at the 5% significance level.
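The composition-based variables can be illustrated with a short Python sketch. The residue classes below, and the reading of % heterogeneity as the number of distinct residue types divided by the length, are assumptions made for illustration only: the study does not list which residues belong to each category. MW and pI, which were obtained from ExPASy, are left out. The function name `composition_variables` is hypothetical.

```python
# Residue classes are illustrative assumptions; the paper does not list them.
RESIDUE_CLASSES = {
    "aliphatic": set("AVLI"),
    "aromatic": set("FWY"),
    "polar": set("STNQ"),
    "acidic": set("DE"),
    "basic": set("KRH"),
    "sulfur": set("CM"),
}


def composition_variables(sequence):
    """Return length and percentage-based composition variables for one peptide."""
    seq = sequence.strip().upper()
    n = len(seq)
    if n == 0:
        raise ValueError("empty sequence")
    variables = {
        "length": n,
        # Assumption: % heterogeneity = distinct residue types / length x 100.
        "pct_heterogeneity": 100.0 * len(set(seq)) / n,
    }
    for name, members in RESIDUE_CLASSES.items():
        count = sum(1 for residue in seq if residue in members)
        variables["pct_" + name] = 100.0 * count / n
    return variables


# Two sequences taken from Table 1.
r5 = composition_variables("SSKKSGSYSGSKGSKRRIL")
pd4 = composition_variables("TSNAVHPTLRHL")
print(r5["length"], round(r5["pct_heterogeneity"], 1))
print(pd4["length"], round(pd4["pct_heterogeneity"], 1), round(pd4["pct_aliphatic"], 1))
```

Because the class definitions are assumed, the percentages produced by this sketch are not expected to reproduce the values used in the study exactly.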

Table 1

Biomineralization peptides used in this study including their sequences and references.

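| Name | Sequence | Substrate | Reference |
|---|---|---|---|
| HG12 | HGGGHGHGGGHG | Cu | [13] |
| HRE | AHHAHHAAD | Au, Cu, Pd | [14] |
| R5 | SSKKSGSYSGSKGSKRRIL | Silica | [9] |
| A3 | AYSSGAPPMPPF | Au, Pd, Pt | [15] |
| Ag4 | NPSSLFRYLPSD | Ag | [16] |
| AgP35 | WSWRSPTPHVVT | Ag | [17] |
| Col-P10 | HYPTLPLGSSTY | Co | [17] |
| P7A | TLHVSSY | Pt | [18] |
| Flg | DYKDDDK | Pd, Pt | [15] |
| Pd4 | TSNAVHPTLRHL | Pd | [19] |
| Pd2 | NFMSLPRLGHMH | Pd | [19] |
| AuBP1 | WAGAKRLVLRRGE | Au | [20] |
| AuBP2 | WALRRSIRRQSY | Au | [20] |
| GBP1 | MHGKTQATSGTIQS | Au | [21] |
| Midas2 | TGTSVLIATPYV | Au | [22] |
| Z1 | KHKHWHW | Au | [23] |
| Z2 | RMRMKMK | Au | [23] |
| AgBP1 | TGIFKSARAMRN | Ag | [24] |
| AgBP2 | EQLGVRKELRGV | Ag | [25] |
| Ag5 | SLATQPPRTPPV | Ag | [17] |
| Col-P2 | KLHSSPHTLPVQ | Co | [17] |
| Col-P1 | HSVRWLLPGAHP | Co | [17] |
| Col-P15 | QYKHHPQKAAHI | Co | [17] |
| q7 | QQSWPIS | Pd | [26] |
| B7 | CTTCGCG | Ni | [27] |
| LSTB1 | AHKKPSKSA | TiO2 | [28] |
| Pt-1 | YQPWKTQRELSV | Pt | [29] |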

3 Results and discussion

We classified the BMPeps on the basis of their primary structure. The primary structure of a BMPep has a strong effect on the resulting nanostructure: single-residue substitutions within the peptide can produce different nanostructure morphologies [31]. While the diversity of BMPeps is high, several BMPeps share common motifs and conserved residues. For example, the three silver-binding peptides discovered by Naik et al. [16] have identical lengths, and the majority of their residues are conserved across all three BMPeps. Classifying BMPeps on the basis of the similarities and differences in their primary structures is therefore ideal, since it discriminates at the most fundamental level of peptide structure. Out of the ten computed clustering variables, only four were deemed significant after step-wise elimination of the insignificant variables (Table 2). This means that only the length, MW, % heterogeneity and % aliphatic residues are significant variables that can separate the BMPeps into two groups. Of these, molecular weight contributes most to the clustering, since it has the lowest p-value.
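The step-wise elimination can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the Statistica procedure used in the study: the variables are standardized before clustering, the k-means start is deterministic (the first row and the row farthest from it), and the toy data set is invented. Because every variable shares the same degrees of freedom (1, n - 2), the variable with the highest p-value is the one with the smallest F, so the sketch compares F against a tabulated critical value `f_crit` (about 5.99 for n = 8 and about 4.24 for n = 27 at the 5% level) instead of computing p-values. All function names are hypothetical.

```python
import math


def standardize(rows):
    """Scale each column to zero mean and unit (population) variance."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    sds = [math.sqrt(sum((x - m) ** 2 for x in c) / len(c)) or 1.0
           for c, m in zip(cols, means)]
    return [[(x - m) / s for x, m, s in zip(r, means, sds)] for r in rows]


def dist2(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans2(rows, max_iter=100):
    """Two-cluster k-means on standardized rows with a deterministic start."""
    z = standardize(rows)
    far = max(range(len(z)), key=lambda i: dist2(z[0], z[i]))
    centers = [z[0], z[far]]
    labels = None
    for _ in range(max_iter):
        new = [0 if dist2(p, centers[0]) <= dist2(p, centers[1]) else 1 for p in z]
        if new == labels:
            break
        labels = new
        for k in (0, 1):
            members = [p for p, lab in zip(z, labels) if lab == k]
            if members:
                centers[k] = [sum(c) / len(c) for c in zip(*members)]
    return labels


def f_statistic(values, labels):
    """One-way ANOVA F for one variable across two non-empty clusters."""
    groups = [[v for v, lab in zip(values, labels) if lab == k] for k in (0, 1)]
    grand = sum(values) / len(values)
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand) ** 2 for g, m in zip(groups, means))
    ss_within = sum((v - m) ** 2 for g, m in zip(groups, means) for v in g)
    if ss_within == 0:
        return math.inf
    return ss_between / (ss_within / (len(values) - 2))


def select_variables(data, f_crit):
    """Cluster, drop the least significant variable, and repeat until all pass."""
    kept = list(data)
    while kept:
        n = len(data[kept[0]])
        rows = [[data[name][i] for name in kept] for i in range(n)]
        labels = kmeans2(rows)
        f = {name: f_statistic(data[name], labels) for name in kept}
        worst = min(kept, key=f.get)
        if f[worst] >= f_crit:
            return kept, labels
        kept.remove(worst)
    return kept, []


# Invented toy data: two variables separate the groups, one does not.
toy = {
    "length": [12, 13, 12, 13, 7, 8, 7, 8],
    "mw": [1400, 1450, 1500, 1420, 900, 950, 880, 920],
    "noise": [5, 6, 6, 5, 5, 6, 6, 5],
}
kept, labels = select_variables(toy, f_crit=5.99)
print(kept, labels)
```

In this toy case the uninformative variable is removed in the first pass and the remaining two are retained, which mirrors the way six of the ten variables were discarded in the study.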

Table 2

Significant clustering variables based on ANOVA.

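| Variable | Between SS | df | Within SS | df | F | p-Value |
|---|---|---|---|---|---|---|
| Length | 125 | 1 | 72.4 | 25 | 42.97035 | 0.000001 |
| MW | 1,613,150 | 1 | 667699.2 | 25 | 60.39959 | 0.000000 |
| % Heterogeneity | 2128 | 1 | 6086.5 | 25 | 8.74134 | 0.006701 |
| % Aliphatic residues | 1350 | 1 | 6716.0 | 25 | 5.02531 | 0.034096 |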
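As a consistency check on Table 2, the F ratio can be recomputed from the sums of squares as F = (between SS / between df) / (within SS / within df). The short Python sketch below does this with the values read from the table; the helper name `f_ratio` is hypothetical.

```python
# (between SS, between df, within SS, within df, reported F), read from Table 2.
table2 = {
    "Length": (125.0, 1, 72.4, 25, 42.97035),
    "MW": (1613150.0, 1, 667699.2, 25, 60.39959),
    "% Heterogeneity": (2128.0, 1, 6086.5, 25, 8.74134),
    "% Aliphatic residues": (1350.0, 1, 6716.0, 25, 5.02531),
}


def f_ratio(ss_between, df_between, ss_within, df_within):
    """Ratio of the between-cluster to the within-cluster mean square."""
    return (ss_between / df_between) / (ss_within / df_within)


recomputed = {name: f_ratio(*row[:4]) for name, row in table2.items()}
for name, row in table2.items():
    print(f"{name}: recomputed F = {recomputed[name]:.3f}, reported F = {row[4]:.3f}")
```

The recomputed ratios agree with the reported F values to within the rounding of the tabulated sums of squares.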

The second most significant clustering variable is the length. This is expected since peptide length is closely associated with molecular weight. Peptide heterogeneity is also a significant clustering variable, which implies that sequence diversity is an important point of difference among the reported BMPeps. The least significant clustering variable is the % aliphatic residues. Among the variables that take the kind of amino acid into consideration, only the % aliphatic residues was deemed significant. The importance of aliphatic residues in a given peptide potentially lies in solvent interaction. Since BMPeps are small and short compared to full-length proteins, all residues are expected to be exposed to the solvent, and the aliphatic residues help regulate the interaction of the peptide with the aqueous environment. The other criteria were deemed insignificant, probably because they occur either very frequently or erratically. For example, acidic and basic amino acids are common to all BMPeps, since these residues are responsible for metal complexation [32] and capping. BMPeps regulate nanostructure growth by means of capping, wherein the peptide attaches itself to the growing nanoparticle at very specific facets in order to arrest further formation. Since capping is a shared characteristic of all BMPeps, the type of capping amino acid was not found to be a discriminatory variable. Polar residues are also common to all BMPeps, since they help make the peptide more hydrophilic given that aqueous systems are always used. Finally, the appearance of aromatic residues is unpredictable, while that of the sulfur-containing residues is rare. When they are present in a given BMPep, aromatic residues such as tryptophan and tyrosine help in the reduction of the metal ions [33].
The erratic appearance of aromatic residues indicates that not all BMPeps are able to reduce metals. This is consistent with the practice of adding reducing agents in order to convert the metal ions into their zero-valent state. Common reducing agents include ascorbic acid and sodium borohydride, among others. Recently, Briggs et al. [34] concluded that the type of reducing agent added influences the morphology of the nanostructures obtained from biomineralization. Thus, it is difficult to discriminate and classify BMPeps on the basis of aromatic residues because of their erratic and unpredictable occurrence. This is reflected in our analysis, wherein aromatic residues were not deemed to be a significant clustering variable. On the other hand, sulfur-containing amino acids such as cysteine exert their influence on the secondary structure of the peptides by forming disulfide bonds. Therefore, finding a connection among the other BMPeps using these criteria might be difficult. Based on the significant clustering variables, the 27 BMPeps were then divided into two groups. The first cluster contains 18 members, whereas the second contains 9 members, as presented in Table 3.

Table 3

Cluster memberships of 27 biomineralization peptides.

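| Cluster 1 | Cluster 2 |
|---|---|
| R5 | HG12 |
| A3 | HRE |
| Ag4 | P7A |
| AgP35 | Flg |
| Col-P10 | Z1 |
| Pd4 | Z2 |
| Pd2 | q7 |
| AuBP1 | B7 |
| AuBP2 | LSTB1 |
| GBP1 | |
| Midas2 | |
| AgBP1 | |
| AgBP2 | |
| Ag5 | |
| Col-P2 | |
| Col-P1 | |
| Col-P15 | |
| Pt-1 | |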

The discriminatory ability of the identified significant clustering variables was validated by conducting a t-test between the means of the two clusters (Table 4). As expected, the previously identified clustering variables differed significantly between the two groups, as indicated by their respective t and p values. Conversely, the other properties used as clustering variables do not differ significantly between the two clusters.

Table 4

T-test for the difference of the means of the clustering variables between the two clusters.

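| Variable | Mean (cluster 2) | Mean (cluster 1) | t-value | df | p-Value |
|---|---|---|---|---|---|
| Length | 8.0000 | 12.556 | -6.55518 | 25 | 0.000001 |
| pI | 7.4889 | 9.051 | -1.67015 | 25 | 0.107364 |
| MW | 908.1178 | 1426.633 | -7.77172 | 25 | 0.000000 |
| % Heterogeneity | 49.0000 | 67.833 | -2.95658 | 25 | 0.006701 |
| % Aliphatic | 25.6667 | 40.667 | -2.24172 | 25 | 0.034096 |
| % Aromatic | 11.1111 | 8.694 | 0.66786 | 25 | 0.510340 |
| % Polar | 16.7778 | 26.033 | -1.40320 | 25 | 0.172855 |
| % Acidic | 7.5556 | 2.294 | 1.13070 | 25 | 0.268914 |
| % Basic | 34.0000 | 20.050 | 1.90915 | 25 | 0.067784 |
| % S-containing | 4.7778 | 2.256 | 0.68599 | 25 | 0.499030 |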

Based on the descriptive statistics of each group, members of cluster 1 have longer sequences, higher molecular weights, more diverse amino acid compositions, and more aliphatic amino acids (Table 5). Members of the second cluster, on the other hand, have lower values for these variables (Table 6).

Table 5

Descriptive statistics for the members of cluster 1.

| Variable | Mean | Standard deviation | Variance |
|---|---|---|---|
| Length | 12.556 | 1.6881 | 2.85 |
| MW | 1426.633 | 177.7227 | 31585.35 |
| % Unique residues | 67.833 | 11.9127 | 141.91 |
| % Aliphatic | 40.667 | 12.3860 | 153.41 |

Table 6

Descriptive statistics for the members of cluster 2.

| Variable | Mean | Standard deviation | Variance |
|---|---|---|---|
| Length | 8.0000 | 1.7321 | 3.00 |
| MW | 908.1178 | 127.8418 | 16343.53 |
| % Unique residues | 49.0000 | 21.4301 | 459.25 |
| % Aliphatic | 25.6667 | 22.6605 | 513.50 |
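The t-values in Table 4 can be approximately recovered from the descriptive statistics in Tables 5 and 6. The Python sketch below assumes a pooled-variance (equal-variance) two-sample t-test with 18 and 9 members, which is consistent with the 25 degrees of freedom reported, and obtains the two-sided p-value by numerically integrating the Student t density with Simpson's rule. Whether Statistica used exactly this variant is an assumption, and the function names are hypothetical.

```python
import math


def pooled_t(mean_a, var_a, n_a, mean_b, var_b, n_b):
    """Equal-variance two-sample t statistic and its degrees of freedom."""
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / df
    se = math.sqrt(pooled_var * (1.0 / n_a + 1.0 / n_b))
    return (mean_a - mean_b) / se, df


def t_pdf(x, df):
    """Density of the Student t distribution."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)


def two_sided_p(t, df, steps=2000):
    """Two-sided p-value via Simpson integration of the density on [0, |t|]."""
    b = abs(t)
    h = b / steps
    total = t_pdf(0.0, df) + t_pdf(b, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return max(0.0, 1.0 - 2.0 * area)


# Cluster 2 (n = 9) minus cluster 1 (n = 18), matching the sign used in Table 4.
t_len, df = pooled_t(8.0000, 3.00, 9, 12.556, 2.85, 18)
t_ali, _ = pooled_t(25.6667, 513.50, 9, 40.667, 153.41, 18)
p_ali = two_sided_p(t_ali, df)
print(f"Length: t = {t_len:.3f}, t squared = {t_len ** 2:.2f}, df = {df}")
print(f"% Aliphatic: t = {t_ali:.3f}, p = {p_ali:.4f}")
```

With two groups, the square of the t-value equals the ANOVA F statistic, which is why the p-values in Table 4 match those in Table 2 for the four significant variables.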

Analyzing the members of each cluster reveals that both clusters contain peptides that are diverse with respect to their inorganic substrate. For example, both clusters contain Au-, Ag- and Pd-binding BMPeps. This indicates that for a given inorganic substrate, two types of BMPep are available. While type-1 BMPeps are more common, the shorter type-2 BMPeps also exist and are more attractive in terms of cost. Reducing the length of a peptide by several residues has a drastic effect on the efficiency of its synthesis. For example, both Pd4 and q7 are palladium biomineralization peptides, and both are capable of forming sub-5 nm crystalline nanoparticles. The q7 BMPep, however, is more attractive because of its shorter length and less heterogeneous character. It is more cost-effective to synthesize since it is shorter by 5 residues. Moreover, it needs only 5 types of amino acids, compared to Pd4, which requires 9. The Midas2 and Z1 peptides for Au form a similar pair. Our findings encourage the discovery of more type-2 BMPeps, which are shorter, lighter and less heterogeneous, and which will therefore translate into more cost-effective nanostructure production. Truncation studies can be carried out wherein a long BMPep sequence is systematically reduced to a shorter fragment without compromising its ability to direct nanostructure growth. In general, increasing the number of known type-2 BMPeps will further broaden the applicability of biomineralization as a tool for nanomaterial synthesis. The use of type-2 BMPeps is more practical since their synthesis is more straightforward and considerably cheaper.

4 Conclusion

In summary, we have established a reliable classification of biomineralization peptides based on their length, molecular weight, heterogeneity, and proportion of aliphatic residues. Type-1 BMPeps are more common and exhibit higher values for these significant clustering variables. In contrast, type-2 BMPeps have lower values for these parameters and are less common. Our findings encourage the discovery and development of more type-2 BMPeps, since these peptides are more cost-effective to prepare. Increasing the number of known type-2 BMPep sequences will widen the applicability of biomineralization as a method to prepare inorganic nanomaterials. Our clustering analysis enables a more efficient and systematic approach to BMPep selection, since previous methods of BMPep classification are unreliable.


Corresponding author: Jose Isagani B. Janairo, Biology Department, College of Science, De La Salle University, 2401 Taft Avenue, Manila, Philippines

References

1. Nudelman F, Sommerdijk NA. Biomineralization as an inspiration for materials chemistry. Angew Chem Int Ed 2012;51:6582–96. doi:10.1002/anie.201106715

2. Mann S. Biomineralization: principles and concepts in bioinorganic materials chemistry. UK: Oxford University Press, 2001.

3. Yan L, Zhang S, Chen P, Liu H, Yin H, Li H. Magnetotactic bacteria, magnetosomes and their application. Microbiol Res 2012;167:507–19. doi:10.1016/j.micres.2012.04.002

4. Briggs B, Knecht MR. Nanotechnology meets biology: peptide-based methods for the fabrication of functional materials. J Phys Chem Lett 2012;3:405–18. doi:10.1021/jz2016473

5. Coppage R, Slocik JM, Briggs BD, Frenkel AI, Heinz H, Naik RR, et al. Crystallographic recognition controls peptide binding for bio-based materials. J Am Chem Soc 2011;133:12346–9. doi:10.1021/ja203726n

6. Tan YN, Lee JY, Wang DI. Uncovering the design rules for peptide synthesis of metal nanoparticles. J Am Chem Soc 2010;132:5677–86. doi:10.1021/ja907454f

7. Janairo JI, Sakaguchi T, Hara K, Fukuoka A, Sakaguchi K. Effects of biomineralization peptide topology on the structure and catalytic activity of Pd nanomaterials. Chem Commun 2014;50:9259–62. doi:10.1039/C4CC04350B

8. Janairo JI, Sakaguchi K. Effects of buffer on the structure and catalytic activity of palladium nanomaterials formed by biomineralization. Chem Lett 2014;43:1315–7. doi:10.1246/cl.140405

9. Knecht MR, Wright DW. Functional analysis of the biomimetic silica precipitating activity of the R5 peptide from Cylindrotheca fusiformis. Chem Commun 2003;3038–9. doi:10.1039/b309074d

10. Sewell SL, Wright D. Biomimetic synthesis of titanium dioxide utilizing the R5 peptide derived from Cylindrotheca fusiformis. Chem Mater 2006;18:3108–13. doi:10.1021/cm060342p

11. Jakhmola A, Bhandari R, Pacardo DB, Knecht MR. Peptide template effects for the synthesis and catalytic application of Pd nanoparticle networks. J Mater Chem 2010;20:1522–31. doi:10.1039/B922018F

12. Bhandari R, Knecht MR. Synthesis, characterization, and catalytic application of networked Au nanostructures fabricated using peptide templates. Catal Sci Technol 2012;2:1360–6. doi:10.1039/c2cy20149f

13. Banerjee IA, Yu L, Matsui H. Cu nanocrystal growth on peptide nanotubes by biomineralization: size control of Cu nanocrystals by tuning peptide conformation. Proc Natl Acad Sci USA 2003;100:14678–82. doi:10.1073/pnas.2433456100

14. Slocik JM, Moore JT, Wright DW. Monoclonal antibody recognition of histidine rich peptide encapsulated nanoclusters. Nano Lett 2002;2:169–73. doi:10.1021/nl015706l

15. Slocik JM, Stone MO, Naik RR. Synthesis of gold nanoparticles using multifunctional peptides. Small 2005;1:1048–52. doi:10.1002/smll.200500172

16. Naik RR, Stringer SJ, Agarwal G, Jones SE, Stone MO. Biomimetic synthesis and patterning of silver nanoparticles. Nature Mater 2002;1:169–72. doi:10.1038/nmat758

17. Naik RR, Jones SE, Murray CJ, McAuliffe JC, Vaia RA, Stone MO. Peptide templates for nanoparticle synthesis derived from polymerase chain reaction-driven phage display. Adv Funct Mater 2004;14:25–30. doi:10.1002/adfm.200304501

18. Li Y, Whyburn GP, Huang Y. Specific peptide regulated synthesis of ultrasmall platinum nanocrystals. J Am Chem Soc 2009;131:15998–9. doi:10.1021/ja907235v

19. Pacardo DB, Sethi M, Jones SE, Naik RR, Knecht MR. Biomimetic synthesis of Pd nanocatalysts for the Stille coupling reaction. ACS Nano 2009;3:1288–96. doi:10.1021/nn9002709

20. Hnilova M, Oren EE, Seker UO, Wilson BR, Collino S, Evans JS, et al. Effect of molecular conformations on the adsorption behavior of gold-binding peptides. Langmuir 2008;24:12440–5. doi:10.1021/la801468c

21. Kulp JL III, Sarikaya M, Evans JS. Molecular characterization of a prokaryotic polypeptide sequence that catalyzes Au crystal formation. J Mater Chem 2004;14:2325–32. doi:10.1039/b401260g

22. Kim J, Rheem Y, Yoo B, Chong Y, Bozhilov KN, Kim D, et al. Peptide-mediated shape- and size-tunable synthesis of gold nanostructures. Acta Biomater 2010;6:6929–33. doi:10.1016/j.actbio.2010.01.019

23. Peelle BR, Krauland EM, Wittrup KD, Belcher AM. Design criteria for engineering inorganic material-specific peptides. Langmuir 2005;21:6929–33. doi:10.1021/la050261s

24. Hnilova M, Liu X, Yuca E, Jia C, Wilson B, Karatas AY, et al. Multifunctional protein-enabled patterning on arrayed ferroelectric materials. ACS Appl Mater Interfaces 2012;4:1865–71. doi:10.1021/am300177t

25. Sedlak RH, Hnilova M, Grosh C, Fong H, Baneyx F, Schwartz D, et al. Engineered Escherichia coli silver-binding periplasmic protein that promotes silver tolerance. Appl Environ Microbiol 2012;78:2289–96. doi:10.1128/AEM.06823-11

26. Chiu C-Y, Li Y, Huang Y. Size-controlled synthesis of Pd nanocrystals using specific multifunctional peptide. Nanoscale 2010;2:927–30. doi:10.1039/c0nr00194e

27. Chung KC, Cao L, Dias AV, Pickering IJ, George GN, Zamble DB. A high-affinity metal-binding peptide from Escherichia coli HypB. J Am Chem Soc 2008;130:14056–7. doi:10.1021/ja8055003

28. Choi N, Tan L, Jang J, Um YM, Yoo PJ, Choe W-S. The interplay of peptide sequence and local structure in TiO2 biomineralization. J Inorg Biochem 2012;115:20–7. doi:10.1016/j.jinorgbio.2012.05.011

29. Forbes LM, Goodwin AP, Cha JN. Tunable size and shape control of platinum nanocrystals from a single peptide sequence. Chem Mater 2010;22:6524–8. doi:10.1021/cm101389v

30. Myatt GJ. Making sense of data: practical guide to exploratory data analysis and data mining. New Jersey: John Wiley & Sons, Inc, 2007. doi:10.1002/0470101024

31. Coppage R, Slocik JM, Ramezani-Dakhel H, Bedford NM, Heinz H, Naik RR, et al. Exploiting localized surface binding effects to enhance the catalytic reactivity of peptide-capped nanoparticles. J Am Chem Soc 2013;135:11048–54. doi:10.1021/ja402215t

32. Sovago I, Kallay C, Varnagy K. Peptides as complexing agents: factors influencing the structure and thermodynamic stability of peptide complexes. Coord Chem Rev 2012;256:2225–33. doi:10.1016/j.ccr.2012.02.026

33. Diamanti S, Elsen A, Naik RR, Vaia R. Relative functionality of buffer and peptide in gold nanoparticle formation. J Phys Chem C 2009;113:9993–7. doi:10.1021/jp8102063

34. Briggs B, Li Y, Swihart MT, Knecht MR. Reductant and sequence effects on the morphology and catalytic activity of peptide-capped Au nanoparticles. ACS Appl Mater Interfaces 2015;7:8843–51. doi:10.1021/acsami.5b01461

Received: 2014-11-24
Revised: 2015-7-11
Accepted: 2015-7-20
Published Online: 2015-8-11
Published in Print: 2015-7-1

©2015 by De Gruyter

Downloaded on 25.10.2025 from https://www.degruyterbrill.com/document/doi/10.1515/znc-2014-4202/html