Here, we describe a protocol for the optimization and parameterization of amino acid residues modified with reactive carbonyl species, adaptable to protein systems. The protocol steps include structure design and optimization, charge assignments, parameter construction, and preparation of protein systems.
Protein carbonylation by reactive aldehydes derived from lipid peroxidation leads to cross-linking, oligomerization, and aggregation of proteins, causing intracellular damage, impaired cell functions, and, ultimately, cell death. It has been described in aging and several age-related chronic conditions. However, the basis of structural changes related to the loss of function in protein targets is still not well understood. Hence, a route to the in silico construction of new parameters for amino acids carbonylated with reactive carbonyl species derived from fatty acid oxidation is described. The Michael adducts for Cys, His, and Lys with 4-hydroxy-2-nonenal (HNE), 4-hydroxy-2-hexenal (HHE), and a furan ring form for 4-Oxo-2-nonenal (ONE), were built, while malondialdehyde (MDA) was directly attached to each residue. The protocol describes details for the construction, geometry optimization, assignment of charges, missing bonds, angles, dihedral angles parameters, and its validation for each modified residue structure. As a result, structural effects induced by the carbonylation with these lipid derivatives have been measured by molecular dynamics simulations on different protein systems such as the thioredoxin enzyme, bovine serum albumin and the membrane Zu-5-ankyrin domain employing root-mean-square deviation (RMSD), root mean square fluctuation (RMSF), structural secondary prediction (DSSP) and the solvent-accessible surface area analysis (SASA), among others.
In the constant pursuit of understanding the molecular behavior of proteins with oxidative modifications, computational chemistry has become a fundamental pillar in the broad field of scientific research. This relies on the use of theoretical models capable of interpreting physical phenomena in electronic systems, using mathematical equations to describe the atomic behavior of molecules. Within this landscape, computational simulations of proteins stand out as crucial tools to analyze the atomic behavior of molecular systems. Based on the evaluation of structural behavior, energetic calculations, and conformational states1, these methods become strategic allies to predict the behavior of biomolecular systems.
These simulations specialize in studying structural changes and assessing the loss or gain of biological functions in protein systems. However, computational approaches have shown significant limitations when applied to protein systems containing modified residues formed by covalent post-translational modifications in the sequence. This is because many available methods lack resources with parameters adaptable to force fields that are compatible with the most common packages of programs for molecular dynamics simulations of proteins2,3,4,5,6. Therefore, the standardization of computational software-compatible force-field adaptive parameters is essential to facilitate the precise coupling of topologies and atomic coordinates with the equation governing the potential energy of the system7.
In response to these challenges, a protocol adaptable to new modified amino acid residues with aldehydes derived from lipid peroxidation has been developed using ab initio methods. In that sense, the optimization of the structural geometry of the new residues allows the assignment of adaptive charges to new bond, angle, and dihedral parameters that can be run in general force fields such as AMBER. Subsequent validation of these parameters allows the determination of the consistency and robustness of the method applicable to molecular dynamics simulations.
One of the notable strengths of this method lies in its ability to adapt to diverse post-translational modifications, from carbonylation to phosphorylation, acetylation, and methylation, among others. This versatility is not only limited to protein systems but extends to macromolecular structures, allowing coupling with atomic topologies and coordinates. In contrast, previous studies reveal that the standard parameterization of post-translational modifications is only suited to a specific type of modification and can only be obtained from published repositories, lacking the ability to create new structures8.
Currently, challenges in protein structure prediction and design are becoming more evident when modeling structures with post-translational modifications. The scarcity of parameters describing alterations at specific amino acid sites underlines the urgent need to develop and apply computational methods that can be adjusted to standard parameterizations. The aim of this protocol is to provide a route for the in silico construction of new parameters for amino acids covalently modified with reactive carbonyl species derived from fatty acid oxidation. These modified amino acids are recognized by the general amber force field (GAFF) and can, therefore, be used to evaluate in silico the structural and functional effects that this kind of carbonylation has on their target proteins.
1. Design and optimization of the new modified amino acid
NOTE: This stage involves drawing the structures of modified residues and optimizing their energy.
Figure 1: Cysteine modified with reactive carbonyls. Representation of the chemical structure of cysteine (black line) modified with HNE, HHE, MDA and ONE (green line), and linked with acetylamide (blue line) and methylamide (red line) substituent groups. Please click here to view a larger version of this figure.
Figure 2: Menu to optimize modified residues synthetized. Reference image illustrating step 1.1 of the protocol, which shows the optimization step of the modified structure in Gaussian program. Please click here to view a larger version of this figure.
2. Parameterization of the modified amino acid residues
Figure 3: Parameter file preparation. (A) Reference image illustrating the expected appearance of the prepin file generated in step 2.1. The visualization of the file was conducted using the GNU nano text editor v2.3.1. (B) Reference image illustrating the expected appearance of the frcmod file generated in step 2.1. Please click here to view a larger version of this figure.
Figure 4: Reference image of XLEaP window. (A) Shows the expected response when typing the mentioned commands.(B) Shows the atoms that need to be removed (yellow) and the option that needs to be selected in order to do so (red). (C) Shows a reference image of what the amino and carbonyl terminal ends of the modified residue should look like after the acetyl and methylamine groups are deleted. Please click here to view a larger version of this figure.
Figure 5: Charge neutralization procedure. (A) Calculation of the total charge after the removal of acetyl and methylamine groups. (B) Determination of the assigned nomenclature for the atoms of the residue. Pay attention to the assigned nomenclature for the N of the amino terminal and C of the carboxyl terminal. (C) Identification of the assigned charges for these two atoms (N1 and C3) in the table. Take the charge value of the atoms (divided by 2) and add the absolute value of the obtained charge. (D) Substitution of the charge values of N1 and C3 with the obtained values. (E) Verification that the resulting charge is now zero. (all data provided is for reference only and may vary depending on the modified residue). Please click here to view a larger version of this figure.
Figure 6: Reference image of the desired structure of the library file (.lib). It is important to note that the image provided only displays a condensed representation of the complete file. Please click here to view a larger version of this figure.
Figure 7: Reference image illustrating the correct positioning of the from-lib.pdb file. It is important to note that the displayed image includes the hydrogens on the N and C termini, which should be excluded before saving the file. The image was taken in the Visualizer software. Please click here to view a larger version of this figure.
Figure 8: PDB file update. Reference image of the procedure for replacing the residue coordinates (in this case Cys32) with the modified residue. The modified residue PDB file refers to the u00-moved.pdb file. Please click here to view a larger version of this figure.
To illustrate the implementation of the protocol and evaluate the results, the following analyses will be considered. The data set generated by assigning new parameters to modified amino acid residues was constructed based on optimization of the electronic structures, which were supported for partial RESP loadings. Figure 9 shows the structural conformation of one of the amino acid residues optimized with the parameter assignment.
Figure 9: Cys-HHE residue synthesized in silico. Representation of HHE-modified cysteine amino acid with assigned topology and coordinate parameters. Please click here to view a larger version of this figure.
The structures obtained from theoretical DFT levels with M062X/6-31G were compared with the classical mechanics structures through molecular dynamics simulations in AMBER. Each of the parameters obtained from the simulations showed a good correlation with the theoretical data from quantum mechanics. The average bond distance errors showed values of approximately 0.001 – 0.002 Å, while the angles were ~ 8.2°. The typology, distances and constants of bonds and angles are listed in Table 1. These data were similar to those reported in the data article by Alviz-Amador et al.9. Parameter files are available at http://research.bmh.manchester.ac.uk/bryce/amber/.
Cys-HHE | ||||||
Methods | Bond | Angle | ||||
(Å, ± Stdev) | (°, ± Stdev) | |||||
QM | S1 –C4 | C6-C8 | C8-C9 | S1-C4-C5 | O2-C6-C8 | C6-C8-C9 |
(m062x/631g(d) | 1.82 | 1.52 | 1.53 | 115.9 | 109.25 | 112.21 |
MM (AMBER) aa alone | 1.85± | 1.55± | 1.54± | 111.66± | 109.77± | 113.16± |
0.002 | 0.002 | 0.002 | 0.152 | 0.14 | 0.148 |
Table 1: Bond distance and angle parameters comparison. The values of bond distances and angle obtained by quantum (QM) and classical methods showed no significant differences.
Once each of the parameters for the modified amino acid residues was generated and validated, the dynamic behaviors were examined through molecular dynamics simulations with trajectories of 1 µs in order to evaluate the effect on the stability of each residue compared to its native counterpart (Figure 10). The RMSD values obtained for each of the modified amino acids did not show significant differences from their native counterpart, and they maintained their conformational stability throughout the entire trajectory.
Figure 10: RMSD graph of synthesized residues in silico. Representative RMSD of unmodified and modified cysteine residue with HHE, HNE, MDA and ONE. Please click here to view a larger version of this figure.
The files resulting from the parameterization of modified amino acid residues have been utilized to substitute natural structural amino acids in proteins that have experimental evidence of carbonylation. This substitution was undertaken to evaluate the structural and functional impacts that may occur in the protein as a result of these modifications. It had been reported by in silico studies of carbonylation by reactive carbonyl species on protein systems like Ankyrin and Thioredoxin10,11.
One of the critical steps in developing the AMBER parameterization protocol was the quantum optimization of the new amino acid residues modified with the lipid peroxidation derivatives, due to the energetic variability related to the minimization and the way of assigning RESP charges in the AMBER antechamber. For this, ab initio optimization methods with Hartree-Fock (HF/6-31G) and semiempirical density functional theory (DFT; B3LYP/6-31G and M062X/6-31G) were established to evaluate the response to the load assignment. As a result, the HF functional presented better performance/computational cost ratio, taking this into account as a previous step to the protocol. This was also demonstrated in the study by Zhou et al.12.
During the application of the protocol there can be several sources of errors. The possible steric hindrances generated by the additional structures of the modification often lead to errors that are often solved through the minimization steps of the molecular system. On the other hand, the parameters of the dihedral angles are usually adjusted at the end of the parameterization process and therefore, sometimes they tend to show as a possible error, in this particular case it is suggested to adjust the parameters by homology, as reported by Alviz-Amador9 and add it in the new format to eliminate the error.
One of the limitations of the method is the effort required for the step-by-step development of the parameterizations. The generation of new parameters from the new electronic structures and then adapting these parameters to protein systems requires a lot of dedication for its good execution. Therefore, a good strategy when implementing our protocol is to follow the step-by-step instructions and read the guide carefully.
In the landscape of molecular dynamics simulations, the significance of the AMBER protocol becomes evident. Its adaptive nature and versatility make it a valuable tool for researchers exploring diverse research areas. Beyond its application in protein systems, its extension to macromolecular structures opens doors to new possibilities. This adaptability not only addresses the existing gaps in standard parameterization methods but also offers a pathway for the creation of novel structures, thus expanding the horizons of molecular dynamics research. On the contrary, other investigations demonstrate that the conventional parameterization of post-translational modifications is limited to a particular modification type and is exclusively derived from publicly available repositories8, lacking the capability to generate novel structures.
Modifications resulting from the presence of reactive carbonyl species are frequently associated with a range of pathologies, including cancer, metabolic disorders, and degenerative diseases following different mechanisms13,14 . The support provided by this protocol is useful to assess various crucial properties, such as conformational stability, atomic flexibility, loss of secondary structures, solvent accessibility, and protein-protein interaction energy, among others. Consequently, the measurement of these properties could prove beneficial in situations where carbonylated proteins can induce irreversible alterations in biological systems, leading to conformational instability, increased or decreased atomic flexibility, and loss of secondary structure10,11.
In conclusion, the AMBER parameterization protocol, with its critical steps, adaptability, and versatility, stands as a pioneering method in the realm of molecular dynamics simulations. While acknowledging its limitations, its significance is underscored by its ability to address the shortcomings of existing methods, providing researchers with a powerful tool to explore the intricacies of molecular structures and behaviors across a spectrum of biological and chemical systems.
The authors have nothing to disclose.
This work was supported by research grant code 1107-844-67943 from Ministerio de Ciencia, Tecnología e Innovación (Minciencias) and the University of Cartagena (Colombia) for grant to support the research groups 2021 and Acta 017-2022.
AmberTools16 or Upper | The Amber Project | Amber is a suite of biomolecular simulation programs | |
Gaussian 09 or Upper | Gaussian Inc | Draw and optimize structures | |
Linux Ubuntu | GNU/Linux | Platform for AmberTools | |
NVIDIA GPUs GTX 1080 or Upper | Nvidia | Compatible with PMEMD |