Research Ideas and Outcomes :
Research Article
|
Corresponding author: Victor Padilla-Sanchez (70padillasan@cua.edu)
Received: 09 Jun 2020 | Published: 16 Jun 2020
© 2020 Victor Padilla-Sanchez
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation: Padilla-Sanchez V (2020) In silico analysis of SARS-CoV-2 spike glycoprotein and insights into antibody binding. Research Ideas and Outcomes 6: e55281. https://doi.org/10.3897/rio.6.e55281
|
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in Wuhan, China in December 2019. Since then, COVID-19, the disease caused by SARS-CoV-2, has become a rapidly spreading pandemic that has reached most countries in the world. So far, there are no vaccines or therapeutics to fight this virus. Here, I present an in silico analysis of the virus spike glycoprotein (recently determined at atomic resolution) and provide insights into how antibodies against the 2002 virus SARS-CoV might be modified to neutralize SARS-CoV-2. I ran docking experiments with Rosetta Dock to determine which substitutions in the 80R and m396 antibodies might improve the binding of these to SARS-CoV-2 and used molecular visualization and analysis software, including UCSF Chimera and Rosetta Dock, as well as other bioinformatics tools, including SWISS-MODEL. Supercomputers, including Bridges Large, Stampede and Frontera, were used for macromolecular assemblies and large scale analysis and visualization.
COVID-19, SARS-CoV-2, immunology, computational biology, in silico, structural biology, coronavirus, antiviral therapy
COVID-19, the disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), presents with symptoms of fever, severe respiratory illness and pneumonia. As of April 15, 2020, SARS-CoV-2 has infected approximately 2,000,000 people, with more than 120,000 deaths. The virus is spreading exponentially worldwide and has been declared a public health emergency by the World Health Organization (WHO) (
Structural model of SARS-CoV-2 infection. This structural model was built with UCSF Chimera using high-performance computers (Bridges Large and Frontera). The model shows 16 viruses, with the spike proteins shown in green (PDB ID: 6VSB) and an actual lipid bilayer membrane, with ACE2 dimers shown in magenta. All these structures are at atomic resolution. The length of the membrane is approximately 1 micrometer.
Structural analysis was performed using UCSF Chimera software (
The tertiary structure of the SARS-CoV spike protein bound to ACE2 (
Structural analysis of SARS-CoV spike glycoprotein. In A the SARS-CoV spike protein (PDB ID: 6ACG) is shown bound to ACE2 (brown) and 80R antibody (cyan), superimposed on the same binding site. In B the spike protein is shown bound only to the 80R antibody (PDB ID: 2GHW), with the structural model of the RBD of the SARS-CoV-2 spike protein (magenta) containing the missing loops. This homology model served as the basis for the docking experiments. In C it is shown a spike colored by subunit and showing the glycans. There are only two possible glycans in RBD region at 331 and 343 and neither of these sites affect the 80R binding.
Sequence alignments between SARS-CoV RBD and SARS-CoV-2 RBD were built in UCSF Chimera. There are many sequence differences between SARS-CoV and SARS-CoV-2 in the 80R-RBD interface, which explains why the 80R antibody binds with high affinity to the spike protein of SARS-CoV but not to the spike protein of SARS-CoV-2. In the SARS-CoV-2 RBD, polar residues are replaced by neutral residues, which disrupts the binding interactions between 80R and the RBD in the SARS-CoV-2 spike protein. Insertion of a glycine residue at position 482 also twists one of the interacting loops located at E484, which then clashes with the 80R antibody in the superimpositions. To avoid this clash and allow better antibody binding to the RBD in SARS-CoV-2, the 80R partner residue should be replaced by a different residue. I carefully selected six alternative residues, introduced these one at a time, and ran docking experiments using Rosetta Dock (
Docking interface between the modified 80R antibody and the RBD of the SARS-CoV-2 spike protein. The model shows the structural interface with the 80R antibody above and the RBD below. The seven substitutions in 80R are shown in magenta and RBD residues are shown in cyan. Notice how the substitutions in 80R allow new aromatic-aromatic interactions that improve binding to the RBD and are not present in wild type 80R. E484 is shown pointing towards the beta strand of 80R and a glycine substitution was therefore introduced to avoid clashes.
I made a structural model of mutated 80R with SWISS-MODEL and positioned this model close to the RBD region. I used Rosetta Dock in Stampede2 supercomputer to run docking experiments with the following mutations in 80R R102F, S103F, R152F, S186F, T206G, S210F and N227F. These mutations were intended mainly to increase aromatic-aromatic interactions and to avoid contact with E484 (T206G) (Fig.
Similarly, m396 antibody that neutralizes SARS-CoV binding to its RBD domain was structurally analyzed and five mutations were introduced. In this case, the analysis pointed to electrostatic interactions to maximize and one aromatic. Therefore, five mutations were introduced: in the heavy chain T52F, I56E, N58K, Q61E; and in the light chain S94E. A structure with these mutations was put close to SARS-CoV-2 RBD and docking experiments were run in Stampede2. The results (Fig.
m396 mutations docking results. A shows Rosetta Dock funnels for the original partners SARS-CoV and m396, SARS-CoV-2 and m396 and the SARS-CoV-2 and mutated m396. Notice how the binding is improved to the level of the original partners. B shows the ΔG energies again notice the improvement of binding when the mutations are introduced. Finally, C shows the developability flags with only one warning that is not that far from green flag.
Docking experiments showed that appropriate amino acid substitutions in 80R and m396 should increase binding interactions between the antibodies and the SARS-CoV-2 RBD, thus providing new antibodies with sufficient affinity for the SARS-CoV-2 spike protein to neutralize the virus. This new antibody should be expressed in vitro to study its solubility, stability, specificity and binding kinetic, and these results would be the basis for further mutations to correct some of these parameters or even improve the affinity. This methodology could be the basis for a rapid and effective generation of neutralizing therapeutic antibodies against COVID-19. In silico analysis is a very useful tool that structural bioinformaticians can use to guide mutagenesis to achieve a goal, in this case better affinity for the RBD of the SARS-CoV-2 spike protein. The results obtained using this relatively new branch of science must be taken with caution, but the method is becoming increasingly successful with the rapid improvement in bioinformatics tools. This type of analysis was not possible a decade ago, when scientists had to conduct mutagenesis experiments in wet laboratories. It is now possible for many scientists to use a bioinformatics approach to shorten the time needed to find new therapeutics.Further in silico experiments, including molecular dynamics simulations, can be performed to analyze interactions in real time, and in the future, I plan to conduct these types of experiment. Humanity is in a race to find therapeutics and/or vaccines against COVID-19 and my experimental analysis and findings should help the scientific community to quickly discover novel therapeutics.
Molecular graphics and analyses were performed with UCSF Chimera, developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco, with support from NIH P41-GM103311. The author also wishes to acknowledge the Pittsburgh Supercomputing Center and Texas Advanced Computing Center for providing supercomputers (Bridges, Stampede2 and Frontera) for the in silico analyses.
Only in silico tools were used.
Dr Victor Padilla-Sanchez, PhD designed, executed and wrote this research.
The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.