The Advanced Photon Source
a U.S. Department of Energy Office of Science User Facility

Computers Hallucinate Proteins Not Found in Nature

Proteins do most of the work in our bodies—digesting food, making muscles contract, connecting brain cells, etc. All proteins are made of strings of amino acids; the order of amino acids in the string determines a protein’s shape, and the protein’s shape determines its function. Mathematically, there are approximately 100-tredecillion ways in which the amino acids could be arranged, but life on Earth uses only a fraction of them. For the past few decades, scientists have been exploring the possibility of creating proteins out of amino acid sequences that nature has ignored. Now, using the U.S. Department of Energy’s Advanced Photon Source (APS) and a computer program that “hallucinates” new protein shapes, a team of scientists has created and verified 10 protein shapes that never existed before. The researchers are hoping to eventually create new proteins with new capacities, like gobbling up plastic in the sea or making machines at the nanoscale. These results were published in the journal Science.

With the advent of anthropogenic climate change, novel emerging diseases, and technological breakthroughs, humanity is facing new challenges that synthetic biology may be able to address by designing new proteins. Scientists in this study wanted to explore the possibility of creating new cyclic protein oligomers, a class of proteins that play key roles in nearly all biological processes and are useful in biomedical and biotechnological applications.

Using only the canonical (or 20 standard) amino acids, the team set out to discover what new protein shapes could be produced by assembling the amino acids in novel sequences. They knew that chasing uncharted designs required a new approach, so they prompted a computer program to hallucinate new shapes on its own.

First, they created algorithms that randomly tweaked various amino acids in the sequence step-by-step. At each step, they would ask the AlphaFold2 neural network, “Is this an oligomer of the size and shape we want?” If the answer was yes or going in the right direction, they kept the tweak and repeated the procedure. If not, they rejected it and chose another. Eventually they ended up with a potential protein structure.

The procedure generated hundreds of potentially viable designs. The team then needed to test the designs in the real world.

To that end, they encoded the amino acid sequences of the hallucinated designs into synthetic genes, which they inserted into an E. coli bacterium. The bacterium produced a protein from the synthetic gene, and voila! A real-life protein generated from a sequence hallucinated by a computer.

The next step was to validate how closely the real-life proteins approximated the hallucinated design and how viable they were in terms of biophysical characteristics such as solubility and structural accuracy.

Using x-ray crystallography at Northeastern Collaborative Access Team 24-ID-C x-ray beamline at the APS, these researchers solved the structures of seven of the crystals, confirming that the hallucinated proteins were viable in the real world. The APS is a Department of Energy Office of Science user facility at Argonne National Laboratory.

The proteins produced in this work have no designated biological function besides their designed symmetry—they do not perform a job in and of themselves. Rather, they could serve as scaffolds for enzymes or proteins that bind targets on cells. Alternatively, they could be included within larger protein assemblies.

This has already found real-world applications: the Institute for Protein Design at the University of Washington, home institution for several of these researchers, created a nanoparticle from a large assembly of proteins, including parts of the spike of SARS-CoV-2, the virus that causes COVID. This nanoparticle vaccine became SKYCovione, an approved vaccine in use in South Korea.

Beyond the ten proteins detailed in this research, the process invented by the team unlocks the potential of generative protein design, where scientists are free to sample the structural space of proteins in an unconstrained manner, their imaginations extended into uncharted territory. Moreover, the ability to tweak the sequences as they go along gives scientists precise control over the arrangement of side chains.

Considering the fast pace of the field, exciting new developments, among them therapeutics and potentially new enzymes, could be in reach.  ― Judy Myers

See: B.I.M. Wicky, L.F. Milles, A. Courbet, R.J. Ragotte, J. Dauparas, E. Kinfu, S. Tipps, R.D. Kibler, M. Baek, F. DiMaio, X. Li, L. Carter, A. Kang, H. Nguyen, A.K. Bera, D. Baker*Hallucinating symmetric protein assemblies,” Science 378, 56 (7 October 2022). DOI: 10.1126/science.add1964

Author affiliations: University of Washington

Corresponding author: * dabaker@uw.edu

This work was supported with funds provided by the Audacious Project at the Institute for Protein Design (A.K., L.C., X.L., E.K., S.T., and D.B.), a grant from the National Institute of General Medical Sciences (P41 GM 103533-24, to R.D.K.), an EMBO long-term fellowship (ALTF 139-2018, to B.I.M.W.), a grant from the National Science Foundation (CHE-1629214, to D.B.), the Open Philanthropy Project Improving Protein Design Fund (H.N., A.K.B., R.J.R., J.D., and D.B.), an Alfred P. Sloan Foundation Matter-to-Life Program Grant (G-2021-16899, to A.C. and D.B.), a Human Frontier Science Program Cross Disciplinary Fellowship (LT000395/2020-C, to L.F.M.), an EMBO Non-Stipendiary Fellowship (ALTF 1047-2019, to L.F.M.), and the Howard Hughes Medical Institute (A.C. and D.B.). Cryo-EM was performed on a Glacios microscope purchased by the University of Washington Arnold and Mabel Beckman Center for Cryo-EM (D.B.) with an S10 award (S10OD032290), and at the Fred Hutchinson Cancer Center Electron Microscopy Shared Resource (supported by Cancer Center Support Grant P30 CA015704-40). The Northeastern Collaborative Access Team beamlines are funded by the National Institute of General Medical Sciences from the National Institutes of Health (P30 GM124165). This research used resources of the Advanced Photon Source, a U.S. Department of Energy (DOE) Office of Science user facility operated for the DOE Office of Science by Argonne National Laboratory under Contract No. DE-AC02-06CH11357.

The U.S. Department of Energy's APS at Argonne National Laboratory is one of the world’s most productive x-ray light source facilities. Each year, the APS provides high-brightness x-ray beams to a diverse community of more than 5,000 researchers in materials science, chemistry, condensed matter physics, the life and environmental sciences, and applied research. Researchers using the APS produce over 2,000 publications each year detailing impactful discoveries, and solve more vital biological protein structures than users of any other x-ray light source research facility. APS x-rays are ideally suited for explorations of materials and biological structures; elemental distribution; chemical, magnetic, electronic states; and a wide range of technologically important engineering systems from batteries to fuel injector sprays, all of which are the foundations of our nation’s economic, technological, and physical well-being.

Argonne National Laboratory seeks solutions to pressing national problems in science and technology. The nation's first national laboratory, Argonne conducts leading-edge basic and applied scientific research in virtually every scientific discipline. Argonne researchers work closely with researchers from hundreds of companies, universities, and federal, state and municipal agencies to help them solve their specific problems, advance America's scientific leadership and prepare the nation for a better future. With employees from more than 60 nations, Argonne is managed by UChicago Argonne, LLC, for the U.S. DOE Office of Science.

The U.S. Department of Energy's Office of Science is the single largest supporter of basic research in the physical sciences in the United States and is working to address some of the most pressing challenges of our time. For more information, visit the Office of Science website.

 

Published Date