Series GSE266299
Status Public on May 13, 2024
Title Genetics, energetics and allostery during a billion years of hydrophobic protein core evolution
Organism Saccharomyces cerevisiae
Experiment type Other
Summary Protein folding is driven by the burial of hydrophobic amino acids in a tightly packed core that excludes water. The genetics, biophysics and evolution of hydrophobic cores are not well understood, in part because of a lack of systematic experimental data on sequence combinations that do, and do not, constitute stable and functional cores. Here we randomize protein hydrophobic cores and evaluate their stability and function at scale. The data show that vast numbers of amino acid combinations can constitute stable protein cores but that these alternative cores frequently disrupt protein function because of allosteric effects. These strong allosteric effects are not due to complicated, highly epistatic fitness landscapes but rather to the pervasive nature of allostery, with many individually small energy changes combining to disrupt function. Indeed, both protein stability and ligand binding can be accurately predicted over very large evolutionary distances using additive energy models with a small contribution from pairwise energetic couplings. As a result, energy models trained on one protein can accurately predict core stability across hundreds of millions of years of protein evolution, with only rare energetic couplings that we experimentally identify limiting the transplantation of cores between highly diverged proteins. Our results reveal the simple energetic architecture of protein hydrophobic cores and suggest that allostery is a major constraint on sequence evolution.
 
Overall design We built combinatorial libraries in the hydrophobic cores of three small protein domains (FYN-SH3, CI-2A and CspA) using a reduced alphabet consisting of the amino acids F, L, M, V and I, encoded by the DTS degenerate codon. In the sparse_DTS_core_mutagenesis experiment, we bottlenecked and pooled the libraries to sparsely measure the intracellular abundance of protein variants in yeast cells using abundancePCA, a protein complementation assay that couples cell growth rate to the intracellular abundance of the query protein under methotrexate selection. For the SH3 domain of the human FYN kinase, we selected a few query core amino acid combinations that are severely deleterious for abundance fitness and designed a suppressor "permissivity" library by introducing non-core mutations associated with SH3 domains that naturally carry these query core combinations (FYN-SH3_core_permissivity experiment). Also for FYN-SH3, we assessed the impact of core reconfiguration on function by measuring binding to its short linear motif ligand PRD1super using bindingPCA, a protein complementation assay that couples cell growth rate to the intracellular binding of the query variant to an interacting partner under methotrexate selection (FYN-SH3_core_DTS_binding experiment).
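As a quick check of the reduced alphabet described above, the DTS degenerate codon can be expanded with the IUPAC nucleotide codes (D = A, G or T; S = C or G). The following is a minimal sketch, not code from the study: the helper name expand_degenerate is hypothetical, and only the six entries of the standard genetic code that DTS can produce are included.

```python
from itertools import product

# IUPAC nucleotide codes needed for the DTS degenerate codon.
IUPAC = {"D": "AGT", "T": "T", "S": "CG"}

# Standard genetic code, restricted to the codons that DTS can produce.
CODON_TO_AA = {
    "ATC": "I", "ATG": "M",
    "GTC": "V", "GTG": "V",
    "TTC": "F", "TTG": "L",
}


def expand_degenerate(codon):
    """Return every concrete codon matched by a degenerate codon, sorted.

    Hypothetical helper for illustration; supports only the IUPAC
    symbols listed in the IUPAC table above.
    """
    choices = (IUPAC[symbol] for symbol in codon)
    return sorted("".join(bases) for bases in product(*choices))


codons = expand_degenerate("DTS")
amino_acids = sorted({CODON_TO_AA[c] for c in codons})
print(codons)       # six concrete codons
print(amino_acids)  # the five-letter hydrophobic alphabet
```

The expansion gives six codons for five amino acids, because valine is encoded twice (GTC and GTG).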
Web link https://www.biorxiv.org/content/10.1101/2024.05.11.593672v1
 
Contributor(s) Escobedo A, Voigt G, Faure AJ, Lehner B
Submission date Apr 30, 2024
Last update date May 13, 2024
Contact name Albert Escobedo
Phone +34933160209
Organization name CRG
Department Systems and synthetic biology
Lab Ben Lehner
Street address Carrer Dr Aiguader 88
City Barcelona
State/province Catalonia
ZIP/Postal code 08003
Country Spain
 
Platforms (2)
GPL19756 Illumina NextSeq 500 (Saccharomyces cerevisiae)
GPL31112 NextSeq 2000 (Saccharomyces cerevisiae)
Samples (16)
GSM8244559 sparse_DTS_cores_input_1
GSM8244560 sparse_DTS_cores_input_2
GSM8244561 sparse_DTS_cores_output_1
Relations
BioProject PRJNA1106529

Download family Format
SOFT formatted family file(s) SOFT
MINiML formatted family file(s) MINiML
Series Matrix File(s) TXT

Supplementary file Size Download File type/resource
GSE266299_FYN-SH3_core_DTS_binding_scores_PRD1super.csv.gz 2.0 Mb (ftp)(http) CSV
GSE266299_FYN-SH3_core_permissivity_scores.csv.gz 500.7 Kb (ftp)(http) CSV
GSE266299_sparse_DTS_core_mutagenesis_scores.csv.gz 1.1 Mb (ftp)(http) CSV
Raw data are available in SRA
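The supplementary score tables are gzip-compressed CSV files, so they can be inspected with the Python standard library alone. The sketch below assumes that one of the files listed above has already been downloaded to the working directory and that it is comma-delimited with a header row. The column names are not documented on this page, so the code only prints whatever header it finds. The function name peek_csv_gz is hypothetical.

```python
import csv
import gzip
import os


def peek_csv_gz(path, n_rows=5):
    """Return (header, first n_rows data rows) of a gzip-compressed CSV.

    Assumes a comma-delimited file whose first line is a header.
    """
    with gzip.open(path, mode="rt", newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader, [])
        rows = [row for _, row in zip(range(n_rows), reader)]
    return header, rows


if __name__ == "__main__":
    # File name taken from the supplementary file list; the local path
    # is an assumption (the file must be downloaded beforehand).
    path = "GSE266299_sparse_DTS_core_mutagenesis_scores.csv.gz"
    if os.path.exists(path):
        header, rows = peek_csv_gz(path)
        print(header)
        for row in rows:
            print(row)
    else:
        print(f"{path} not found; download it from the GEO record first.")
```

Reading only the first few rows keeps memory use small, which is convenient for checking the column layout before loading a full table.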
