Expanding the Functional Landscape of GIY-YIG Nuclease Domains: Mechanism, Modularity, Evolution, and Engineering
Date
Authors
Abstract
GIY-YIG homing endonucleases are compact, site-specific nucleases that promote the mobility of introns and inteins through DNA cleavage. Despite their broad phylogenetic distribution and potential utility as programmable nucleases, the structural basis for their sequence preferences, cleavage modalities, and evolutionary adaptability has remained poorly understood. The overarching goal of this thesis was to define the modular determinants that control GIY-YIG nuclease function and to establish frameworks for their systematic characterization and reprogramming. To this end, I combined biochemical reconstitution, domain and subdomain shuffling, directed evolution, bioprospecting, and phylogenetic analysis. A buffer-tuning approach was used to decouple nuclease activity from DNA-binding scaffolds, while fluorescent capillary electrophoresis and Oxford Nanopore sequencing provided scalable and sensitive methods to profile cleavage preferences without complex libraries or Illumina sequencing. Using recombination between diverse nucleases (I-TevI, I-BmoI, I-BamI, and F-SP01I), I identified discrete structural elements controlling sequence preference and strand bias. The first α-helix and adjacent loop emerged as a portable “preference switch region,” while the second α-helix encoded cleavage modality. Substitution of only two residues (R47E + S48E) was sufficient to convert a double-strand cleavase into a strand-selective nickase. Bioprospecting uncovered I-BamI as the first nickase GIY-YIG homing endonuclease, and revealed scaffolds such as F-SP01I that tolerate extensive recombination, supporting an evolutionary model in which nicking represents the ancestral state of the domain. These findings demonstrate that GIY-YIG nuclease domains are defined by modular elements that can be reprogrammed to alter both sequence preference and cleavage modality. This modularity provides the mechanistic basis for their evolutionary diversification and establishes practical routes for their engineering. By expanding the functional landscape of GIY-YIG nuclease domains, this thesis positions them as accessible, evolvable, and engineerable platforms with broad relevance for molecular biology, synthetic biology, and genome editing.