Open-ended molecular recording of sequential cellular events into dna


Play all audios:

Loading...

ABSTRACT Genetically encoded DNA recorders noninvasively convert transient biological events into durable mutations in a cell’s genome, allowing for the later reconstruction of cellular


experiences by DNA sequencing. We present a DNA recorder, peCHYRON, that achieves high-information, durable, and temporally resolved multiplexed recording of multiple cellular signals in


mammalian cells. In each step of recording, prime editor, a Cas9-reverse transcriptase fusion protein, inserts a variable triplet DNA sequence alongside a constant propagator sequence that


deactivates the previous and activates the next step of insertion. Insertions accumulate sequentially in a unidirectional order, editing can continue indefinitely, and high information is


achieved by coexpressing a variety of prime editing guide RNAs (pegRNAs), each harboring unique triplet DNA sequences. We demonstrate that the constitutive expression of pegRNA collections


generates insertion patterns for the straightforward reconstruction of cell lineage relationships and that the inducible expression of specific pegRNAs results in the accurate recording of


exposures to biological stimuli. Access through your institution Buy or subscribe This is a preview of subscription content, access via your institution ACCESS OPTIONS Access through your


institution Access Nature and 54 other Nature Portfolio journals Get Nature+, our best-value online-access subscription $32.99 / 30 days cancel any time Learn more Subscribe to this journal


Receive 12 print issues and online access $259.00 per year only $21.58 per issue Learn more Buy this article * Purchase on SpringerLink * Instant access to full article PDF Buy now Prices


may be subject to local taxes which are calculated during checkout ADDITIONAL ACCESS OPTIONS: * Log in * Learn about institutional subscriptions * Read our FAQs * Contact customer support


SIMILAR CONTENT BEING VIEWED BY OTHERS LINEAGE TRACING AND ANALOG RECORDING IN MAMMALIAN CELLS BY SINGLE-SITE DNA WRITING Article 22 March 2021 A TIME-RESOLVED, MULTI-SYMBOL MOLECULAR


RECORDER VIA SEQUENTIAL GENOME EDITING Article Open access 06 July 2022 MOLECULAR RECORDING USING DNA TYPEWRITER Article 06 June 2024 DATA AVAILABILITY All NGS datasets can be found at the


National Center for Biotechnology Information (NCBI)’s Sequence Read Archive, accession number PRJNA1159905, and demultiplexing instructions can be found at


https://gthub.com/liusynevolab/peCHYRON-sequencing_data. Flow cytometry data and analysis can be found at http://github.com/liusynevolab/peCHYRON-flow-cytometry. Full plasmid maps are


available at https://github.com/liusynevolab/peCHYRON-plasmids-July2023. Plasmids we believe are most likely to be useful will be available at Addgene. See Supplementary Dataset 3 for a


guide to these data and reagents. Please contact C.C.L. and T.B.L. for other reagents. Source data are provided with this paper. CODE AVAILABILITY Code and all data necessary to rerun the


analyses for each of Figs. 5 and 6, and their associated Extended Data and Supplementary Figures are available at http://github.com/liusynevolab/peCHYRON-Fig5 and


http://github.com/liusynevolab/peCHYRON-Fig6. All other code is available athttp://github.com/liusynevolab/peCHYRON. REFERENCES * Sheth, R. U. & Wang, H. H. DNA-based memory devices for


recording cellular events. _Nat. Rev. Genet._ 19, 718–732 (2018). Article  CAS  PubMed  PubMed Central  Google Scholar  * McKenna, A. et al. Whole-organism lineage tracing by combinatorial


and cumulative genome editing. _Science_ 353, aaf7907 (2016). Article  PubMed  PubMed Central  Google Scholar  * Kalhor, R. et al. Developmental barcoding of whole mouse via homing CRISPR.


_Science_ 361, eaat9804 (2018). Article  PubMed  PubMed Central  Google Scholar  * Schmidt, S. T., Zimmerman, S. M., Wang, J., Kim, S. K. & Quake, S. R. Quantitative analysis of


synthetic cell lineage tracing using nuclease barcoding. _ACS Synth. Biol._ https://doi.org/10.1021/acssynbio.6b00309 (2017). Article  PubMed  PubMed Central  Google Scholar  * Sheth, R. U.,


Yim, S. S., Wu, F. L. & Wang, H. H. Multiplex recording of cellular events over time on CRISPR biological tape. _Science_ 358, 1457–1461 (2017). Article  CAS  PubMed  PubMed Central 


Google Scholar  * Shipman, S. L., Nivala, J., Macklis, J. D. & Church, G. M. CRISPR–Cas encoding of a digital movie into the genomes of a population of living bacteria. _Nature_ 547,


345–349 (2017). Article  CAS  PubMed  PubMed Central  Google Scholar  * Shipman, S. L., Nivala, J., Macklis, J. D. & Church, G. M. Molecular recordings by directed CRISPR spacer


acquisition. _Science_ 353, aaf1175 (2016). Article  PubMed  PubMed Central  Google Scholar  * Schmidt, F., Cherepkova, M. Y. & Platt, R. J. Transcriptional recording by CRISPR spacer


acquisition from RNA. _Nature_ 562, 380–385 (2018). Article  CAS  PubMed  Google Scholar  * Hwang, B. et al. Lineage tracing using a Cas9-deaminase barcoding system targeting endogenous L1


elements. _Nat. Commun._ 10, 1234 (2019). Article  PubMed  PubMed Central  Google Scholar  * Chan, M. M. et al. Molecular recording of mammalian embryogenesis. _Nature_ 570, 77–82 (2019).


Article  CAS  PubMed  PubMed Central  Google Scholar  * Bowling, S. et al. An engineered CRISPR–Cas9 mouse line for simultaneous readout of lineage histories and gene expression profiles in


single cells. _Cell_ 181, 1410–1422.e27 (2020). Article  CAS  PubMed  PubMed Central  Google Scholar  * Alemany, A., Florescu, M., Baron, C. S., Peterson-Maduro, J. & van Oudenaarden, A.


Whole-organism clone tracing using single-cell sequencing. _Nature_ 556, 108–112 (2018). Article  CAS  PubMed  Google Scholar  * Chow, K.-H. K. et al. Imaging cell lineage with a synthetic


digital recording system. _Science_ 372, eabb3099 (2021). Article  CAS  PubMed  Google Scholar  * Loveless, T. B. et al. Lineage tracing and analog recording in mammalian cells by


single-site DNA writing. _Nat. Chem. Biol._ 17, 739–747 (2021). Article  CAS  PubMed  PubMed Central  Google Scholar  * Choi, J. et al. A time-resolved, multi-symbol molecular recorder via


sequential genome editing. _Nature_ 608, 98–107 (2021). Article  Google Scholar  * Perli, S. D., Cui, C. H. & Lu, T. K. Continuous genetic recording with self-targeting CRISPR–Cas in


human cells. _Science_ 353, aag0511 (2016). Article  PubMed  Google Scholar  * Kalhor, R., Mali, P. & Church, G. M. Rapidly evolving homing CRISPR barcodes. _Nat. Methods_ 14, 195–200


(2017). Article  CAS  PubMed  Google Scholar  * Frieda, K. L. et al. Synthetic recording and in situ readout of lineage information in single cells. _Nature_ 541, 107–111 (2017). Article 


CAS  PubMed  Google Scholar  * Tang, W. & Liu, D. R. Rewritable multi-event analog recording in bacterial and mammalian cells. _Science_ 360, eaap8992 (2018). Article  PubMed  PubMed


Central  Google Scholar  * Chen, W. et al. Multiplex genomic recording of enhancer and signal transduction activity in mammalian cells. Preprint at _bioRxiv_


https://doi.org/10.1101/2021.11.05.467434 (2021). * Anzalone, A. V. et al. Search-and-replace genome editing without double-strand breaks or donor DNA. _Nature_ 576, 149–157 (2019). Article


  CAS  PubMed  PubMed Central  Google Scholar  * Loveless, T. B. et al. Molecular recording of sequential cellular events into DNA. Preprint at _bioRxiv_


https://doi.org/10.1101/2021.11.05.467507 (2024). * Chen, P. J. et al. Enhanced prime editing systems by manipulating cellular determinants of editing outcomes. _Cell_ 184, 5635–5652.e29


(2021). Article  CAS  PubMed  PubMed Central  Google Scholar  * Komor, A. C., Kim, Y. B., Packer, M. S., Zuris, J. A. & Liu, D. R. Programmable editing of a target base in genomic DNA


without double-stranded DNA cleavage. _Nature_ 533, 420–424 (2016). Article  CAS  PubMed  PubMed Central  Google Scholar  * Cabera, A. et al. The sound of silence: transgene silencing in


mammalian cell engineering. _Cell Syst._ 13, 950–973 (2022). Article  CAS  PubMed Central  Google Scholar  * Gould, S. I. et al. High-throughput evaluation of genetic variants with prime


editing sensor libraries. _Nat. Biotechnol_. https://doi.org/10.1038/s41587-024-02172-9 (2024). * Yan, J. et al. Improving prime editing with an endogenous small RNA-binding protein.


_Nature_ 628, 639–647 (2024). Article  CAS  PubMed  PubMed Central  Google Scholar  * Shannon, C. E. A mathematical theory of communication. _Bell Syst. Tech. J._ 27, 379–423 (1948). Article


  Google Scholar  * Jaccard, P. The distribution of the flora in the alpine zone. _New Phytol._ 11, 37–50 (1912). Article  Google Scholar  * Schmidt, F. et al. Noninvasive assessment of gut


function using transcriptional recording sentinel cells. _Science_ 376, eabm6038 (2022). Article  CAS  PubMed  PubMed Central  Google Scholar  * Bhattarai-Kline, S. et al. Recording gene


expression order in DNA by CRISPR addition of retron barcodes. _Nature_ 608, 217–225 (2022). Article  CAS  PubMed  PubMed Central  Google Scholar  * Nambiar, T. S., Baudrier, L., Billon, P.


& Ciccia, A. CRISPR-based genome editing through the lens of DNA repair. _Mol. Cell_ 82, 348–388 (2022). Article  CAS  PubMed  PubMed Central  Google Scholar  * Balakrishnan, L. &


Bambara, R. A. Flap Endonuclease 1. _Annu. Rev. Biochem._ 82, 119–138 (2013). Article  CAS  PubMed  PubMed Central  Google Scholar  * Sallmyr, A., Rashid, I., Bhandari, S. K., Naila, T.


& Tomkinson, A. E. Human DNA ligases in replication and repair. _DNA Repair_ 93, 102908 (2020). Article  CAS  PubMed  PubMed Central  Google Scholar  * Bothmer, A. et al.


Characterization of the interplay between DNA repair and CRISPR/Cas9-induced DNA lesions at an endogenous locus. _Nat. Commun._ 8, 13905 (2017). Article  CAS  PubMed  PubMed Central  Google


Scholar  * Trojan, J. et al. Functional analysis of hMLH1 variants and HNPCC-related mutations using a human expression system. _Gastroenterology_ 122, 211–219 (2002). Article  CAS  PubMed 


Google Scholar  * Koeppel, J. et al. Prediction of prime editing insertion efficiencies using sequence features and DNA repair determinants. _Nat. Biotechnol._ 41, 1446–1456 (2023). Article


  CAS  PubMed  PubMed Central  Google Scholar  * Balzano, E., Pelliccia, F. & Giunta, S. Genome (in)stability at tandem repeats. _Semin. Cell Dev. Biol._ 113, 97–112 (2021). Article  CAS


  PubMed  Google Scholar  * Truong, D.-J. J. et al. Exonuclease-enhanced prime editors. _Nat. Methods_ 21, 455–464 (2024). Article  CAS  PubMed  PubMed Central  Google Scholar  * Bianconi,


E. et al. An estimation of the number of cells in the human body. _Ann. Hum. Biol._ 40, 463–471 (2013). Article  PubMed  Google Scholar  * Gibson, D. G. et al. Complete chemical synthesis,


assembly, and cloning of a mycoplasma genitalium genome. _Science_ 319, 1215–1220 (2008). Article  CAS  PubMed  Google Scholar  * Koblan, L. W. et al. Improving cytidine and adenine base


editors by expression optimization and ancestral reconstruction. _Nat. Biotechnol._ 36, 843–846 (2018). Article  CAS  PubMed  PubMed Central  Google Scholar  * Fonseca, J. P. et al. A


toolkit for rapid modular construction of biological circuits in mammalian cells. _ACS Synth. Biol._ 8, 2593–2606 (2019). Article  CAS  PubMed  Google Scholar  * Clement, K. et al.


CRISPResso2 provides accurate and rapid genome editing sequence analysis. _Nat. Biotechnol._ 37, 224–226 (2019). Article  CAS  PubMed  PubMed Central  Google Scholar  * Zhang, J., Kobert,


K., Flouri, T. & Stamatakis, A. PEAR: a fast and accurate Illumina paired-end read merger. _Bioinformatics_ 30, 614–620 (2013). Article  PubMed  PubMed Central  Google Scholar  *


Satopää, V., Albrecht, J., Irwin, D. & Raghavan, B. Finding a ‘kneedle’ in a haystack: detecting knee points in system behavior. In _Proc._ _2011 31st_ _International Conference on


Distributed Computing Systems Workshop_ 166–171 (IEEE, 2011). * Müllner, D. Modern hierarchical, agglomerative clustering algorithms. Preprint at a_rXiv_


https://doi.org/10.48550/arXiv.1109.2378 (2011). * Robinson, D. F. & Foulds, L. R. Comparison of phylogenetic trees. _Math. Biosci._ 53, 131–147 (1981). Article  Google Scholar  *


Huerta-Cepas, J., Serra, F. & Bork, P. ETE 3: reconstruction, analysis, and visualization of phylogenomic data. _Mol Biol Evol_ 33, 1635–1638 (2016). Article  CAS  PubMed  PubMed Central


  Google Scholar  Download references ACKNOWLEDGEMENTS We thank C. Duong and S. K. Paul for technical assistance; L. Koblan, J.J. Li, K. Simeonov, members of the Liu Laboratory, O.


Razorenova and J. Woytash for helpful discussions and D. Liu (Broad Institute), T. Lu (MIT) and H. El-Samad (UCSF) for plasmids. This work was funded by National Institutes of Health (NIH)


grant nos. R35GM136297, DP2GM119163 and R21GM126287 to C.C.L.; NIH grant nos. K99GM140254 and R00GM140254, and start-up funds from Rice University to T.B.L.; NSF GRFP and AHA Predoctoral


Fellowships to C.K.C. and a fellowship from the NSF-Simons Center for Multiscale Cell Fate Research (NSF Award no. 1763272) to T.B.L. V.J.H. is supported by Medical Scientist Training


Program grant no. T32-GM008620. Sequencing on the Illumina MiniSeq was performed by the Genetic Design and Engineering Center at Rice University, which receives financial support from the


Cancer Prevention and Research Institute of Texas (Award no. RP210116). This work used resources of the UCI GRT Hub, parts of which are supported by NIH grants to the Comprehensive Cancer


Center (grant no. P30CA-062203) and the UCI Skin Biology Resource Based Center (grant no. P30AR075047) at the University of California, Irvine, as well as to the GRT Hub for instrumentation


(grant nos. 1S10OD010794-01 and 1S10OD021718-01). AUTHOR INFORMATION Author notes * These authors contributed equally: Theresa B. Loveless, Courtney K. Carlson. AUTHORS AND AFFILIATIONS *


Department of Biomedical Engineering, University of California, Irvine, CA, USA Theresa B. Loveless, Courtney K. Carlson, Catalina A. Dentzel Helmy, Vincent J. Hu, Guohao Liang, Michelle


Ficht, Arushi Singhai, Marcello J. Pajoh-Casco & Chang C. Liu * Center for Synthetic Biology, University of California, Irvine, CA, USA Theresa B. Loveless, Courtney K. Carlson, Catalina


A. Dentzel Helmy, Vincent J. Hu, Guohao Liang, Michelle Ficht, Arushi Singhai, Marcello J. Pajoh-Casco & Chang C. Liu * NSF-Simons Center for Multiscale Cell Fate, University of


California, Irvine, CA, USA Theresa B. Loveless & Chang C. Liu * Department of BioSciences, Rice University, Houston, TX, USA Theresa B. Loveless, Catalina A. Dentzel Helmy, Sara K.


Ross, Matt C. Demelo & Ali Murtaza * Graduate Program in Mathematical, Computational and Systems Biology, University of California, Irvine, CA, USA Vincent J. Hu * Department of


Chemistry, University of California, Irvine, CA, USA Chang C. Liu * Department of Molecular Biology and Biochemistry, University of California, Irvine, CA, USA Chang C. Liu Authors * Theresa


B. Loveless View author publications You can also search for this author inPubMed Google Scholar * Courtney K. Carlson View author publications You can also search for this author inPubMed 


Google Scholar * Catalina A. Dentzel Helmy View author publications You can also search for this author inPubMed Google Scholar * Vincent J. Hu View author publications You can also search


for this author inPubMed Google Scholar * Sara K. Ross View author publications You can also search for this author inPubMed Google Scholar * Matt C. Demelo View author publications You can


also search for this author inPubMed Google Scholar * Ali Murtaza View author publications You can also search for this author inPubMed Google Scholar * Guohao Liang View author publications


You can also search for this author inPubMed Google Scholar * Michelle Ficht View author publications You can also search for this author inPubMed Google Scholar * Arushi Singhai View


author publications You can also search for this author inPubMed Google Scholar * Marcello J. Pajoh-Casco View author publications You can also search for this author inPubMed Google Scholar


* Chang C. Liu View author publications You can also search for this author inPubMed Google Scholar CONTRIBUTIONS T.B.L. and C.C.L. conceived the project. T.B.L., C.K.C. and C.C.L.


coinvented the recording architecture. C.K.C. wrote code used for analysis in all figures. Experiments were carried out as follows: Fig. 2 (and associated Extended Data and Supplementary


figures) by T.B.L., C.K.C., C.A.D.H., G.L., M.F., S.K.R. and A.S.; Fig. 3 by C.A.D.H., S.K.R., T.B.L., M.J.P.-C. and M.C.D.; Fig. 4 by T.B.L., M.F. and C.K.C. and Figs. 5 and 6 by C.A.D.H


and T.B.L. Analysis was carried out as follows: Fig. 2 by C.K.C. and T.B.L.; Fig. 3 by T.B.L., C.K.C. and M.J.P.-C.; Fig. 4 by V.J.H. and T.B.L.; Fig. 5 by M.C.D. and Fig. 6 by C.K.C.,


T.B.L. and A.M. Supervision was done and funding acquired by C.C.L and T.B.L. All figures were created by T.B.L. Visualizations were made as follows: C.K.C. made the Graphical Abstract,


Figs. 1 and 4a, Extended Data Figs. 3b, 7 and 10 and Supplementary Figs. 1c and 6b and M.C.D. made Fig. 2d. T.B.L., C.K.C. and C.C.L. wrote the manuscript, which was reviewed and approved by


all authors. CORRESPONDING AUTHORS Correspondence to Theresa B. Loveless or Chang C. Liu. ETHICS DECLARATIONS COMPETING INTERESTS The authors declare no competing interests. PEER REVIEW


PEER REVIEW INFORMATION _Nature Chemical Biology_ thanks Harris Wang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. ADDITIONAL INFORMATION


PUBLISHER’S NOTE Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. EXTENDED DATA EXTENDED DATA FIG. 1 SCREEN FOR


EFFICIENT INSERTION SEQUENCES. A, Efficient pegRNAs that target 6xHis and insert a different 20-bp sequence (A→B) were rare. A high-throughput screen was used to identify pegRNAs that could


alternate with the 20-bp variant of the 6xHis tag to iteratively edit the peCHYRON locus. In the screen, a library of ~100,000 pegRNAs with variable insertion sequences was transfected into


293T cells along with PE2. High enrichment values correspond to pegRNAs that frequently edited the 6xHis target site without being over-represented in the original transfection mix. The


50,000 library members whose enrichment could be calculated were plotted, and the inset shows the top 50. B, Sequences that can efficiently insert downstream of 6xHis were identified. Top


hits and some non-enriched control sequences from the high-throughput screen were individually cloned, and their insertion efficiencies were assayed. All experiments were conducted in


293T6xHis cells with PEmax; we used lower concentrations of plasmids than usual for this transfection, so insertion efficiencies are expected to be lower than in other experiments. Insertion


sequences are ordered from left to right from most-enriched to least-enriched in the original screen. Each point represents one of three separately-transfected biological replicates, and


error bars correspond to the standard deviation. EXTENDED DATA FIG. 2 IDENTIFICATION OF EFFICIENT PROPAGATING PEGRNAS. A, Heatmap of editing efficiencies for the top ten A→B propagator


sequences identified in an initial screen similar to Extended Data Fig. 1b. For each A→B propagator sequence (B1-B10), a library of signatures was tested, using PBS lengths ranging from 9-15


nts (row labels). B, Heatmap of editing efficiencies for pegRNAs that target the top ten propagator sequences shown in (a) and insert the 20-bp 6xHis sequence (B→A). Libraries encoding


pegRNAs with all possible signatures, at each PBS length from 9-15 nts (row labels), were transfected into 293T6xHis cells in which the corresponding B sequence had been installed at the


recording locus. C, A→B and B→A pegRNAs can achieve propagation. Plasmids expressing PE2 and the top-performing pegRNA libraries identified in (a) and (b) were transfected into 293T6xHis


cells. The lengths of insertions at the recording locus were assayed after 2 rounds of transfection over 6 days. Insertions of 20 bp were considered to correspond to 1 round of recording, 40


 bp to 2 rounds, and so forth. D, A variety of signatures can be installed by the top-performing A-B4. libraries of pegRNAs. From the screens shown in (a) and (b), the log2 enrichment of


each of the 64 possible signatures was calculated for the A→B (left) and B→A (right) pegRNA libraries. EXTENDED DATA FIG. 3 CANNIBALIZATION BY PECHYRON. A, Editing at the recording locus and


pegRNA genes in peCHYRON cells. Prime editor and pegRNAs were integrated into the genome of 293T6xHis cells as PiggyBac Transposase cargo, and integrants were selected with puromycin. 45


days after transfection, the recording locus (where editing is desired) and all pegRNA-expressing loci (where cannibalization occurs) were sequenced. The graph indicates the proportion of


loci that were edited. Data from loci expressing A→B or B→A pegRNAs from each of three types of U6 promoters, human (h), mouse (m), and bovine (b), are shown. B, More detailed cartoon


showing cannibalization of B→A pegRNA gene by the A→B pegRNA. After transcription and coupling with prime editor, the spacer sequence of the A→B pegRNA can recognize either the target site


at the recording locus or the PAM-adjacent target site on the antisense strand of the B→A pegRNA gene. Note the B→A pegRNA only inserts 17 nt of the target sequence recognized by the A→B


pegRNA, but the complete 20-bp target site and PAM are present in the B→A pegRNA gene because prime editing requires a scaffold sequence that complements with the region downstream of the


nick, to enable flap equilibration after reverse transcription. This cartoon only shows cannibalization of the B→A pegRNA gene, but cannibalization can also occur in the inverse direction


(the A→B pegRNA gene is cannibalized by the B→A pegRNA). EXTENDED DATA FIG. 4 LONG-TERM RECORDING WITH PECHYRON. A, At 27 days but not 46 days, the extent of cannibalization with and without


the protective nick were significant by two-tailed unpaired t test (P < 0.0001 for A→B, t = 16.84, df=4, difference between means = 10.72 ± 0.6366, 95% confidence interval 8.955 to


12.49, and P = 0.005 for B→A, t = 5.594, df=4, difference between means = 20.79 ± 3.717, 95% confidence interval 10.47 to 31.11). The unpaired t test assumes equal within-group variance,


which was confirmed by F test in both cases. Throughout this figure, each dot represents one of three biological replicates. B, At early timepoints, in the presence of the protective nicking


guide, we noted that most cannibalization led to shorter pegRNA genes, while most cannibalization in the absence of the protective nick led to the longer genes that would be created by the


expected prime edits. The measure of center is the mean and error bars reflect the standard deviation of 3 biological replicates. C, The fitness cost of peCHYRON components was acceptable,


but silencing occurred. At the time of initial cell selection, 6 days after transfection, 5,000 cells from each protective-nicking guide replicate were each mixed with 5,000 cells from the


parent, non-fluorescent cell line. The proportion of blue, red, and green fluorescent cells was determined by flow cytometry at the indicated time points. The number of cells


‘triple-positive’ for all colors, or positive for any color, is plotted. D, Silencing was tracked as in c. E, Cells tended to lose recording loci over time. In the parent cell line, the


first A site is knocked into one copy of the site3 locus, of which there are three copies in 293T cells. The proportion of site3 reads aligning to the locus including the A site was


calculated at the timepoints indicated, in the cells containing the protective nicking guide. EXTENDED DATA FIG. 5 CHARACTERIZATION OF PECHYRON PERFORMANCE. A more expansive plotting of the


data shown in Fig. 3b. EXTENDED DATA FIG. 6 RATES OF PECHYRON RECORDING. A, Editing rates declined only gradually over the course of the experiment shown in Fig. 3. Based on the distribution


of edit lengths detected at each timepoint, we calculated the rate of editing between each pair of timepoints as the average change in edits from time t to time t + 1. The plotted value at


each timepoint is the calculated rate for the period ending at that timepoint. For example, the value at 27 days is the rate from 0-27 days, the value at 34 days is the rate from 27-34 days,


_etc_. Each line reflects the editing rates of one biological replicate. Transient negative values likely reflect a slightly lower growth rate in cells that edit faster. B, As in a, but


with a single rate calculated from 41-66 days. C, Rates of editing were maintained as arrays grew longer. For the period from 27 to 34 days, the rate of each conversion step (for example,


step 1 converts a locus with no edits to one with a single edit) was calculated for each replicate. D, peCHYRON recording cell lines were created from 293 T and SV40-immortalized mouse


embryonic fibroblasts in parallel and grown for 15 days. Two technical replicates for each of two biological replicates are shown. Source data EXTENDED DATA FIG. 7 RECORDING OF TRANSFECTION


ORDER WITH PECHYRON. A, General approach to recording pulses (or any time-dependent signals) that are linked to expression of pegRNAs with known signature mutations. A pair of A→B and B→A


pegRNAs are expressed together for any signal to be recorded, so that one pegRNA of the pair will record the signal regardless of the state (A or B) of the active link in the recording


locus, and both pegRNAs can continuously record the signal if it is present for a long duration. Colored shapes represent signature mutations. B, Methods of peCHYRON sequence analysis. For


the simplest analysis, which treatments happened in a cell population can be inferred by calculating the proportion of each unigram, or type of signature. The order of treatments can be


deciphered by considering the extended bigrams of signature mutations it contains. Essentially, each signature forms one bigram with every signature that appears after it in a read.


Alternatively, records can be analyzed by stretching each record to a constant length, so it may be pooled with all other records from a population of cells and used to calculate the


proportion of each signature at each position along the stretched records. EXTENDED DATA FIG. 8 RECORDING OF INDUCER DURATION WITH PECHYRON. A, As described for Fig. 5a, the proportions of


signatures in Replicate 1 samples that were D were calculated. We then applied k-means clustering on these values, choosing the cluster number by automatic evaluation of an elbow plot by


identifying the number of clusters for which the second derivative of the inertia is a peak; bars are colored based on cluster membership. B, As in a, for Replicate 2. C, As in a, but for I.


D, As in c, for Replicate 2. EXTENDED DATA FIG. 9 RECORDING OF INDUCER ORDER WITH PECHYRON. A, As described for Fig. 5b, ratios of DC to CD bigrams were calculated. In this case, we


calculated bigrams only for those samples that did not cluster with the lowest D proportions in Extended Data Figure 8a. We then applied k-means clustering to the log2 of the bigram ratios,


choosing the cluster number automatically as in Extended Data Figure 8, and colored bars based on their cluster membership. These data are for Replicate 1. B, As in a, for Replicate 2. C, As


in a, but ratios of IC to CI bigrams. D, As in c, for Replicate 2. E, As in a, but ratios of DI to ID bigrams. F, As in e, for Replicate 2. EXTENDED DATA FIG. 10 SIGNATURE LOCATION ON THE


PECHYRON ARRAY AS A PROXY FOR TIME. Stretched records underlying the analysis shown in Fig. 6. Points represent biological replicates and lines are the mean of the biological replicates.


Data is organized according to the inducer, or lack thereof, that each sample was exposed to during the 3 epochs. SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION Supplementary Figs. 1–7


and Table 1. REPORTING SUMMARY SUPPLEMENTARY DATASET 1 Detailed results from screen shown in Extended Data Fig. 1a. SUPPLEMENTARY DATASET 2 Data and entropy calculations underlying Extended


Data Fig. 6 and Fig. 4. SUPPLEMENTARY DATASET 3 Guide to plasmids used, HTS datasets available at the NCBI Sequence Read Archive, HTS primers and cell lines. SOURCE DATA SOURCE DATA EXTENDED


DATA FIG. 6 Detailed calculations. RIGHTS AND PERMISSIONS Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing


agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement


and applicable law. Reprints and permissions ABOUT THIS ARTICLE CITE THIS ARTICLE Loveless, T.B., Carlson, C.K., Dentzel Helmy, C.A. _et al._ Open-ended molecular recording of sequential


cellular events into DNA. _Nat Chem Biol_ 21, 512–521 (2025). https://doi.org/10.1038/s41589-024-01764-5 Download citation * Received: 07 July 2023 * Accepted: 29 September 2024 * Published:


14 November 2024 * Issue Date: April 2025 * DOI: https://doi.org/10.1038/s41589-024-01764-5 SHARE THIS ARTICLE Anyone you share the following link with will be able to read this content:


Get shareable link Sorry, a shareable link is not currently available for this article. Copy to clipboard Provided by the Springer Nature SharedIt content-sharing initiative