Background/Objectives: BrdU (5′-bromo-2′-deoxyuridine), a synthetic thymidine (T) analog, is widely used to study cell proliferation and DNA synthesis. To precisely identify where and when DNA replication starts and terminates, it is essential to determine the BrdU incorporation rate and sites at a single-nucleotide resolution. Although several deep learning-based methods have been developed for detecting BrdU using Oxford nanopore sequencing data, there is a lack of accessible, easy-to-follow tutorials to guide researchers in preparing training data and implementing deep learning approaches as the nanopore sequencing technologies continue to evolve. Methods: Due to the lack of ground truth BrdU-positive data generated on the latest R10 flow cells, we prepared model training data from legacy R9 flow cells, consistent with existing tools. We processed publicly available synthetic and real nanopore DNA sequencing datasets, with and without BrdU incorporation, using a combination of open-source and custom software tools. Subsequently, we trained bidirectional gated recurrent unit (BiGRU)-based recurrent neural networks (RNNs) for BrdU detection using the TensorFlow library on the Google Colab platform. Results: We trained BiGRU-based RNNs for BrdU detection with a high specificity (>94%) but a moderate sensitivity due to limited BrdU-positive data. We detail the setup, training, testing, and fine-tuning of the model using both synthetic and real DNA sequencing data. Conclusions: Though the models were trained with data generated on legacy flow cells, we believe that this detailed protocol, covering both data preparation and model development, can be readily extended to R10 flow cells and basecallers for other base modifications. This work will facilitate the broader adoption of deep learning neural networks in biological research, particularly RNNs, which are well suited for modeling sequential and time-series data.
Training Recurrent Neural Networks for BrdU Detection with Oxford Nanopore Sequencing: Guidance and Lessons Learned
Haibo Liu,William A. Flavahan,L. J. Zhu
Published 2025 in Genes
ABSTRACT
PUBLICATION RECORD
- Publication year
2025
- Venue
Genes
- Publication date
2025-11-01
- Fields of study
Biology, Medicine, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar, PubMed
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-39 of 39 references · Page 1 of 1
CITED BY
- No citing papers are available for this paper.
Showing 0-0 of 0 citing papers · Page 1 of 1