How DNA encodes the start of transcription

Jun Wang,Vikram Agarwal

Published 2024 in Science

ABSTRACT

A deep-learning model reveals the rules that define transcription initiation The central dogma of molecular biology outlines the flow of genetic information from DNA to RNA to proteins. With a limited vocabulary of four nucleotides (A, C, G, and T), DNA encodes an extensive instruction set, including the chromosomal positions where RNAs begin to be transcribed and the magnitudes of their expression. This process, known as transcription initiation (1), begins at transcription start sites (TSSs) and depends on RNA polymerase II recruitment to promoter sequences (2). However, the sequences and rules that govern transcription initiation remain elusive. On page 405 of this issue, Dudnyk et al. (3) use an explainable deep-learning model to find a small set of DNA sequence motifs that predict the position and activity of most TSSs in the human genome. The findings define a set of rules that govern transcription initiation and highlight the potential of using deep-learning approaches to understand how information is genetically encoded.

PUBLICATION RECORD

CITATION MAP

EXTRACTION MAP

CLAIMS

  • No claims are published for this paper.

CONCEPTS

  • No concepts are published for this paper.

REFERENCES

Showing 1-26 of 26 references · Page 1 of 1