Machine Learning Analysis of the Human Initiator Region Reveals Key Features of Different Types of Core Promoters

Rhyne-Carrigg, T. E.; Vo ngoc, L.; Medrano, C.; Gillespie, K. E.; Kadonaga, J. T.

2026-01-05 molecular biology
10.1101/2025.11.21.689830 bioRxiv

The initiator (Inr) is the starting point for the transcription of many genes. Here, we generated highly predictive machine learning models of the human Inr region, determined that the Inr is present in about 60% of natural promoters, identified a novel TATA-specific Inr, and detected the overlapping but functionally distinct TCT motif. Quantitative genome-wide analyses revealed a strict and synergistic interaction between the Inr and DPR, a duality between the TATA and DPR, a flexible and sometimes independent function of the TATA box in relation to the Inr, and different properties of the TCT motif in humans and Drosophila.

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.