Correlation Engine 2.0
Clear Search sequence regions

  • benchmark (1)
  • dna sequence (1)
  • gene (5)
  • human cells (1)
  • humans (1)
  • respond (6)
  • Sizes of these terms reflect their relevance to your search.

    The ability to predict which genes will respond to the perturbation of a transcription factor serves as a benchmark for our systems-level understanding of transcriptional regulatory networks. In previous work, machine learning models have been trained to predict static gene expression levels in a biological sample by using data from the same or similar samples, including data on their transcription factor binding locations, histone marks, or DNA sequence. We report on a different challenge-training machine learning models to predict which genes will respond to the perturbation of a transcription factor without using any data from the perturbed cells. We find that existing transcription factor location data (ChIP-seq) from human cells have very little detectable utility for predicting which genes will respond to perturbation of a transcription factor. Features of genes, including their preperturbation expression level and expression variation, are very useful for predicting responses to perturbation of any transcription factor. This shows that some genes are poised to respond to transcription factor perturbations and others are resistant, shedding light on why it has been so difficult to predict responses from binding locations. Certain histone marks, including H3K4me1 and H3K4me3, have some predictive power when located downstream of the transcription start site. However, the predictive power of histone marks is much less than that of gene expression level and expression variation. Sequence-based or epigenetic properties of genes strongly influence their tendency to respond to direct transcription factor perturbations, partially explaining the oft-noted difficulty of predicting responsiveness from transcription factor binding location data. These molecular features are largely reflected in and summarized by the gene's expression level and expression variation. Code is available at © The Author(s) 2022. Published by Oxford University Press on behalf of Genetics Society of America.


    Yiming Kang, Wooseok J Jung, Michael R Brent. Predicting which genes will respond to transcription factor perturbations. G3 (Bethesda, Md.). 2022 Jul 29;12(8)

    Expand section icon Mesh Tags

    Expand section icon Substances

    PMID: 35666184

    View Full Text