


    Histone deacetylase (HDAC) 1, a member of the histone deacetylase family, plays a pivotal role in various tumors. In this study, we collected 7313 human HDAC1 inhibitors with bioactivities to form a dataset. The dataset was then divided into a training set and a test set using two splitting methods: (1) Kohonen's self-organizing map and (2) random splitting. The molecular structures were represented by MACCS fingerprints, RDKit fingerprints, topological torsion fingerprints and ECFP4 fingerprints. A total of 80 classification models were built using five machine learning methods: decision tree (DT), random forest, support vector machine, eXtreme Gradient Boosting (XGBoost) and deep neural network. Model 15A_2, built with the XGBoost algorithm on ECFP4 fingerprints, showed the best performance, with an accuracy of 88.08% and a Matthews correlation coefficient (MCC) of 0.76 on the test set. Finally, we clustered the 7313 HDAC1 inhibitors into 31 subsets and investigated the substructural features in each subset. Moreover, using the DT algorithm, we analyzed the structure-activity relationship of HDAC1 inhibitors. We conclude that some substructures have a significant effect on high activity, such as N-(2-amino-phenyl)-benzamide, benzimidazole, AR-42 analogues, hydroxamic acid with a mid-chain alkyl and 4-aryl imidazole with a mid-chain alkyl whose α carbon is chiral. © 2022. The Author(s), under exclusive licence to Springer Nature Switzerland AG.
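The abstract reports model quality as accuracy and MCC on the test set. As a minimal sketch of how these two metrics are computed for a binary active/inactive classifier, the following uses only the Python standard library. The function names and the toy labels are illustrative assumptions and are not taken from the study; the paper's own evaluation code may differ.

```python
import math


def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn) for binary labels, where 1 = active, 0 = inactive."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    tp = tn = fp = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            tp += 1
        elif truth == 0 and pred == 0:
            tn += 1
        elif truth == 0 and pred == 1:
            fp += 1
        else:
            fn += 1
    return tp, tn, fp, fn


def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient; 0.0 by convention if the denominator is zero."""
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        return 0.0
    return (tp * tn - fp * fn) / denominator


# Toy labels for illustration only (not data from the study).
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 1]

counts = confusion_counts(y_true, y_pred)
print(accuracy(*counts))  # 0.75
print(mcc(*counts))       # 0.5
```

Unlike accuracy, MCC uses all four cells of the confusion matrix, so it stays informative when the active and inactive classes are imbalanced.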

    Citation

    Rourou Li, Yujia Tian, Zhenwu Yang, Yueshan Ji, Jiaqi Ding, Aixia Yan. Classification models and SAR analysis on HDAC1 inhibitors using machine learning methods. Molecular diversity. 2023 Jun;27(3):1037-1051



    PMID: 35737257
