Efficient lipophilicity prediction of molecules employing deep-learning models
- Title
- Efficient lipophilicity prediction of molecules employing deep-learning models
- Creator
- Datta R.; Das D.; Das S.
- Description
- Lipophilicity, expressed as logP, is a significant physicochemical property and an indicator of the absorption, distribution, metabolism and elimination characteristics of drugs. It is one of the major factors deciding whether a molecule can become a successful drug. Mol2vec is a convenient unsupervised machine learning technique that produces high-dimensional vector representations of molecules and their substructures. The work described here aims to simplify the prediction of logP values with a high degree of accuracy by using Deep Learning (DL) models paired with Mol2vec. It empirically demonstrates that the described DL models paired with Mol2vec achieve results that are much better than those of conventional ML techniques as well as more complex and recent algorithms such as Message-passing Neural Networks (MPNN), Graph Convolution (GC) and Spatial Graph embedding (C-SGEN). The RMSE (Root Mean Square Error) score of our ensemble model is among the best reported so far in the literature. The methods elaborated in this paper are simple yet effective in predicting logP values to a great degree of accuracy, owing to the use of Mol2vec and standard TensorFlow operators. The models employed here can be coded and maintained with much more ease than MPNN, C-SGEN or GC. © 2021 Elsevier B.V.
- Source
- Chemometrics and Intelligent Laboratory Systems, Vol-213
- Date
- 2021-01-01
- Publisher
- Elsevier B.V.
- Subject
- Convolutional neural network; Deep learning; Drug design; Lipophilicity; Long short-term memory; Multi-layer perceptron
- Coverage
- Datta R., Department of Chemistry, CHRIST (Deemed to be University), Hosur Road, Bengaluru, 560029, India; Das D., Intel, Bengaluru, 560103, India; Das S., Sprinklr India Pvt Ltd, Yemalur, Bengaluru, 560037, India
- Rights
- Restricted Access
- Relation
- ISSN: 0169-7439; CODEN: CILSE
- Format
- Online
- Language
- English
- Type
- Article
Collection
Citation
Datta R.; Das D.; Das S., “Efficient lipophilicity prediction of molecules employing deep-learning models,” CHRIST (Deemed To Be University) Institutional Repository, accessed February 27, 2025, https://archives.christuniversity.in/items/show/15784.
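Note on the metric: the Description above reports an RMSE score for an ensemble model. As a minimal sketch of how such a score could be computed, the snippet below averages the logP predictions of several models and evaluates the RMSE of the result. The function names (`rmse`, `ensemble_mean`), the model labels, and all numeric values are hypothetical and made up for illustration; simple averaging is an assumption, since the record does not say how the authors combine their models.

```python
import math


def rmse(y_true, y_pred):
    """Root mean square error between two equal-length sequences."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def ensemble_mean(per_model_predictions):
    """Average predictions element-wise across models.

    Assumption: the ensemble is an unweighted mean. The source record
    does not specify the combination rule actually used.
    """
    n_models = len(per_model_predictions)
    return [sum(values) / n_models for values in zip(*per_model_predictions)]


if __name__ == "__main__":
    # Made-up logP values, for illustration only (not data from the paper).
    y_true = [1.0, 2.0, 3.0]
    predictions = {
        "mlp": [1.2, 1.8, 3.3],
        "cnn": [0.9, 2.1, 2.7],
        "lstm": [0.9, 2.1, 3.0],
    }
    for name, y_pred in predictions.items():
        print(f"{name}: RMSE = {rmse(y_true, y_pred):.3f}")
    combined = ensemble_mean(list(predictions.values()))
    print(f"ensemble: RMSE = {rmse(y_true, combined):.3f}")
```

A lower RMSE means predictions lie closer to the measured logP values, which is the sense in which the Description compares the ensemble against MPNN, GC and C-SGEN.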