A fast and efficient machine learning assisted prediction of urea and its derivatives to screen crystal propensity with experimental validation

[ X ]

Tarih

2025

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Elsevier Ltd

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

Predicting crystal propensity is crucial yet challenging in various industries where it significantly influences product stability, performance, and efficacy. Predicting a crystal propensity identifies their optimal chemical structures for desired properties including solubility, bioavailability, shelf-life stability etc. Herein, A machine learning (ML) assisted analysis is performed to predict their crystal propensity by collecting a dataset of 6000 non-crystalline and over 200 crystalline urea and its derivatives. The data is trained by employing a Support Vector Machine (SVM) with its Radial Basis Function (RBF) and linear kernels along with Random Forest regression analysis. The trained data is compared with four other ML models, including Linear Regression, Gradient Boosting, Random Forest and Decision Tree Regressions to predict their crystal propensity. It yields an accuracy of 79 % for identifying their non-crystalline compounds and 59 % in predicting crystallization failure. Their dimensionality reduction via t-SNE reveals their distinct clustering patterns to underscore their complex interplay between molecular structure and crystal propensity. Their experimental validation also corroborates the current findings to demonstrate their efficacy to streamline their crystal engineering for pharmaceutical formulation-based workflows. Notably, the number of rotatable bonds and molecular connectivity index (χov) emerges as pivotal descriptors for enabling their accurate classification with minimal input features. This study elucidates its quantitative structure-crystallinity relationship to provide a valuable tool for crystal design and optimization.

Açıklama

Anahtar Kelimeler

Crystal propensity, Gradient boosting, ML, Support vector machine, Urea

Kaynak

Materials Today Communications

WoS Q Değeri

Q2

Scopus Q Değeri

Cilt

43

Sayı

Künye

Güleryüz, C., Sumrra, S. H., Hassan, A. U., Mohyuddin, A., Noreen, S., & Elnaggar, A. Y. (2025). A fast and efficient machine learning assisted prediction of urea and its derivatives to screen crystal propensity with experimental validation. Materials Today Communications, 43, 111692.