您的位置: 首页 > 院士专题 > 专题 > 详情页

Prediction of Biological Activities of Volatile Metabolites Using Molecular Fingerprints and Machine Learning Methods

基于分子指纹和机器学习方法的挥发性代谢物生物活性预测

关键词:
来源:
Journal of Telecommunication, Electronic and Computer Engineering 期刊
来源地址:
https://www.researchgate.net/publication/330104758_Prediction_of_Biological_Activities_of_Volatile_Metabolites_Using_Molecular_Fingerprints_and_Machine_Learning_Methods
类型:
学术文献
语种:
英语
原文发布日期:
2018-08-20
摘要:
Volatile metabolites are small molecules, comprise a diverse chemical group with various biological activities and have high vapor pressures under ambient conditions. It is crucial to determine the biological activities of volatile metabolites as they play important roles in chemical ecology and human healthcare. In this study, we have accumulated 341 volatiles emitted by biological species associated with 11 types of biological activities and deposited the data into our database, which is called KNApSAcK Metabolite Ecology Database. Using this dataset, we have developed 72 classification models to predict biological activities of volatile metabolites by using various machine learning methods. Eight types of molecular fingerprints were used to represent the molecules, which are PubChem (881 bits), CDK (1024 bits), Extended CDK (1024bits), MACCS (166 bits), Klekota-Roth (4860 bits), Substructure (307 bits), Estate (79 bits), and atom pairs (780 bits). A new type of fingerprint was also proposed by combining all features of these eight fingerprints (Combine, 9121 bits). The best classification model was developed by our proposed fingerprint (Combine, 9121 bits) trained with gradient boosting method algorithm (GBM) with predictive accuracy at 94.43%. The results indicated that molecular fingerprints and machine learning methods could be useful for predicting biological activities of volatile metabolites.
相关推荐

意 见 箱

匿名:登录

个人用户登录

找回密码

第三方账号登录

忘记密码

个人用户注册

必须为有效邮箱
6~16位数字与字母组合
6~16位数字与字母组合
请输入正确的手机号码

信息补充