Levatić, Jurica; Džeroski, Sašo; Supek, Fran; Šmuc, Tomislav
(2013)
Semi-Supervised Learning for Quantitative Structure- Activity Modeling.
Informatica (Ljubljana), 37
(2).
pp. 173-179.
ISSN 0350-5596
Abstract
In this study, we compare the performance of semi- supervised and supervised machine learning methods applied to various problems of modeling quantitative Structure Activity Relationship (QSAR) in sets of chemical compounds. Semi- supervised learning utilizes unlabeled data in addition to labeled data with the goal of building better predictive models than can be learned by using labeled data alone. Typically, labeled QSAR datasets contain tens to hundreds of compounds, while unlabeled data are easily accessible via public databases containing thousands of chemical compounds: this makes QSAR modeling an attractive domain for the application of semi-supervised learning. We tested four different semi-supervised learning algorithms on three different datasets and compared them to five commonly used supervised learning algorithms. While adding unlabeled data does help for certain pairings of dataset and method, semi-supervised learning is not clearly superior to supervised learning across the QSAR classification problems addressed by this study.
Item Type: |
Article
|
Uncontrolled Keywords: |
semi-supervised learning ; supervised learning ; QSAR ; drug design ; machine learning |
Subjects: |
NATURAL SCIENCES > Biology |
Divisions: |
Division of Electronics |
Projects: |
Project title | Project leader | Project code | Project type |
---|
Strojno učenje prediktivnih modela u računalnoj biologiji | Šmuc, Tomislav | 098-0000000-3168 | MZOS |
|
Depositing User: |
Tomislav Šmuc
|
Date Deposited: |
09 Apr 2021 10:51 |
URI: |
http://fulir.irb.hr/id/eprint/6380 |
Actions (login required)
|
View Item |
6380
WOS:000213238700009