hrvatski jezikClear Cookie - decide language by browser settings

Pseudo labels approach to interpretable self-guided subspace clustering

Kopriva, Ivica (2025) Pseudo labels approach to interpretable self-guided subspace clustering. Pattern Recognition, 172 (Part C). ISSN 0031-3203

[img] PDF - Published Version - article
Available under License Creative Commons Attribution Non-commercial.

Download (2MB)

Abstract

Majority subspace clustering (SC) algorithms depend on one or more hyperparameters that need to be tuned for the SC algorithms to achieve high clustering performance. This is often performed using grid-search, assuming that held out set is available. In some domains, such as medicine, this assumption does not hold true in many cases. To address this problem, we propose an approach to label-independent hyperparameter optimization by applying the SC algorithm to the data and use the resulting cluster assignments as pseudo-labels to compute clustering quality metrics (e.g., accuracy (ACC) or normalized mutual information (NMI)) across a predefined hyperparameter grid. Assuming that ACC (or NMI) is a smooth function of hyperparameter values, it is possible to select subintervals of hyperparameters, which are then iteratively further split into halves or thirds until a relative error criterion is satisfied. In principle, the hyperparameters of any SC algorithm can be tuned using the proposed method. We demonstrate this approach on five single-view SC algorithms and two multi-view SC algorithms, comparing the achieved performance with their oracle versions across six datasets for single-view algorithms and three datasets for multi-view algorithms. The proposed method typically achieves clustering performance that is up to 7% lower than that of the oracle versions. We also enhance the interpretability of the proposed method by visualizing subspace bases, estimated from the computed clustering partitions. This aids in the initial selection of the hyperparameter search space.

Item Type: Article
Uncontrolled Keywords: subspace clustering; hyperparameter optimization; pseudo labels; interpretability
Subjects: TECHNICAL SCIENCES > Computing
Divisions: Division of Electronics
Projects:
Project titleProject leaderProject codeProject type
Razvoj algoritama grupiranja podataka, regresije i izdvajanja značajki s primjenama u patologiji, oftalmologiji i metabolomiciIvica KoprivaIP-2022-10-6403Znanstveno-istraživački projekti
Depositing User: Ana Zečević
Date Deposited: 05 Nov 2025 13:13
URI: http://fulir.irb.hr/id/eprint/10148
DOI: 10.1016/j.patcog.2025.112618

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

Contrast
Increase Font
Decrease Font
Dyslexic Font
Accessibility