NWU Institutional Repository

Unsupervised Fine-tuning of Speaker Diarisation Pipelines using Silhouette Coefficients

Loading...
Thumbnail Image

Date

Researcher ID

Supervisors

Journal Title

Journal ISSN

Volume Title

Publisher

SACAIR

Record Identifier

Abstract

We investigate the use of silhouette coefficients in cluster analysis for speaker diarisation, with the dual purpose of unsupervised fine-tuning during domain adaptation and determining the number of speakers in an audio file. Our main contribution is to demonstrate the use of silhouette coefficients to perform per-file domain adaptation, which we show to deliver an improvement over per-corpus domain adaptation. Secondly, we show that this method of silhouette-based cluster analysis can be used to accurately determine more than one hyperparameter at the same time. Finally, we propose a novel method for calculating the silhouette coefficient of clusters using a PLDA score matrix as input

Sustainable Development Goals

Description

Citation

Van Wyk , L et al. Unsupervised Fine-tuning of Speaker Diarisation Pipelines using Silhouette Coefficients, volume 11: 202-216.[https://engineering.nwu.ac.za/must-deep-learning/publications]

Endorsement

Review

Supplemented By

Referenced By