Unsupervised Fine-tuning of Speaker Diarisation Pipelines using Silhouette Coefficients
Publisher
SACAIR
Abstract
We investigate the use of silhouette coefficients in cluster analysis for speaker diarisation, with the dual purpose of unsupervised fine-tuning during domain adaptation and determining the number of speakers in an audio file. Our main contribution is to demonstrate the use of silhouette coefficients to perform per-file domain adaptation, which we show to deliver an improvement over per-corpus domain adaptation. Second, we show that this method of silhouette-based cluster analysis can be used to accurately determine more than one hyperparameter at the same time. Finally, we propose a novel method for calculating the silhouette coefficient of clusters using a PLDA score matrix as input.
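As a rough illustration of the general idea of silhouette-based model selection, the sketch below computes the mean silhouette coefficient from a precomputed dissimilarity matrix and keeps the candidate clustering that scores highest. It is not the paper's method: the function names (`silhouette_score`, `pick_best`), the toy one-dimensional data, and the use of absolute differences as dissimilarities are all assumptions made for this example, and how a dissimilarity would be derived from a PLDA score matrix is not described in this record.

```python
from statistics import mean


def silhouette_score(dist, labels):
    """Mean silhouette coefficient from a precomputed dissimilarity matrix.

    dist is a square list of lists; labels assigns one cluster id per item.
    """
    n = len(labels)
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)
    if len(clusters) < 2 or len(clusters) >= n:
        raise ValueError("need between 2 and n-1 clusters")

    scores = []
    for i in range(n):
        own = clusters[labels[i]]
        if len(own) == 1:
            scores.append(0.0)  # usual convention: singleton clusters score 0
            continue
        # a: mean dissimilarity to the other members of the same cluster
        a = sum(dist[i][j] for j in own if j != i) / (len(own) - 1)
        # b: mean dissimilarity to the nearest other cluster
        b = min(
            mean(dist[i][j] for j in members)
            for lab, members in clusters.items()
            if lab != labels[i]
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return mean(scores)


def pick_best(dist, candidates):
    """Return the candidate labelling with the highest mean silhouette."""
    return max(candidates, key=lambda labels: silhouette_score(dist, labels))


# Toy data (hypothetical): two well-separated groups on a line.
points = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]
dist = [[abs(p - q) for q in points] for p in points]

# Candidate clusterings, e.g. produced with different hyperparameter settings.
candidates = [
    [0, 0, 0, 1, 1, 1],  # two clusters
    [0, 0, 1, 1, 2, 2],  # three clusters, one straddling both groups
]
best = pick_best(dist, candidates)
```

In this toy case the two-cluster labelling is selected, since the three-cluster candidate places items from both groups in the same cluster and is penalised with negative silhouette values. The same selection loop could in principle range over several hyperparameters at once, with each setting contributing one candidate labelling.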
Citation
Van Wyk, L. et al. Unsupervised Fine-tuning of Speaker Diarisation Pipelines using Silhouette Coefficients, volume 11: 202-216. [https://engineering.nwu.ac.za/must-deep-learning/publications]
