NWU Institutional Repository

Unsupervised Fine-tuning of Speaker Diarisation Pipelines using Silhouette Coefficients

dc.contributor.authorVan Wyk, Lucas
dc.contributor.authorDavel, Marelie, H
dc.contributor.authorVan Heerden, Charl
dc.date.accessioned2022-10-27T19:43:02Z
dc.date.available2022-10-27T19:43:02Z
dc.date.issued2021
dc.description.abstractWe investigate the use of silhouette coefficients in cluster analysis for speaker diarisation, with the dual purpose of unsupervised fine-tuning during domain adaptation and determining the number of speakers in an audio file. Our main contribution is to demonstrate the use of silhouette coefficients to perform per-file domain adaptation, which we show to deliver an improvement over per-corpus domain adaptation. Secondly, we show that this method of silhouette-based cluster analysis can be used to accurately determine more than one hyperparameter at the same time. Finally, we propose a novel method for calculating the silhouette coefficient of clusters using a PLDA score matrix as inputen_US
dc.identifier.citationVan Wyk , L et al. Unsupervised Fine-tuning of Speaker Diarisation Pipelines using Silhouette Coefficients, volume 11: 202-216.[https://engineering.nwu.ac.za/must-deep-learning/publications]en_US
dc.identifier.urihttp://hdl.handle.net/10394/40042
dc.language.isoenen_US
dc.publisherSACAIRen_US
dc.subjectSpeaker Diarisationen_US
dc.subject· Unsupervised Fine-Tuningen_US
dc.subject· Domain Adaptationen_US
dc.titleUnsupervised Fine-tuning of Speaker Diarisation Pipelines using Silhouette Coefficientsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Van Wyk-2021-unsupervised-fine-tuning.pdf
Size:
663.22 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed upon to submission
Description: