A Repetition-based Triplet Mining Approach for Music Segmentation - Département Image, Données, Signal Access content directly
Conference Papers Year : 2023

A Repetition-based Triplet Mining Approach for Music Segmentation


Contrastive learning has recently appeared as a well-suited method to find representations of music audio signals that are suitable for structural segmentation. However, most existing unsupervised training strategies omit the notion of repetition and therefore fail at encompassing this essential aspect of music structure. This work introduces a triplet mining method which explicitly considers repeating sequences occurring inside a music track by leveraging common audio descriptors. We study its impact on the learned representations through downstream music segmentation. Because musical repetitions can be of different natures, we give further insight on the role of the audio descriptors employed at the triplet mining stage as well as the trade-off existing between the quality of the triplets mined and the quantity of unlabelled data used for training. We observe that our method requires less non-annotated data while remaining competitive against other unsupervised methods trained on a larger corpus.
Fichier principal
Vignette du fichier
A Repetition-Based Triplet Mining Approach for Music Segmentation.pdf (5.62 Mo) Télécharger le fichier
Origin Files produced by the author(s)

Dates and versions

hal-04202766 , version 1 (28-09-2023)


  • HAL Id : hal-04202766 , version 1


Morgan Buisson, Brian Mcfee, Slim Essid, Helene-Camille Crayencour. A Repetition-based Triplet Mining Approach for Music Segmentation. International Society for Music Information Retrieval (ISMIR), Nov 2023, Milan, Italy. ⟨hal-04202766⟩
321 View
175 Download


Gmail Mastodon Facebook X LinkedIn More