\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*
\*\*\*\*\*\*\*\*\*\*\*\*\*\* English Below \*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*
\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*

This folder contains a dataset of 10 European bird species, carefully labeled and mixed with other sound samples at a controlled signal-to-noise ratio (SNR).

----------------------------------------

Directory structure

The project is organized into four main directories:
- File_origin
- File_cleaned_normalised
- File_mixed
- Programs_used

----------------------------------------

📁 File_origin

├── BirdOnly

This folder contains raw samples from the Xeno-Canto database (only recordings rated *A* or *B*). Each subfolder corresponds to a labeled species class.
- Total number of classes: 10 (European species only).
- Each `.mp3` file has a corresponding annotation file with the same name and the `_Annotation.xml` extension.

Annotation format (one entry per vocalization, fields in this order):
- start time of the vocalization (in seconds)
- end time of the vocalization (in seconds)
- minimum frequency of the vocalization
- maximum frequency of the vocalization
- minimum amplitude (between 0 and -0.5)
- maximum amplitude (between 0 and 0.5)
- class identifier

The mapping between identifiers and species names is defined in the file: labelMap.xml

> 🔾 Note:
> The `_More_Classes` folder contains 10 additional classes, but their annotations are automatically generated and may be inaccurate (not human-validated).

----------------------------------------

├── Noise-ESC-50

Sound samples (5 seconds each) from the ESC-50 dataset, containing non-bird sounds.

├── Noise-Pixabay

Sound samples of rain and wind from the Pixabay platform.

└── Noise-Quebec

Sound samples of forest noise recorded in Quebec.

----------------------------------------

📁 File_cleaned_normalised

├── Bird

Contains bird samples from *File_origin*, with their environmental background removed.
(*Annotation files remain unchanged.*)

└── Noise

Contains noise samples from *File_origin*, cleaned and normalized.

----------------------------------------

📁 File_mixed

đŸ”č (Probably the folder you are looking for) đŸ”č

This folder contains all bird samples mixed with environmental and non-bird sounds, in a biologically consistent manner and at a controlled SNR.

----------------------------------------

📁 Programs_used

List and description of the scripts used to:
- extract files via the Xeno-Canto API,
- clean the recordings according to their annotations,
- mix the bird sounds with the selected noise samples.
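
----------------------------------------

Each `.mp3` file in BirdOnly is paired with an annotation file that has the same name plus the `_Annotation.xml` suffix. The sketch below shows one way to collect these pairs for a species folder. The function names `annotation_path` and `find_pairs` are hypothetical, and the code assumes that the suffix replaces the `.mp3` extension (for example `rec.mp3` and `rec_Annotation.xml`). It does not parse the XML, since the exact tag names are not described here.

```python
from pathlib import Path


def annotation_path(audio_path):
    """Return the expected annotation path for one .mp3 file.

    Assumption: '<name>.mp3' is paired with '<name>_Annotation.xml'
    in the same folder.
    """
    audio_path = Path(audio_path)
    return audio_path.with_name(audio_path.stem + "_Annotation.xml")


def find_pairs(species_dir):
    """Split the .mp3 files of a folder into annotated and unannotated ones.

    Returns (paired, missing):
      paired  - list of (mp3_path, xml_path) tuples whose XML file exists
      missing - list of mp3 paths without an annotation file
    """
    paired, missing = [], []
    for mp3 in sorted(Path(species_dir).glob("*.mp3")):
        xml = annotation_path(mp3)
        if xml.is_file():
            paired.append((mp3, xml))
        else:
            missing.append(mp3)
    return paired, missing
```

Calling `find_pairs` on one species subfolder of BirdOnly gives a quick integrity check before any cleaning or mixing step: a non-empty `missing` list points to recordings that cannot be cleaned according to their annotations.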
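
----------------------------------------

To make "controlled SNR" concrete, here is a minimal sketch of mixing a bird clip with a noise clip at a target SNR. It is not the script from Programs_used. It assumes that SNR is defined as the ratio of mean power over the whole clip, in decibels, and that noise shorter than the bird clip is looped. The names `mean_power` and `mix_at_snr` are hypothetical, and samples are plain lists of floats so that the example needs only the standard library.

```python
import math


def mean_power(samples):
    """Mean squared amplitude of a sequence of samples."""
    return sum(s * s for s in samples) / len(samples)


def mix_at_snr(bird, noise, snr_db):
    """Mix bird and noise so that bird power / noise power matches snr_db.

    Assumptions: both inputs are non-empty sequences of floats at the same
    sample rate; the noise is looped or trimmed to the bird clip's length.
    Returns (mixed_samples, gain_applied_to_noise).
    """
    if not bird or not noise:
        raise ValueError("bird and noise must be non-empty")
    # Loop the noise if it is too short, trim it if it is too long.
    noise = [noise[i % len(noise)] for i in range(len(bird))]
    p_bird = mean_power(bird)
    p_noise = mean_power(noise)
    if p_noise == 0:
        raise ValueError("noise is silent, SNR is undefined")
    # SNR_dB = 10 * log10(p_bird / (gain^2 * p_noise)), solved for gain.
    gain = math.sqrt(p_bird / (p_noise * 10 ** (snr_db / 10)))
    mixed = [b + gain * n for b, n in zip(bird, noise)]
    return mixed, gain
```

Scaling only the noise leaves the bird signal untouched, so the amplitude values stored in the annotation files still describe the vocalization inside the mixture. The mixture itself may exceed the original amplitude range, in which case a final normalization step would be needed.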