The list of environmental audio datasets has been launched as the DCASE Datalist.
Introduction
This small collection of links brings together datasets, tools, and services that support the exploration and analysis of sound—from isolated audio clips and environmental recordings to advanced annotation and augmentation tools. Browse the sections below to find resources tailored to your needs in audio research and development.
If you are looking for environmental audio datasets, see the DCASE Datalist.
Curated Dataset Lists
Here are some collections of datasets across various domains such as environmental audio, bioacoustics, speech, computer vision, and video. These curated lists are usually maintained by experts and communities to support specialized research and development.
Environmental Audio
Bioacoustics
- Datasets for bioacoustics
- Bioacoustics Datasets list maintained by Justin Salamon
- Bioacoustics Datasets entry in Wikidata
Speech
- Voice datasets list maintained by Jim Schwoebel
- Speech Datasets, ISCA Special Interest Group on Robust Speech Recognition
Computer Vision
Video
- Awesome-Video-Datasets list maintained by Yunhua Zhang
Online Services
Here are some online platforms offering access to isolated sounds, geotagged recordings, and environmental audio. These services range from open community-driven databases to institutionally curated archives, each with unique licensing and access conditions.
Isolated Sounds
- Freesound, isolated sounds, tagged, Creative Commons licensing
- BBC Sound Effects, isolated sounds, textual description, free for research purposes
- Findsounds, isolated sounds, tagged, mixed licensing
- British Library Sound Archive, isolated sounds and live recordings, only available to UK universities, restricted licensing
Geotagged Recordings
Environmental Sounds
Source-Specific Libraries
Free sound effect libraries from commercial providers:
Tools
A selection of software tools designed to assist with audio annotation, management, and augmentation. Whether you're labeling sound events, managing ecological recordings, or synthesizing soundscapes, these tools might streamline your workflow.
Annotation
- Label Studio, open source data labeling platform
- Audacity, audio software with basic annotation capabilities. Use label tracks for the annotations; see more info here.
- Audio Labeler App in MATLAB, an audio annotation tool introduced in MATLAB version R2018b.
- Audio Annotator, a JavaScript web interface for annotating audio data.
- ELAN, a linguistic annotation tool for creating textual annotations for audio and video files
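Audacity label tracks are exported and imported as plain text, with one label per line holding a tab-separated start time, end time, and label text (times in seconds). The sketch below shows one way to convert a list of sound events into that layout and read it back. The function names and the example events are hypothetical, and the parser assumes plain labels without the optional frequency-range lines that Audacity can add for spectral selections.

```python
from typing import Iterable, List, Tuple

# Hypothetical event representation: (onset seconds, offset seconds, label)
Event = Tuple[float, float, str]


def to_audacity_labels(events: Iterable[Event]) -> str:
    """Serialize events as tab-separated lines: start, end, label."""
    lines = []
    for onset, offset, label in sorted(events):
        if offset < onset:
            raise ValueError(f"offset before onset for {label!r}")
        lines.append(f"{onset:.6f}\t{offset:.6f}\t{label}")
    return "\n".join(lines) + "\n"


def from_audacity_labels(text: str) -> List[Event]:
    """Parse label-track text back into (onset, offset, label) tuples."""
    events = []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        start, end, label = line.split("\t", 2)
        events.append((float(start), float(end), label))
    return events


# Made-up example events, for illustration only
example_events = [(3.2, 4.05, "car_horn"), (0.5, 1.75, "dog_bark")]
label_text = to_audacity_labels(example_events)
print(label_text, end="")
```

Text produced this way can be saved to a `.txt` file and loaded into Audacity as a label track, which makes it possible to review or correct automatically generated annotations by hand.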
Audio Management
- Panako, an acoustic fingerprinting system that can also be used to synchronize audio streams
- Pumilio, a Web-Based Management System for Ecological Recordings
Audio Augmentation
- Scaper, soundscape synthesis and augmentation tool
- muda, annotation-aware musical data augmentation, partly applicable to environmental audio (pitch shifting, time stretching). Documentation
- librosa, see time stretching and pitch shifting effects.
- TSM toolbox, MATLAB implementations of various classical time-scale modification (TSM) algorithms.
Prototypes
- Soundscape, a tool for soundscape annotation
- I-SED, an interactive sound event detector, see [Kim2017]
- BAT, BMAT Annotation Tool, see [Melendez-Catalan2017]
- audio-annotator, see [Cartwright2017]