Datasets


Environmental sounds

The list of environmental audio datasets have been launched as DCASE Datalist

Introduction

This small collection of links brings together datasets, tools, and services that support the exploration and analysis of sound—from isolated audio clips and environmental recordings to advanced annotation and augmentation tools. Browse the sections below to find resources tailored to your needs in audio research and development.

If you are looking for environmental audio dataset see DCASE Datalist.

Curated Dataset Lists

Here are some collections of datasets across various domains such as environmental audio, bioacoustics, speech, computer vision, and video. These curated lists are usually maintained by experts and communities to support specialized research and development.

Environmental Audio

Bioacoustics

Speech

Computer Vision

Video

Online Services

Here are some online platforms offering access to isolated sounds, geotagged recordings, and environmental audio. These services range from open community-driven databases to institutionally curated archives, each with unique licensing and access conditions.

Isolated Sounds

Geotagged Recordings

Environmental Sounds

Source-Specific Libraries

Free sound effect libraries by commercial provider:

Tools

A selection of software tools designed to assist with audio annotation, management, and augmentation. Whether you're labeling sound events, managing ecological recordings, or synthesizing soundscapes, these tools might streamline your workflow.

Annotation

  • Label Studio, open source data labeling platform
  • Audacity, audio software with basic annotation capabilities. Use label tracks for the annotations, see more info here.
  • Audio Labeler App in Matlab, Audio annotation tool introduced in Matlab version R2018b.
  • Audio Annotator, Javascript web interface for annotating audio data.
  • ELAN, a linguistic annotation tool to create the textual annotations for audio and video files

Audio Management

  • Panako, acoustic fingerprinting system which can be used to synchronize audio streams as well
  • Pumilio, a Web-Based Management System for Ecological Recordings

Audio Augmentation

Prototypes