AudioSet by Google

A massive dataset of manually annotated audio events

MakersThere are no makers yet
You need to become a Contributor to join the discussion.
Chris Messina
Chris MessinaHunter@chrismessina · Product designer & entrepreneur
This is remarkable and will do much to advance the state of voice-based computing. Thanks, Uncle Google! AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds. By releasing AudioSet, we hope to provide a common, realistic-scale evaluation task for audio event detection, as well as a starting point for a comprehensive vocabulary of sound events.
Ming Ma
Ming Ma@mingliangma · Founder at Kllect
Thank you so much for sharing this amazing audio dataset! My team is developing a machine learning model that can understand what a video about. This dataset will help us move faster to our goal.
Tom Bielecki
Tom Bielecki@tombielecki · Cofounder, PrintToPeer
I don't consider this a product, it's a dataset, why is it here?