Von: kevin.austin via cec conference
Datum: Thu, 25 May 2017
Betreff: [cec-c] Fwd: free large-scale dataset of annotated musical notes from Google
You may know of this already, but Google has released a freely available large-scale dataset of annotated musical notes.
NSynth is an audio dataset containing 305,979 musical notes, each with a unique pitch, timbre, and envelope. For 1,006 instruments from commercial sample libraries, we generated four second, monophonic 16kHz audio snippets, referred to as notes, by ranging over every pitch of a standard MIDI pian o (21-108) as well as five different velocities (25, 50, 75, 100, 127). The note was held for the first three seconds and allowed to decay for the final second.
Some instruments are not capable of producing all 88 pitches in this range, resulting in an average of 65.4 pitches per instrument. Furthermore, the commercial sample packs occasionally contain duplicate sounds across multiple velocities, leaving an average of 4.75 unique velocities per pitch.
The dataset includes „unusual“ instrument sounds synthesized (or morphed) using Google’s neural networks.
A paper describing the dataset and morphing algorithm can be found on arXiv:
Dr. Charalampos Saitis
Humboldt Research Fellow / Humboldt-Forschungsstipendiat
Audio Communication Group / Fachgebiet Audiokommunikation
Berlin Institute of Technology / Technische Universität Berlin
Research Collaborator, Sound of Vision
ISI Foundation / Fondazione ISI