PhD Positions for 2015

The Interactive Audio Lab of Northwestern University seeks two doctoral students for Fall 2015 to work on research in mixed initiative interfaces (HCI), multi-cue audio source separation algorithms, and machine learning. The goal is to embody these adavances in working systems for media production. Those with interest in audio signal processing, machine learning and human computer interaction (HCI) are encouraged to apply to either the Technology and Social Behavior Program or through the department of Electrical Engineering and Computer Science .


A music search engine you can sing to
Audio Imputation
Estimate missing information in audio
Spatial Source Separation
Amplify sounds coming from a particular direction
Separation by repetition, by repetition, by repetition, ...
Adaptive User Interfaces
Don't learn the tool, let the tool learn you
Separate music sources in real time with the help of a musical score
Score Alignment
Learn about the music
Music Story
Make videos from music
Multi-pitch Estimation & Streaming
Track more than one pitch at once


Demos and Products

  • Reverbalize is a natural-language reverberation tool.
  • SynthAssist is a tool for programming audio synthesizers using vocal imitations.
  • Mixploration rethinks audio mixing from the ground up.
  • Tunebot finds an iTunes version of any song you sing to it.
  • SocialEQ sets your equalizer by having you rate options.
  • Toneboosters TB EZQ, is a VST version of our 2DEQ audio equilizer.
  • SickBeetz turns your beatboxing into drum beats.


  • SocialFX dataset: crowdsourced labels for reverberation, compression, and equalization. Consists of 4297 words from 1233 users. Combines SocialReverb and SocialEQ, and adds data for compression.
  • VocalSketch dataset: 10,000+ vocal imitations and identifications of a large set of diverse sounds.
  • Tunebot dataset: 10,000 sung contributions to Tunebot.
  • SocialReverb dataset: crowdsourced definitions for adjectives describing reverberation. These map between words and reverb settings.
  • SocialEQ dataset: crowdsourced definitions for words to describe equalization. These map from words to the EQ settings that elicit these words
  • Bach10 dataset: a versatile polyphonic music dataset for Multi-pitch Estimation and Tracking, Audio-score Alignment and Source Separation.
  • Jazz Performance dataset: a database of jazz pieces performed by professional Chicago jazz pianists using lead sheets. Performances are alinged to scores.