Audio

We are re-imagining audio technology to empower people to be creative without the tools getting in the way. We are working on dramatically simplifying the creation of audio content, while still maintaining high production value. Our work spans a number of audio research areas including speech enhancement, music information retrieval, speech and music synthesis, computational acoustics, spatial audio, and audio event detection. We also work on problems at the intersection of audio with video, augmented reality, and natural language processing. To advance all of these research areas, we develop new machine learning algorithms, novel signal processing algorithms, and new human computer interaction paradigms.

Meet some of our researchersView More

Juan-Pablo Caceres

Research Engineer

Nicholas J. Bryan

Senior Research Scientist

Zeyu Jin

Research Scientist

View our latest publicationsView More

MakeItTalk: Speaker-Aware Talking Head Animation

Zhou, Y., Li, D., Shechtman, E., Echevarria, J., Han, X., Kalogerakis, E. (Nov. 30, 2020)

SIGGRAPH Asia 2020

Few-Shot Drum Transcription in Polyphonic Music

Wang, Y., Salamon, J., Cartwright, M., Bryan, N., Bello, J. (Oct. 11, 2020)

International Society for Music Information Retrieval Conference (ISMIR)

Metric Learning vs Classification for Disentangled Music Representation Learning

Lee, J., Bryan, N., Salamon, J., Jin, Z., Nam, J. (Oct. 11, 2020)

International Society for Music Information Retrieval Conference (ISMIR)

Project SoundSeek

SoundSeek, an experimental technology, allows users to find any sound in an audio track quickly and easily. All the user has to do is select one or more examples of the target sound, and SoundSeek will find everywhere else in the recording where a similar sound occurs, using a few-shot deep learning model.

View our latest newsView All News

Join us!

We are looking for researchers, engineers, and interns to take our technologies to the next level. We're recruiting, and we would love to hear from you!