ImageBind
Research by Meta AI
More modalities

Using audio to retrieve images

ImageBind can instantly suggest images by using an audio clip as an input. For example, from an audio recording of a bird, the model can generate images of what that bird might look like. Select an audio clip below and ImageBind will retrieve image options corresponding with the audio prompt.

Select audio

Birds singing

A dog barking

Train running