ImageBind can instantly suggest images from an audio clip. For example, given an audio recording of a bird, the model can retrieve images of what that bird might look like. Select an audio clip below and ImageBind will retrieve image options that correspond to the audio prompt.
Birds singing
A dog barking
Train running
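
The audio-to-image retrieval described above can be sketched as a nearest-neighbor search. This is a minimal illustration, assuming the audio clip and the candidate images have already been embedded into one shared vector space. The function name `retrieve_images` and the toy vectors are hypothetical and are not part of ImageBind's API.

```python
import numpy as np


def retrieve_images(audio_embedding, image_embeddings, top_k=3):
    """Rank candidate images by cosine similarity to one audio embedding.

    Hypothetical helper: assumes both inputs live in the same embedding space.
    Returns a list of (image_index, similarity) pairs, best match first.
    """
    # Normalize to unit length so the dot product equals cosine similarity.
    audio = audio_embedding / np.linalg.norm(audio_embedding)
    images = image_embeddings / np.linalg.norm(
        image_embeddings, axis=1, keepdims=True
    )

    # One similarity score per candidate image.
    scores = images @ audio

    # Indices of the highest-scoring images, in descending order.
    best = np.argsort(-scores)[:top_k]
    return [(int(i), float(scores[i])) for i in best]


# Toy 3-dimensional vectors standing in for real embeddings.
audio_clip = np.array([1.0, 0.0, 0.0])
candidates = np.array([
    [0.0, 1.0, 0.0],  # unrelated image
    [2.0, 0.1, 0.0],  # close match
    [0.5, 0.5, 0.0],  # partial match
])

for index, score in retrieve_images(audio_clip, candidates, top_k=2):
    print(f"image {index}: similarity {score:.3f}")
```

In a real system the vectors would come from the model's audio and image encoders, and the candidate set would be a pre-embedded image collection.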