ImageBind
Research by Meta AI
More modalities

Using an image to retrieve audio

ImageBind can instantly suggest audio by using an image or video as an input. This could be used to enhance an image or video with an associated audio clip, such as adding the sound of waves to an image of a beach. Select an image below and ImageBind will retrieve audio options corresponding with the image prompt.

Select an image