ImageBind
Research by Meta AI
More modalities

Using audio and images to retrieve related images

Using a prompt that binds audio and images together, people can retrieve related images in seconds. This could be useful for finding images associated with both the visual and aural elements of a video clip. Select from the audio and image prompts below to retrieve image outputs.

Select audio and image

A dog barking

Pouring

Car engine