Inference Playground

Upload an image and provide a language instruction. The model predicts the next action.
