Inference Playground
Upload an image and provide a language instruction. The model predicts the next action.
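As a rough illustration of the two inputs described above, the sketch below packages an image and a language instruction into a single JSON request body. It is only a sketch under stated assumptions: the function name `build_predict_request` and the field names `image` and `instruction` are hypothetical, since the playground's actual request format is not given here, and no network call is made.

```python
import base64
import json


def build_predict_request(image_bytes: bytes, instruction: str) -> str:
    """Serialize one playground query as a JSON string.

    The field names "image" and "instruction" are hypothetical; the
    playground's real request format is not documented here.
    """
    if not image_bytes:
        raise ValueError("image_bytes must not be empty")
    if not instruction.strip():
        raise ValueError("instruction must not be empty")
    payload = {
        # Base64 keeps the raw image bytes safe inside a JSON string.
        "image": base64.b64encode(image_bytes).decode("ascii"),
        # Surrounding whitespace is dropped from the instruction text.
        "instruction": instruction.strip(),
    }
    return json.dumps(payload)


# Usage with placeholder bytes standing in for a real image file.
request_body = build_predict_request(b"\x89PNG placeholder", "pick up the red block")
```

Both inputs are validated up front, which mirrors the form: a query needs an image and a non-empty instruction before an action can be predicted.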