antgroup/HumanSense_Omni_Reasoning • Video-Text-to-Text • 9B • Updated • 26 • 6
Can We Predict Before Executing Machine Learning Agents?
The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text