Accuracy scores on MotionBench.
| # | Model | Organization | Frames | LLM Params | Date | Dev Avg (%) | Test Avg (%) | MR (%) | LM (%) | CM (%) | MO (%) | AO (%) | RC (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | TE Fusion | Zhipu AI & Tsinghua | 16 | 9B | 2024-11-25 | 58 | 58 | 64 | 59 | 51 | 69 | 41 | 39 |
| 2 | Qwen2-VL-72B | Alibaba | 1 fps | 72B | 2024-11-25 | 57 | 58 | 58 | 61 | 63 | 72 | 47 | 31 |
| 3 | InternVL2-40B | Shanghai AI Lab | 16 | 34B | 2024-11-25 | 55 | 54 | 54 | 58 | 49 | 76 | 41 | 30 |
| 4 | GLM-4V-Plus | Zhipu AI | 30 | - | 2024-11-25 | 54 | 55 | 57 | 57 | 54 | 69 | 40 | 37 |
| 5 | MiniCPM-V2.6 | Tsinghua | 64 | 7B | 2024-11-25 | 52 | 53 | 56 | 49 | 45 | 72 | 39 | 33 |
| 6 | PLLaVA 34B | Bytedance & NTU | 16 | 34B | 2024-11-25 | 52 | 51 | 55 | 51 | 47 | 66 | 38 | 31 |
| 7 | Gemini 1.5 Pro | | 1 fps | - | 2024-11-25 | 51 | 50 | 51 | 52 | 54 | 67 | 40 | 22 |
| 8 | Oryx-34B | Tsinghua University & Tencent & NTU | 64 | 34B | 2024-11-25 | 49 | 49 | 48 | 52 | 44 | 65 | 42 | 32 |
| 9 | LLaVA-NeXT-Video-DPO (34B) | Bytedance & NTU S-Lab | 32 | 34B | 2024-11-25 | 48 | 40 | 53 | 45 | 36 | 66 | 39 | 23 |
| 10 | CogVLM2-Video | Zhipu AI | 24 | 8B | 2024-11-25 | 41 | 44 | 43 | 39 | 38 | 64 | 37 | 33 |
A green date indicates a newly added or updated model.