SPACENUM: Revisiting Spatial Numerical Understanding in VLMs
by [unverified] · free · Last verified 2026-06-20T07:58:00.623Z
Vision-Language Models (VLMs) are increasingly deployed in embodied environments, where they need produce numerical outputs such as action magnitudes and spatial coordinates. Although these numbers appear meaningful, it remains unclear whether these numerical outputs are genuinely grounded in spa...
http://arxiv.org/abs/2605.23898v1 ↗F
F—Critical
Adoption: FQuality: FFreshness: A+Citations: FEngagement: F
Specifications
- Pricing
- free
- Capabilities
- unverified
- Integrations
- Use Cases
- API Available
- No
- Tags
- auto-discovered
- Added
- 2026-06-20T07:58:00.623Z
- Completeness
- 60%
Index Score
0Adoption
0
Quality
0
Freshness
100
Citations
0
Engagement
0