Paperroboticsv1.0

VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

by Stanford University · free · Last verified 2026-03-17

VoxPoser uses LLMs and vision-language models to synthesize 3D voxel-based value and constraint maps that guide robot motion planners, enabling zero-shot generalization to novel language instructions and object configurations. The approach produces trajectories without any robot-specific training by composing affordance maps in 3D space.

https://arxiv.org/abs/2307.05973 ↗

C—Below Average

Adoption: BQuality: AFreshness: B+Citations: FEngagement: F

Specifications

License: Open Access
Pricing: free
Capabilities: 3d-reasoning, zero-shot-manipulation, value-map-synthesis, motion-planning
Integrations
Use Cases: robotic-manipulation, novel-instruction-following, dexterous-robot-tasks
API Available: No
Tags: robotics, 3d, value-maps, language-models, manipulation, zero-shot
Added: 2026-03-17
Completeness: 100%

Index Score

Adoption

Quality

Freshness

Citations

Engagement

Need this tool deployed for your team?

Get a Custom Setup

Explore the full AI ecosystem on Agents as a Service