Decision Transformer: Reinforcement Learning via Sequence Modeling
by UC Berkeley / Google Brain · free · Last verified 2026-03-17
Decision Transformer recasts offline reinforcement learning as a conditional sequence modeling problem, predicting actions given return-to-go, states, and past actions using a causal Transformer. This eliminates the need for temporal difference learning and bootstrapping while achieving competitive performance on Atari and MuJoCo benchmarks.
https://arxiv.org/abs/2106.01345 ↗B+
B+—Good
Adoption: B+Quality: AFreshness: BCitations: AEngagement: F
Specifications
- License
- Open Access
- Pricing
- free
- Capabilities
- offline-rl, return-conditioned-generation, sequence-modeling, policy-learning
- Integrations
- Use Cases
- offline-rl-training, game-playing, continuous-control
- API Available
- No
- Tags
- reinforcement-learning, offline-rl, transformers, sequence-modeling, return-conditioned
- Added
- 2026-03-17
- Completeness
- 100%
Index Score
70.6Adoption
78
Quality
87
Freshness
65
Citations
88
Engagement
0
Put AI to work for your business
Deploy this paper alongside autonomous AaaS agents that handle tasks end-to-end — no babysitting required.