Decision Transformer: Reinforcement Learning via Sequence Modeling
by UC Berkeley / Google Brain · free · Last verified 2026-03-17
Decision Transformer recasts offline reinforcement learning as a conditional sequence modeling problem, predicting actions given return-to-go, states, and past actions using a causal Transformer. This eliminates the need for temporal difference learning and bootstrapping while achieving competitive performance on Atari and MuJoCo benchmarks.
https://arxiv.org/abs/2106.01345 ↗C
C—Below Average
Adoption: B+Quality: AFreshness: BCitations: FEngagement: F
Specifications
- License
- Open Access
- Pricing
- free
- Capabilities
- offline-rl, return-conditioned-generation, sequence-modeling, policy-learning
- Integrations
- Use Cases
- offline-rl-training, game-playing, continuous-control
- API Available
- No
- Tags
- reinforcement-learning, offline-rl, transformers, sequence-modeling, return-conditioned
- Added
- 2026-03-17
- Completeness
- 100%
Index Score
49Adoption
78
Quality
87
Freshness
65
Citations
0
Engagement
0