LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
by LMSYS / UC Berkeley · free · Last verified 2026-03-17
Introduces LMSYS-Chat-1M, a dataset of one million real-world conversations with 25 state-of-the-art LLMs, collected from the Vicuna demo and the Chatbot Arena website. Analysis of the data reveals diverse usage patterns, safety violations, and human preference signals, making it a valuable resource for safety evaluation, capability assessment, and alignment research.
https://arxiv.org/abs/2309.11998
Grade: B (Above Average)
Adoption: B+ · Quality: A · Freshness: B · Citations: B+ · Engagement: F
Specifications
- License: CC BY-NC 4.0
- Pricing: free
- Capabilities: conversation-analysis, safety-evaluation, usage-pattern-analysis, alignment-research
- Integrations: none listed
- Use Cases: safety-research, conversation-modeling, capability-analysis, research
- API Available: No
- Tags: dataset, evaluation, conversations, real-world, safety, lmsys
- Added: 2026-03-17
- Completeness: 100%
Index Score: 63.8
- Adoption: 72
- Quality: 85
- Freshness: 66
- Citations: 72
- Engagement: 0