Skip to main content
Datasetinstruction-tuningv200K

UltraChat

by Tsinghua University · open-source · Last verified 2026-03-17

A large-scale, diverse multi-turn conversational dataset with 1.5 million synthesized dialogues spanning questions, information seeking, and creative writing. Generated using ChatGPT with a two-model pipeline to produce high-quality, coherent long-form exchanges for supervised fine-tuning.

https://huggingface.co/datasets/stingning/ultrachat
B
BAbove Average
Adoption: B+Quality: AFreshness: B+Citations: B+Engagement: F

Specifications

License
CC-BY-NC-4.0
Pricing
open-source
Capabilities
instruction-tuning, multi-turn-conversations, supervised-fine-tuning
Integrations
huggingface-datasets
Use Cases
fine-tuning, chatbot-training, dialogue-modeling
API Available
No
Tags
multi-turn, synthetic, conversations, diverse, large-scale
Added
2026-03-17
Completeness
100%

Index Score

66.2
Adoption
78
Quality
80
Freshness
70
Citations
76
Engagement
0

Put AI to work for your business

Deploy this dataset alongside autonomous AaaS agents that handle tasks end-to-end — no babysitting required.

Explore the full AI ecosystem on Agents as a Service