LongTalk-CoT v0.1 - A Very Long Chain-of-Thought Dataset for Reasoning Model Post-Training

A dataset designed for post training o1-like reasoning model. Each response is prompted using QwQ-32B-Preview, and specifically handcrafted system message that encourages more vocalised thinking, and self reflection.

References