Seed-OSS is a series of open-source language models developed by ByteDance's Seed Team, designed for powerful long-context, reasoning, agent and general capabilities, and versatile developer-friendly features. We have released Seed-OSS-36B to the open-source community under the Apache-2.0 license. Trained with 12T tokens, Seed-OSS-36B has achieved impressive results on mainstream benchmarks while maintaining good practical performance at a low cost. Key Features Native Long Context: Trained with up-to-512K long context natively. Flexible Control of Thinking Budget: Allowing users to flexibly adjust the reasoning length as needed. This capability of dynamically controlling the reasoning length enhances inference efficiency in practical application scenarios. Enhanced Reasoning Capability: Specifically optimized for reasoning tasks while maintaining balanced and excellent general capabilities. Agentic Intelligence: Performs well in agentic tasks such as tool-using and issue resolving. Research-Friendly: Given that the inclusion of synthetic instruction data in pre-training may affect the post-training research, we released pre-trained models both with and without instruction data, providing the research community with more diverse options.
Notes: 6 FLOP / parameter / token * 36 * 10^9 parameters * 12 * 10^12 tokens = 2.592e+24 FLOP
Size Notes: "Trained with 12T tokens"