SEA-LION version 1 was released in December 2023 and trained from scratch with 1 trillion tokens, 13 percent of which are from SEA.
Notes:
Parameter-count estimate: 6 FLOP / parameter / token * 3B parameters * 980B tokens = 1.764 × 10^22 FLOP
Hardware-time estimate: 3.12 × 10^14 FLOP / GPU / sec * 336 hours * 3600 sec / hour * 240 GPUs * 0.3 [assumed utilization] = 2.7172454 × 10^22 FLOP
Geometric mean of the two estimates: sqrt(1.764 × 10^22 * 2.7172454 × 10^22) = 2.1893426 × 10^22 FLOP
Size Notes: "SEA-LION was trained on 980B tokens"
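
The arithmetic in the Notes can be checked with a short script. This is a minimal sketch that only re-derives the three figures from the inputs stated above; the variable names are illustrative, and the 0.3 utilization is the same assumption the Notes make, not a measured value.

```python
import math

# Estimate 1: parameter-count method (6 FLOP per parameter per token).
params = 3e9      # 3B parameters, as stated in the Notes
tokens = 980e9    # 980B training tokens, as stated in the Size Notes
flop_counting = 6 * params * tokens

# Estimate 2: hardware-time method.
peak_flop_per_gpu_sec = 312e12  # per-GPU peak throughput used in the Notes
hours = 336
gpus = 240
utilization = 0.3               # assumed utilization, as in the Notes
flop_hardware = peak_flop_per_gpu_sec * hours * 3600 * gpus * utilization

# Combined figure: geometric mean of the two estimates.
flop_estimate = math.sqrt(flop_counting * flop_hardware)

print(f"parameter-count estimate: {flop_counting:.4e} FLOP")  # 1.7640e+22
print(f"hardware-time estimate:   {flop_hardware:.4e} FLOP")  # 2.7172e+22
print(f"geometric mean:           {flop_estimate:.4e} FLOP")  # 2.1893e+22
```

The geometric mean is used here because the Notes combine the two estimates with a square root of their product; it sits between them on a log scale, so neither method dominates the final figure.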