>
Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient. With Apache-licensed weights, Kokoro can be deployed anywhere from production environments to personal projects.
Notes: 312000000000000 FLOP / GPU / sec * 500 GPU - hours * 3600 sec / hour * 0.3 [assumed utilization] = 1.6848e+20 FLOP
Size Notes: "<100 hrs" of training audio data
Notes: 82M