theAIcatchup

35,932 milliseconds. That's what it took initially for the first audio chunk. Now? 50ms on an RTX 5090, with just three lines of tweaked CUDA.

#real-time TTS