← 返回首页
RT tetsuo: xAI just dropped a Text to Speech API! 5 voices (eve, ara, rex, sal, leo), inline speech tags, WebSocket streaming, and format support from...
RT tetsuo
xAI just dropped a Text to Speech API!
5 voices (eve, ara, rex, sal, leo), inline speech tags, WebSocket streaming, and format support from high-fidelity WAV all the way down to telephony mulaw.
You can write things like: "So I walked in and [pause] there it was. [laugh] I honestly could not believe it!"
Or wrap sections: It is a secret.
Pauses, laughs, chuckles, sighs, breathing, pitch, speed, volume. Actual expressive control baked into the text itself.
Three lines of curl to get started. No SDK needed.
This pairs with the xAI Realtime API. Voice in, voice out, Grok in the middle. The full stack is there now.
Beta pricing. Go play with it.
xAI just dropped a Text to Speech API!
5 voices (eve, ara, rex, sal, leo), inline speech tags, WebSocket streaming, and format support from high-fidelity WAV all the way down to telephony mulaw.
You can write things like: "So I walked in and [pause] there it was. [laugh] I honestly could not believe it!"
Or wrap sections: It is a secret.
Pauses, laughs, chuckles, sighs, breathing, pitch, speed, volume. Actual expressive control baked into the text itself.
Three lines of curl to get started. No SDK needed.
This pairs with the xAI Realtime API. Voice in, voice out, Grok in the middle. The full stack is there now.
Beta pricing. Go play with it.