Takes base64 encoded audio and streams viseme (mouth shapes) predictions using Server-Sent Events (SSE). Requires valid API key.
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Successful response streams viseme predictions using Server-Sent Events. Each event contains viseme data for a processed audio chunk.
Server-Sent Events stream with the following format: