Speech Playground
Speech Bench
Compare TTS and ASR models side by side
On-Prem
Third Party
Model
Language
Voice
Language
Voice Type
All
Neural
DragonHD
Voice
Style
Text
Hello, how are you?
Audio Input
Upload File
Record Microphone
Drop audio file here or click to browse
Start Recording
0:00
Stop
Synthesize & Play
Model is cold starting...
The GPU server is waking up from idle. This typically takes 1-3 minutes. Auto-retrying...
Waiting: 0s
Cancel
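The auto-retry behaviour described in the cold-start message could look roughly like the sketch below. It is an illustration only, not the playground's actual client code: `ColdStartError`, `retry_until_warm`, the 5-second retry interval, and the 180-second cap (taken from the upper end of the "1-3 minutes" estimate) are all assumptions.

```python
import time


class ColdStartError(Exception):
    """Hypothetical error a client raises while the model is still waking up."""


def retry_until_warm(request, max_wait_s=180.0, interval_s=5.0,
                     sleep=time.sleep, clock=time.monotonic):
    """Call `request` until it succeeds or the wait budget is used up.

    Returns (result, attempts). `sleep` and `clock` are injectable so the
    loop can be exercised without real waiting.
    """
    start = clock()
    attempts = 0
    while True:
        attempts += 1
        try:
            return request(), attempts
        except ColdStartError:
            waited = clock() - start
            # Give up if another interval would exceed the budget.
            if waited + interval_s > max_wait_s:
                raise TimeoutError(
                    f"model still cold after {waited:.0f}s and {attempts} attempts"
                )
            sleep(interval_s)
```

A caller would pass its own request function, for example one that posts the text to the synthesis endpoint and raises `ColdStartError` when the server reports that it is not ready yet.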
Output
Total (E2E): -
TTFB (Client): -
Server Processing: -
Download: -
Network Overhead: -
Audio Size: -
Audio Duration: -
Transcription Result
Copy
Audio Duration: -
Processing Time: -
Realtime Factor: -
TTFT: -
Tokens: -
Avg Inter-token: -
P99 Inter-token: -
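
As a rough guide to how the transcription metrics above are commonly derived, the sketch below computes them from raw timings. It assumes the usual definitions: realtime factor as processing time divided by audio duration, TTFT as the delay from request start to the first token, and P99 by the nearest-rank method. The page does not state which definitions it uses, and the names `StreamStats`, `summarize`, and `percentile_nearest_rank` are illustrative.

```python
import math
from dataclasses import dataclass


@dataclass
class StreamStats:
    realtime_factor: float   # processing time / audio duration (assumed definition)
    ttft: float              # seconds from request start to first token
    tokens: int
    avg_inter_token: float   # mean gap between consecutive tokens, in seconds
    p99_inter_token: float   # 99th percentile gap, nearest-rank method


def percentile_nearest_rank(values, pct):
    """Smallest value such that at least pct% of the values are <= it."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def summarize(request_start, token_times, processing_time, audio_duration):
    """Build the metrics from token arrival timestamps (all times in seconds)."""
    if not token_times:
        raise ValueError("at least one token timestamp is required")
    if audio_duration <= 0:
        raise ValueError("audio_duration must be positive")
    # Gaps between consecutive token arrivals; empty if only one token arrived.
    gaps = [b - a for a, b in zip(token_times, token_times[1:])]
    return StreamStats(
        realtime_factor=processing_time / audio_duration,
        ttft=token_times[0] - request_start,
        tokens=len(token_times),
        avg_inter_token=sum(gaps) / len(gaps) if gaps else 0.0,
        p99_inter_token=percentile_nearest_rank(gaps, 99) if gaps else 0.0,
    )
```

Under this definition a realtime factor below 1 means the audio was transcribed faster than it plays back. Some tools report the inverse ratio, so it is worth confirming which convention a given benchmark follows before comparing numbers.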