Spark hardware
—
—
curl example
Show technical logs
WhisperX is a single-container speech pipeline (faster-whisper for transcription + pyannote 3.1 for diarization) designed to handle long audio cleanly. Replaces the Parakeet + Sortformer combo we patched together, which crashed on a 90-min meeting. Pulled and built directly on Spark 2 (~10–15 min first time; you only do this once).
Requires a Hugging Face token at ~/.cache/huggingface/token on Spark 2 (already set up).
Always-on services
Speech model patches
Spark Control adds Sortformer speaker diarization to the third-party Parakeet ASR
container via two Python overlays (diarizer.py + a patched main.py).
Overlays survive container restart but not a fresh redeploy — if the parakeet container is
ever rebuilt, click Reapply patches below to restore them.
LLM swap
RedHatAI/...) are best for Blackwell hardware