Apple Silicon

Run OmniVoice natively on M-series Macs via PyTorch's MPS backend.

Install

pip install strands-omnivoice
pip install torch==2.8.0 torchaudio==2.8.0

Then confirm that the MPS device is detected:

from strands_omnivoice import omnivoice_sysinfo
print(omnivoice_sysinfo())

The output should include device=mps. The first call to any synthesis tool loads and warms the model, so expect it to be slower than later calls.
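
If device=mps does not appear, it can help to ask PyTorch directly. The sketch below relies on `torch.backends.mps.is_built()` and `torch.backends.mps.is_available()`, which are PyTorch's own checks. The `choose_device` and `detect_device` helpers and their MPS, then CUDA, then CPU fallback order are illustrative assumptions, not part of OmniVoice.

```python
def choose_device(mps_available: bool, cuda_available: bool) -> str:
    """Hypothetical helper: prefer MPS, then CUDA, then CPU."""
    if mps_available:
        return "mps"
    if cuda_available:
        return "cuda"
    return "cpu"


def detect_device() -> str:
    """Ask PyTorch which backends are usable; report CPU if torch is not installed."""
    try:
        import torch
    except ImportError:
        return "cpu"
    # is_built(): this torch build includes MPS support.
    # is_available(): the current machine and macOS version can actually use it.
    mps_ok = torch.backends.mps.is_built() and torch.backends.mps.is_available()
    return choose_device(mps_ok, torch.cuda.is_available())


if __name__ == "__main__":
    print(detect_device())
```

A result other than mps on an M-series Mac usually points at the PyTorch build or the macOS version rather than at OmniVoice.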

export STRANDS_OMNIVOICE_DEVICE=mps   # or cpu, cuda
export STRANDS_OMNIVOICE_DTYPE=float16

The environment variables above apply to every call. To set the device for a single call instead:

omnivoice_tts(text="...", output="...", device="mps")
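
For wrapper scripts that need to choose a device before calling a tool, one way to combine the two mechanisms is sketched below. The precedence (per-call argument first, then `STRANDS_OMNIVOICE_DEVICE`, then a CPU default) and the `resolve_device` helper are assumptions made for illustration; OmniVoice's own resolution logic is not documented here and may differ.

```python
import os
from typing import Mapping, Optional

ALLOWED_DEVICES = ("mps", "cpu", "cuda")  # the values named in this page


def resolve_device(explicit: Optional[str] = None,
                   env: Mapping[str, str] = os.environ,
                   default: str = "cpu") -> str:
    """Hypothetical helper: per-call value, else environment variable, else default."""
    candidate = explicit or env.get("STRANDS_OMNIVOICE_DEVICE") or default
    candidate = candidate.strip().lower()
    if candidate not in ALLOWED_DEVICES:
        raise ValueError(f"unsupported device: {candidate!r}")
    return candidate


if __name__ == "__main__":
    # Uses whatever STRANDS_OMNIVOICE_DEVICE is set to in the current shell.
    print(resolve_device())
```

Validating the value early turns a typo such as `mpss` into an immediate error instead of a failure during model load.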

The first model load on a fresh machine downloads ~1 GB. Subsequent runs reuse the Hugging Face cache (~/.cache/huggingface/hub/).

Whisper ASR (used by omnivoice_transcribe and auto-transcription) currently runs on CPU even when synthesis is on MPS. This is a small one-time cost.
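
To check whether the model weights are already cached before a first run, the cache directory can be inspected with the standard library alone. This is a minimal sketch: `hub_cache_dir` and `dir_size_bytes` are hypothetical helpers, and the lookup honors only the `HF_HUB_CACHE` and `HF_HOME` environment variables before falling back to the default path mentioned above, which simplifies how the Hugging Face libraries resolve the location.

```python
import os
from pathlib import Path
from typing import Mapping


def hub_cache_dir(env: Mapping[str, str] = os.environ) -> Path:
    """Resolve the hub cache directory (simplified: HF_HUB_CACHE, then HF_HOME, then default)."""
    if env.get("HF_HUB_CACHE"):
        return Path(env["HF_HUB_CACHE"]).expanduser()
    if env.get("HF_HOME"):
        return Path(env["HF_HOME"]).expanduser() / "hub"
    return Path.home() / ".cache" / "huggingface" / "hub"


def dir_size_bytes(root: Path) -> int:
    """Sum regular file sizes under root, skipping symlinks so shared blobs are counted once."""
    if not root.is_dir():
        return 0
    return sum(
        p.stat().st_size
        for p in root.rglob("*")
        if p.is_file() and not p.is_symlink()
    )


if __name__ == "__main__":
    cache = hub_cache_dir()
    print(f"{cache}: {dir_size_bytes(cache) / 1e6:.1f} MB")
```

A size near zero suggests the next model load will trigger the download.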