Installation¶

Requirements¶

Python ≥ 3.10
NVIDIA GPU (24 GB+ for 2B model, 32 GB+ for 8B model)
CUDA 12.x

Install¶

pip install strands-cosmos

zsh users

If you see zsh: no matches found, quote the package name: pip install "strands-cosmos"

Platform Compatibility¶

Platform	GPU	Status
Desktop Linux x86_64	A100 / H100 / RTX 4090	✅
Jetson AGX Thor	Thor 132 GB	✅ (with CUBLAS fix)
Jetson Orin	Orin 32/64 GB	✅ (may need CUBLAS fix)
macOS (Apple Silicon)	❌	No CUDA — use strands-mlx

Jetson Setup¶

On NVIDIA Jetson devices, PyTorch's pip-bundled CUBLAS may not support the GPU architecture. Run the included fix after install:

# Fix CUBLAS (auto-detects if needed, safe on any platform)
strands-cosmos-fix-cublas

# Or check without fixing:
strands-cosmos-fix-cublas --check

# Revert if needed:
strands-cosmos-fix-cublas --revert

→ See Jetson Deployment Guide for details.

Verify¶

from strands_cosmos import CosmosVisionModel

model = CosmosVisionModel(model_id="nvidia/Cosmos-Reason2-2B")
print("✅ Model loaded successfully")

First run

The first run downloads the model from HuggingFace (~5 GB for 2B). Subsequent runs load from cache.

What Gets Installed¶

graph LR
    COSMOS["strands-cosmos"] --> SA["strands-agents<br/><i>Agent framework</i>"]
    COSMOS --> HF["transformers<br/><i>Model loading</i>"]
    COSMOS --> TORCH["torch<br/><i>GPU inference</i>"]
    COSMOS --> TV["torchvision<br/><i>Video decoding</i>"]

    style COSMOS fill:#76b900,color:#fff
    style SA fill:#264653,color:#fff

Cosmos 3 Environments (optional)¶

pip install strands-cosmos is all you need for the Cosmos-Reason2 VLM. The Cosmos 3 omnimodal models use dedicated, uv-managed environments because their backends (vLLM / Diffusers / Cosmos Framework) pin CUDA-specific builds:

# Generator extras via pip (Diffusers + cosmos_guardrail + soundfile):
pip install "strands-cosmos[cosmos3-gen]"

# Or build dedicated, CUDA-matched venvs with the justfile:
git clone https://github.com/cagataycali/strands-cosmos && cd strands-cosmos
just c3-doctor          # reports GPU, driver CUDA, and the torch-backend pairing to use
just c3-setup-reason    # Reasoner: vllm + vllm-cosmos3
just c3-setup-gen       # Generator: diffusers(main) + cosmos_guardrail + soundfile
just c3-setup-framework # Action / world-model: Cosmos Framework (optional)

CUDA pairing

The torch backend must match your driver — CUDA 13 → cu130 + vllm==0.21.0; CUDA 12.8 → cu128 + vllm==0.19.1. just c3-doctor prints the recommendation.

See the Cosmos 3 Guide for full setup and usage.

What's Next¶

Quickstart — Your first Cosmos agent in 5 lines
Video Understanding — Process dashcam, robot, and scene videos
Jetson Deployment — Run on edge hardware
Cosmos 3 Guide — Omnimodal reasoning + generation