This quickstart walks you through installing nexaSDK and running your first model on device.

Installation

Download Here:
arm64 with Qualcomm NPU support
x86_64 with Intel NPU support

Running Your First Model

Currently, we support LLM and Multimodal models. More model type support is coming soon!
Language Model (LLM)
Windows
nexa infer NexaAI/Qwen3-0.6B
Multimodal Model
Windows
nexa infer NexaAI/Qwen2.5-Omni-3B-GGUF
To try other GGUF models, visit Hugging Face, copy the path of any compatible GGUF model (e.g., unsloth/Qwen2.5-VL-3B-Instruct-GGUF), and replace the model path in the command above.
So far, only LLMs (large language models) and VLMs (vision language models) have been tested. More modalities are coming soon!
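If you want to script the model swap described above rather than typing it by hand, the following is a minimal Python sketch. It only assembles and launches the same `nexa infer <model path>` command shown in this quickstart. The helper names `build_infer_command` and `run_infer` are hypothetical, the "owner/model-name" check is a simple sanity check assumed for this sketch, and the script assumes the `nexa` executable is on your PATH.

```python
import shutil
import subprocess


def build_infer_command(model_path: str) -> list[str]:
    """Return the argument list for `nexa infer <model_path>`."""
    owner, sep, name = model_path.strip().partition("/")
    # Sanity check only (an assumption of this sketch): expect "owner/model-name".
    if not sep or not owner or not name or "/" in name:
        raise ValueError(f"expected a path like 'owner/model-name', got {model_path!r}")
    return ["nexa", "infer", f"{owner}/{name}"]


def run_infer(model_path: str) -> int:
    """Run the command if the nexa CLI is on PATH; return its exit code."""
    command = build_infer_command(model_path)
    if shutil.which("nexa") is None:
        # Nothing is launched when the CLI is missing; just show the command.
        print("nexa CLI not found on PATH; would run:", " ".join(command))
        return 1
    return subprocess.run(command).returncode


if __name__ == "__main__":
    # Example model path taken from the text above.
    run_infer("unsloth/Qwen2.5-VL-3B-Instruct-GGUF")
```

Passing the command as a list, rather than a single string, avoids shell quoting issues if a model path ever contains unusual characters.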

Explore CLI Commands

To see a list of all available CLI commands, run:
Windows
nexa -h
