🖥️ Supported Devices
NPU models are supported on Qualcomm Neural Processing Units (NPUs):- Nexa CLI: Qualcomm Snapdragon NPU PC
- NexaML: Any Qualcomm NPU device (contact us to request access)
⚙️ Prerequisites
- If you haven’t already, install the nexa-SDK.
- All NPU models require an access token before usage:
- Create an account at sdk.nexa.ai
- Generate a token: Go to Deployment → Create Token
- Activate your SDK: Run the following command in your terminal (replace with your token):
bash
LLM - Language Models
📝 Language models in NPU format. Try out this quick example: Try it out:bash
⌨️ Once model loads, type or paste multi-line text directly into the CLI to chat with the model.
LMM - Multimodal Models
🖼️ Language models that also accept vision and/or audio inputs. LMM in NPU formats. Try out this quick example:bash
⌨️ Drag photos or audio clips directly into the CLI — you can even drop multiple images at once!