What is the NexaAI Python SDK?
The NexaAI Python SDK provides a comprehensive API for on-device AI inference across multiple modalities. It supports:
- Large Language Models (LLM): Text generation and conversation
- Vision-Language Models (VLM): Multimodal understanding and generation
- Embedder: Text vectorization and similarity computation
- Reranker: Document reranking
- ASR (Automatic Speech Recognition): Speech-to-text transcription
- CV (Computer Vision): OCR/text recognition
Choose Your Platform
The SDK supports multiple platforms with optimized backends. Select your platform for detailed setup instructions:
macOS
Apple Silicon optimized
Python 3.10 • MLX backend • Metal acceleration
Windows x64
CPU/GPU acceleration
Python 3.10 • GGUF models • CUDA support
Windows ARM64
NPU acceleration
Python 3.11-3.13 ARM64 • Snapdragon X Elite
Quick Overview
Installation
Each platform has specific installation requirements. Follow your platform guide for detailed instructions:
- macOS: pip install 'nexaai[mlx]'
- Windows x64: pip install nexaai
- Windows ARM64: pip install nexaai
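As a rough illustration of how the platform table and install commands on this page fit together, the sketch below detects the current machine and reports which command applies. It is not part of the SDK: the platform labels and the function names `detect_platform` and `python_supported` are hypothetical, and reading "Python 3.10" as exactly 3.10 is an assumption that your platform guide may relax.

```python
import platform
import sys

# Requirements as listed on this page. The dictionary keys are labels chosen
# for this sketch, not identifiers used by the SDK.
PLATFORMS = {
    "macos": {
        "install": "pip install 'nexaai[mlx]'",
        "python": [(3, 10)],  # assumption: "Python 3.10" means exactly 3.10
    },
    "windows-x64": {
        "install": "pip install nexaai",
        "python": [(3, 10)],  # same assumption as above
    },
    "windows-arm64": {
        "install": "pip install nexaai",
        "python": [(3, 11), (3, 12), (3, 13)],
    },
}


def detect_platform(system=None, machine=None):
    """Map an OS name and CPU architecture to a label in PLATFORMS, or None."""
    system = (system or platform.system()).lower()
    machine = (machine or platform.machine()).lower()
    if system == "darwin" and machine in ("arm64", "aarch64"):
        return "macos"
    if system == "windows" and machine in ("amd64", "x86_64"):
        return "windows-x64"
    if system == "windows" and machine in ("arm64", "aarch64"):
        return "windows-arm64"
    return None


def python_supported(key, version=None):
    """Check a (major, minor) version against the versions listed for a platform."""
    version = tuple(version or sys.version_info[:2])
    return version in PLATFORMS[key]["python"]


if __name__ == "__main__":
    key = detect_platform()
    if key is None:
        print("This machine does not match a platform listed on this page.")
    else:
        print(f"Detected platform: {key}")
        print(f"Install command:   {PLATFORMS[key]['install']}")
        if not python_supported(key):
            print("Note: this Python version is not one listed for the platform.")
```

The architecture comes from what the running interpreter reports, so an x64 Python installed on ARM64 hardware may be classified as Windows x64. That is worth checking if you intend to use the ARM64 build.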
Authentication
Set up your NexaAI token from https://sdk.nexa.ai/.
Basic Usage
Here's a simple example to get you started:
This is a simplified example. For complete setup instructions, model recommendations, and platform-specific optimizations, please refer to your platform guide above.
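Since a token is required, one option is to check for it before loading any model, so that a missing token fails early with a clear message. The following is a minimal sketch that assumes the token is supplied through an environment variable. The variable name `NEXA_TOKEN` and the helpers `load_token` and `mask` are assumptions made for illustration, not confirmed SDK behavior, so use whatever name your platform guide specifies.

```python
import os

# Assumed variable name for this sketch; replace it with the name given in
# your platform guide if it differs.
TOKEN_ENV_VAR = "NEXA_TOKEN"


def load_token(env=None):
    """Return the token from the environment, or raise if it is missing or blank."""
    env = os.environ if env is None else env
    token = env.get(TOKEN_ENV_VAR, "").strip()
    if not token:
        raise RuntimeError(
            f"{TOKEN_ENV_VAR} is not set. Create a token at https://sdk.nexa.ai/ "
            "and export it before running your script."
        )
    return token


def mask(token, visible=4):
    """Hide all but the first few characters so the token is safe to log."""
    if len(token) <= visible:
        return "*" * len(token)
    return token[:visible] + "*" * (len(token) - visible)


if __name__ == "__main__":
    try:
        print(f"Token found: {mask(load_token())}")
    except RuntimeError as err:
        print(err)
```

Reading the token from the environment rather than hard-coding it keeps it out of source control, and masking it avoids leaking the full value into logs.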
Next Steps
- Choose your platform and follow the detailed setup guide
- Explore the API Reference for comprehensive documentation
- Check out platform-specific examples in your chosen guide