Nexa Python SDK
The NexaAI Python SDK provides a comprehensive Python API for on-device AI inference across multiple modalities. It supports Large Language Models (LLM), Vision-Language Models (VLM), text embedding, document reranking, computer vision (OCR), and automatic speech recognition (ASR). It also covers the key-value cache, sampling configuration, chat templates, and error handling.
Choose Your Platform
The NexaAI Python SDK supports multiple platforms with optimized backends. Select your platform for platform-specific setup and examples:
macOS Guide
Complete setup guide for macOS with Apple Silicon optimization using the MLX backend.
Windows x64 Guide
Setup guide for Windows x64 systems with CPU/GPU acceleration.
Windows ARM64 Guide
NPU-optimized guide for Windows ARM64 devices (Snapdragon X Elite).
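If it is unclear which guide applies to a given machine, a short check with Python's standard `platform` module can help. The sketch below is illustrative only and is not part of the SDK: `suggest_guide` is a hypothetical helper, and the mapping simply mirrors the three guides listed above. Note that `platform.machine()` reports the architecture the running interpreter sees, so an x64 Python running under emulation on a Windows ARM64 device may report an x64 value.

```python
import platform


def suggest_guide(system=None, machine=None):
    """Return the name of the matching platform guide, or None.

    Hypothetical helper for illustration; it does not call the NexaAI SDK.
    Arguments default to the values reported for the current interpreter.
    """
    system = (system or platform.system()).lower()
    machine = (machine or platform.machine()).lower()

    if system == "darwin":
        # The macOS guide targets Apple Silicon with the MLX backend.
        return "macOS Guide"
    if system == "windows":
        if machine in ("arm64", "aarch64"):
            return "Windows ARM64 Guide"
        if machine in ("amd64", "x86_64"):
            return "Windows x64 Guide"
    # No platform-specific guide is listed for other combinations.
    return None


if __name__ == "__main__":
    guide = suggest_guide()
    print(guide or "No platform-specific guide listed for this machine")
```

Passing explicit values, for example `suggest_guide("Windows", "ARM64")`, makes it possible to check the mapping without running on the target device.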
Quick Reference
Quickstart
General overview and platform selection guide for getting started.
API Reference
Complete API documentation for all SDK modules and classes.