Skip to main content

Nexa Python SDK

The NexaAI provides a comprehensive Python API for on-device AI inference across multiple modalities. It supports Large Language Models (LLM), Vision-Language Models (VLM), text embedding, document reranking, computer vision (OCR), automatic speech recognition (ASR), text-to-speech (TTS), image generation, speaker diarization, key-value cache, sampling configuration, chat templates and error handling.

Choose Your Platform

The NexaAI Python SDK supports multiple platforms with optimized backends. Select your platform for platform-specific setup and examples:

macOS Guide

Complete setup guide for macOS with Apple Silicon optimization using MLX backend.

Windows x64 Guide

Setup guide for Windows x64 systems with CPU/GPU acceleration.

Windows ARM64 Guide

NPU-optimized guide for Windows ARM64 devices (Snapdragon X Elite).

Quick Reference

Quickstart

General overview and platform selection guide for getting started.

API Reference

Complete API documentation for all SDK modules and classes.