Android SDK (Beta)

Why NPU on mobile
Key Features
- Multiple Model Types: Support for LLM, VLM, Embeddings, ASR, Reranker, and Computer Vision models
- Hardware Acceleration: CPU, GPU, and NPU (Qualcomm Hexagon NPU) support
- Easy Integration: Simple Kotlin/Java API with builder pattern
Supported Models
The following models are supported on Android ARM64 with NPU acceleration :Beyond the NEXA-optimized models listed above, any GGUF model from the community can also run on Qualcomm Hexagon NPU. Use the
cpu_gpu plugin and set device_id = "dev0", powered by the GGML Hexagon backend.Get Started
Quickstart for Android
Start building with Nexa Android SDK in minutes. Follow our step-by-step guide to integrate AI models into your Android application.
Was this page helpful?