Skip to main content

Android SDK (Beta)

Nexa Android SDK NexaSDK for Android enables on-device AI inference for Android applications. It built upon Nexa AI’s proprietary NexaML engine and enables developers to run Large Language Models (LLMs), Vision-Language Models (VLMs), Embeddings, Speech Recognition (ASR), Reranking, and Computer Vision models directly on Android devices with support for NPU, GPU, and CPU inference.

Key Features

  • Multiple Model Types: Support for LLM, VLM, Embeddings, ASR, Reranker, and Computer Vision models
  • Hardware Acceleration: CPU, GPU, and NPU (Qualcomm Hexagon NPU) support
  • Easy Integration: Simple Kotlin/Java API with builder pattern

Get Started

Quickstart for Android

Start building with Nexa Android SDK in minutes. Follow our step-by-step guide to integrate AI models into your Android application.