Installation
Nexa AI Android SDK is available from Maven Central, the latest version is 0.0.12. You can also find it from Maven Repository which updates a bit slower than Maven Central.Add Dependency
Add the following to yourapp/build.gradle.kts to import NexaSDK:
The default repository is mavenCentral(). You can also add github’s raw repository in the settings.gradle.kts:
Run Your First Model
1
Initialize SDK
Initialize the NexaSDK in your Android application:
2
Download and Load Model
Create a VLM wrapper and load a model for NPU inference. You can download OmniNeural-4B model from huggingface and place it in the folder that your app has permission to access, for example:
/data/data/${your-app-packagename}/files/models/OmniNeural-4BYour app should have permission to access the model folder. For example, in the demo, the model_path is set to
/data/data/com.nexa.demo/files/models/OmniNeural-4B/files-1-1.nexa3
Generate Content
Use the loaded model to generate responses:
Supported Models
We’ve curated collections of compatible models for you to explore:NPU Models
Optimized models for Qualcomm NPU
GGUF Models
Any GGUF format LLM and VLM models
Device Compatibility
NPU Support
NPU Support
- Qualcomm Snapdragon 8 Gen 4
GPU Support
GPU Support
- Qualcomm Adreno GPU
CPU Support
CPU Support
- ARM64-v8a
Next Steps
API Reference
Explore detailed documentation for all model types and learn their usage.
Was this page helpful?