This page documents all available CLI commands with usage examples.
nexa pull <model-name>
, the CLI will prompt:
nexa infer
.
Describe this picture </path/to/image.png>
Convert this audio into text </path/to/audio.mp3>
-m asr
: Sets the model type to ASR.--input
: Specifies the input audio file.--language
: Sets the language code (e.g., en for English, zh for Chinese).-m TTS
: Sets the model type to TTS.--voice-identifier
: Specifies the speaker’s voice.
--voice-identifier
is provided, NexaCLI will return a full list of supported voices in the error message. This is useful for discovering all available voice options.-p
: The text prompt to synthesize.-o
: Output file for the generated .wav audio.-m embedder
: Sets the model type to Embedder.--prompt
: Provide one or more pieces of text to embed.-m reranker
: Sets the model type to Reranker.--query
: The main query string used to evaluate document relevance.--document
: One or more documents to score against the query.nexa serve
.
nexa run
.
--disable-stream|-s
: disable streaming and respond the entire json back.