
# GGUF Models

> GGUF is a popular open format for running large language models efficiently on a wide range of hardware, including Mac, Windows, and Linux. It enables fast, local inference with broad model compatibility.

## ⚙️ **Prerequisites**

* If you haven't already, install the nexa-SDK.
* Below are the GGUF-compatible model types you can experiment with right away.

## **LLM - Language Models**

📝 Language models in GGUF format. Try out this quick example:

```bash bash theme={"dark"}
nexa infer NexaAI/Qwen3-0.6B
```

<Check>
  ⌨️  This starts an interactive chat session (REPL) with the model.
</Check>

***

## **LMM - Multimodal Models**

🖼️ Multimodal models that accept vision and/or audio inputs. Try out this quick example:

```bash bash theme={"dark"}
nexa infer NexaAI/Qwen2.5-Omni-3B-GGUF
```

<Check>
  ⌨️  Drag image or audio files into the conversation input to chat about them.
</Check>

## **Supported Model List**

We have curated a list of top, high-quality models in GGUF format.

<Columns cols={2}>
  <Card icon="text" title="LLMs for GGUF" href="https://huggingface.co/collections/NexaAI/llm-gguf-687edb7a94b7dda443a01f1d" cta="View Hugging Face Collection 🤗" />

  <Card icon="eye" title="Multimodal for GGUF" href="https://huggingface.co/collections/NexaAI/multimodal-gguf-687edb2e756650304c73d8da" cta="View Hugging Face Collection 🤗" />
</Columns>

<Tip>To try other GGUF models, visit Hugging Face, copy the path of any compatible GGUF model (e.g., `unsloth/Qwen2.5-VL-3B-Instruct-GGUF`), and replace the model path in the command above.</Tip>
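As a minimal sketch of the tip above, the snippet below passes the example Hugging Face path to `nexa infer`. The `MODEL` variable and the `command -v` guard are illustrative additions, not part of the SDK; the guard only reports a missing CLI instead of failing with a "command not found" error.

```bash
# Model path taken from the tip above; swap in any compatible GGUF repo.
MODEL="unsloth/Qwen2.5-VL-3B-Instruct-GGUF"

# Illustrative guard: only run inference if the nexa CLI is on PATH.
if command -v nexa >/dev/null 2>&1; then
  nexa infer "$MODEL"
else
  echo "nexa CLI not found; install the nexa-SDK first" >&2
fi
```

Keeping the path in a variable makes it easy to try several models by changing a single line.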

For more advanced models, visit the [Nexa Model Hub](https://sdk.nexa.ai/model). An access token is required to download and use these models. To get one:

* **Create an account** at [sdk.nexa.ai](https://sdk.nexa.ai)
* **Generate a token**: Go to **Deployment → Create Token**
* **Activate your SDK**: Run the following command in your terminal to set your license:

```bash bash theme={"dark"}
nexa config set license '<your_token_here>'
```

## 🙋 **Request New Models**

Missing a model? Vote for it on the **[Nexa Wishlist](/en/community/wishlist)**. We build the most-voted models fast!

You can also submit an issue on the <a href="https://github.com/NexaAI/nexa-sdk/issues" target="_blank">nexa-sdk GitHub</a> or request it in our Discord or Slack community.

