Add support for Ultravox #1207

xenova · 2025-02-25T00:43:29Z

Adds support for ultravox models.

Example usage:

import { UltravoxProcessor, UltravoxModel, read_audio } from "@huggingface/transformers";

const processor = await UltravoxProcessor.from_pretrained(
  "onnx-community/ultravox-v0_5-llama-3_2-1b-ONNX",
);
const model = await UltravoxModel.from_pretrained(
  "onnx-community/ultravox-v0_5-llama-3_2-1b-ONNX",
  {
    dtype: {
      embed_tokens: "q8", // "fp32", "fp16", "q8"
      audio_encoder: "q4", // "fp32", "fp16", "q8", "q4", "q4f16"
      decoder_model_merged: "q4", // "q8", "q4", "q4f16"
    },
  },
);

const audio = await read_audio("http://huggingface.co/datasets/Xenova/transformers.js-docs/resolve/main/mlk.wav", 16000);
const messages = [
  {
    role: "system",
    content: "You are a helpful assistant.",
  },
  { role: "user", content: "Transcribe this audio:<|audio|>" },
];
const text = processor.tokenizer.apply_chat_template(messages, {
  add_generation_prompt: true,
  tokenize: false,
});

const inputs = await processor(text, audio);
const generated_ids = await model.generate({
  ...inputs,
  max_new_tokens: 128,
});

const generated_texts = processor.batch_decode(
  generated_ids.slice(null, [inputs.input_ids.dims.at(-1), null]),
  { skip_special_tokens: true },
);
console.log(generated_texts[0]);
// "I can transcribe the audio for you. Here's the transcription:\n\n\"I have a dream that one day this nation will rise up and live out the true meaning of its creed.\"\n\n- Martin Luther King Jr.\n\nWould you like me to provide the transcription in a specific format (e.g., word-for-word, character-for-character, or a specific font)?"

Closes #1202

HuggingFaceDocBuilderDev · 2025-02-25T00:45:39Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

xenova added 2 commits February 25, 2025 00:34

Add support for ultravox

0de064b

WhisperFeatureExtractor: support specifying max length

a80d2a4

xenova added 4 commits February 26, 2025 11:59

Add more whisper feature extraction unit tests

40b50a8

Merge branch 'main' into add-ultravox

5d919b2

Fix links from merge

ddb4c1d

Merge branch 'main' into add-ultravox

714b077

xenova merged commit 3502ddb into main Feb 26, 2025
4 checks passed

xenova deleted the add-ultravox branch February 26, 2025 22:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support for Ultravox #1207

Add support for Ultravox #1207

Uh oh!

xenova commented Feb 25, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Feb 25, 2025

Uh oh!

Uh oh!

Uh oh!

Add support for Ultravox #1207

Add support for Ultravox #1207

Uh oh!

Conversation

xenova commented Feb 25, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Feb 25, 2025

Uh oh!

Uh oh!

Uh oh!