Commit aef5c42

chore(model gallery): add qwen3-8b (#5272)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent c059f91 commit aef5c42

1 file changed, +32 -0 lines changed

gallery/index.yaml

Lines changed: 32 additions & 0 deletions
@@ -108,6 +108,38 @@
     - filename: Qwen3-14B.Q4_K_M.gguf
       sha256: ee624d4be12433277bb9a340d3e5aabf5eb68fc788a7048ee99917edaa46494a
       uri: huggingface://MaziyarPanahi/Qwen3-14B-GGUF/Qwen3-14B.Q4_K_M.gguf
+- !!merge <<: *qwen3
+  name: "qwen3-8b"
+  urls:
+    - https://huggingface.co/Qwen/Qwen3-8B
+    - https://huggingface.co/MaziyarPanahi/Qwen3-8B-GGUF
+  description: |
+    Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support, with the following key features:
+
+    Uniquely support of seamless switching between thinking mode (for complex logical reasoning, math, and coding) and non-thinking mode (for efficient, general-purpose dialogue) within single model, ensuring optimal performance across various scenarios.
+    Significantly enhancement in its reasoning capabilities, surpassing previous QwQ (in thinking mode) and Qwen2.5 instruct models (in non-thinking mode) on mathematics, code generation, and commonsense logical reasoning.
+    Superior human preference alignment, excelling in creative writing, role-playing, multi-turn dialogues, and instruction following, to deliver a more natural, engaging, and immersive conversational experience.
+    Expertise in agent capabilities, enabling precise integration with external tools in both thinking and unthinking modes and achieving leading performance among open-source models in complex agent-based tasks.
+    Support of 100+ languages and dialects with strong capabilities for multilingual instruction following and translation.
+
+    Model Overview
+
+    Qwen3-8B has the following features:
+
+    Type: Causal Language Models
+    Training Stage: Pretraining & Post-training
+    Number of Parameters: 8.2B
+    Number of Paramaters (Non-Embedding): 6.95B
+    Number of Layers: 36
+    Number of Attention Heads (GQA): 32 for Q and 8 for KV
+    Context Length: 32,768 natively and 131,072 tokens with YaRN.
+  overrides:
+    parameters:
+      model: Qwen3-8B.Q4_K_M.gguf
+  files:
+    - filename: Qwen3-8B.Q4_K_M.gguf
+      sha256: 376902d50612ecfc5bd8b268f376c04d10ad7e480f99a1483b833f04344a549e
+      uri: huggingface://MaziyarPanahi/Qwen3-8B-GGUF/Qwen3-8B.Q4_K_M.gguf
 - &gemma3
   url: "github:mudler/LocalAI/gallery/gemma.yaml@master"
   name: "gemma-3-27b-it"
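
The new `files` entry pairs a `huggingface://` URI with a `sha256` digest. A reviewer who wants to check a downloaded copy of the GGUF file against that digest could use something like the sketch below. It is not LocalAI's own verification code. The helper names (`split_gallery_uri`, `sha256_of`, `matches_gallery_checksum`) are hypothetical, and the `huggingface://<owner>/<repo>/<file>` layout is an assumption read off the URI in this diff.

```python
import hashlib

# Values copied verbatim from the `files:` entry added in this commit.
EXPECTED_SHA256 = "376902d50612ecfc5bd8b268f376c04d10ad7e480f99a1483b833f04344a549e"
GALLERY_URI = "huggingface://MaziyarPanahi/Qwen3-8B-GGUF/Qwen3-8B.Q4_K_M.gguf"


def split_gallery_uri(uri: str) -> tuple[str, str, str]:
    """Split a huggingface:// URI into (owner, repo, file path).

    Assumption: the URI has the shape huggingface://<owner>/<repo>/<file>.
    """
    prefix = "huggingface://"
    if not uri.startswith(prefix):
        raise ValueError(f"not a huggingface:// URI: {uri!r}")
    parts = uri[len(prefix):].split("/", 2)
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected <owner>/<repo>/<file> in {uri!r}")
    owner, repo, filename = parts
    return owner, repo, filename


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_gallery_checksum(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Compare a local file's digest with the one recorded in the gallery entry."""
    return sha256_of(path) == expected.lower()
```

After downloading the file, `matches_gallery_checksum("Qwen3-8B.Q4_K_M.gguf")` should return `True` if the local copy is byte-identical to the one the gallery entry describes.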
