Skip to content

chore(model gallery): add kalomaze_qwen3-16b-a3b #5312

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
May 4, 2025

Conversation

mudler
Copy link
Owner

@mudler mudler commented May 4, 2025

Description

This pull request adds a new model entry to the gallery/index.yaml file. The entry provides details about a custom experiment involving the Qwen3 model family, specifically a pruned and modified version of the original 30B MoE.

New model addition:

  • Model name: kalomaze_qwen3-16b-a3b
    • Description: A custom experiment to measure expert activation probabilities and prune the least-used experts per layer in the Qwen3-30B model. The pruned model retains semi-coherent writing capabilities without additional training or distillation. Includes measurement data and exported weights.
    • Files and metadata: Added associated file kalomaze_Qwen3-16B-A3B-Q4_K_M.gguf with its SHA256 hash and URI for retrieval.

Notes for Reviewers

Signed commits

  • Yes, I signed my commits.

@mudler mudler merged commit 6984749 into master May 4, 2025
18 of 20 checks passed
@mudler mudler deleted the models/kalomaze_qwen3-16b-a3b branch May 4, 2025 07:39
Copy link

netlify bot commented May 4, 2025

Deploy Preview for localai ready!

Name Link
🔨 Latest commit f3d5326
🔍 Latest deploy log https://app.netlify.com/sites/localai/deploys/681719a22ee3c70008534bc7
😎 Deploy Preview https://deploy-preview-5312--localai.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant