split: include the option in ./convert.py and quantize #6260

Open
@phymbert

Context

At the moment, a model can only be split after conversion or quantization. As mentioned by @Artefact2 in this [comment](https://github.com/ggerganov/llama.cpp/pull/6135#issuecomment-2003942162):

> as an alternative, add the splitting logic directly to tools that produce ggufs, like convert.py and quantize.

Proposition

Include split options in convert*.py, support splits in quantize
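
To make the proposal more concrete, here is a minimal sketch of how a converter could plan shards before writing anything. It is only an illustration under assumptions: `plan_shards`, `shard_filename`, and the `max_shard_bytes` parameter are hypothetical names, not existing convert.py or quantize options, and the file naming pattern is assumed to follow the `-00001-of-00003.gguf` style used by the existing split tool.

```python
from typing import Dict, List


def plan_shards(tensor_sizes: Dict[str, int], max_shard_bytes: int) -> List[List[str]]:
    """Greedily group tensors, in order, into shards of at most max_shard_bytes.

    Hypothetical helper: a tensor larger than the limit gets a shard of its own.
    """
    shards: List[List[str]] = []
    current: List[str] = []
    current_bytes = 0
    for name, size in tensor_sizes.items():
        # Start a new shard when adding this tensor would exceed the limit.
        if current and current_bytes + size > max_shard_bytes:
            shards.append(current)
            current, current_bytes = [], 0
        current.append(name)
        current_bytes += size
    if current:
        shards.append(current)
    return shards


def shard_filename(prefix: str, index: int, total: int) -> str:
    """Build a shard file name; the pattern is an assumption, indices are 1-based."""
    return f"{prefix}-{index:05d}-of-{total:05d}.gguf"


if __name__ == "__main__":
    # Toy tensor sizes in bytes, purely illustrative.
    sizes = {"token_embd.weight": 4, "blk.0.attn_q.weight": 4, "output.weight": 4}
    plan = plan_shards(sizes, max_shard_bytes=8)
    for i, names in enumerate(plan, start=1):
        print(shard_filename("model", i, len(plan)), names)
```

A size-based limit is just one possible policy; a maximum number of tensors per shard would fit the same structure by counting tensors instead of bytes.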

Labels: enhancement, good first issue, help wanted, split
