Description
Context
At the moment, it is only possible to split a GGUF file after conversion or quantization. As mentioned by @Artefact2 in this [comment](https://github.com/ggerganov/llama.cpp/pull/6135#issuecomment-2003942162):
> as an alternative, add the splitting logic directly to tools that produce ggufs, like convert.py and quantize.
Proposition
Include split options in convert*.py, and support splits in quantize.
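
To make the proposal more concrete, here is a minimal sketch of how a conversion script might plan its output shards before writing anything. It is not taken from convert*.py or quantize: `TensorInfo`, `plan_splits`, `shard_name`, and the `max_bytes` limit are all hypothetical names, and the `-00001-of-00003.gguf` file naming pattern is an assumption about what the split files should be called.

```python
from dataclasses import dataclass


@dataclass
class TensorInfo:
    """Hypothetical record: a tensor name and its serialized size in bytes."""
    name: str
    n_bytes: int


def plan_splits(tensors, max_bytes):
    """Group tensors, in their original order, into shards of at most max_bytes.

    A single tensor larger than max_bytes is placed in a shard of its own,
    since a tensor is never divided across files in this sketch.
    """
    if max_bytes <= 0:
        raise ValueError("max_bytes must be positive")

    shards = []
    current = []
    current_size = 0
    for tensor in tensors:
        # Start a new shard when adding this tensor would exceed the limit.
        if current and current_size + tensor.n_bytes > max_bytes:
            shards.append(current)
            current = []
            current_size = 0
        current.append(tensor)
        current_size += tensor.n_bytes
    if current:
        shards.append(current)
    return shards


def shard_name(prefix, index, total):
    """Build a shard file name (assumed pattern, 1-based index)."""
    return f"{prefix}-{index:05d}-of-{total:05d}.gguf"


if __name__ == "__main__":
    tensors = [
        TensorInfo("a", 4),
        TensorInfo("b", 4),
        TensorInfo("c", 6),
        TensorInfo("d", 20),
    ]
    shards = plan_splits(tensors, max_bytes=10)
    for i, shard in enumerate(shards, start=1):
        names = [t.name for t in shard]
        print(shard_name("model", i, len(shards)), names)
```

Planning the shards up front means the total shard count is known before the first file is written, which a naming scheme that embeds the total would need.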