Skip to content
Change the repository type filter

All

    Repositories list

    • GPTQModel

      Public
      Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.
      Python
      Apache License 2.0
      875893112Updated May 29, 2025May 29, 2025
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2.4k000Updated Apr 17, 2025Apr 17, 2025
    • LogBar

      Public
      A unified Logger and ProgressBar util with zero dependencies.
      Python
      Apache License 2.0
      0410Updated Apr 1, 2025Apr 1, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      7.7k000Updated Mar 27, 2025Mar 27, 2025
    • rockthem

      Public
      Cuda
      Apache License 2.0
      0000Updated Mar 13, 2025Mar 13, 2025
    • Tokenicer

      Public
      A (nicer) tokenizer you want to use for model inference and training: with all known peventable gotchas normalized or auto-fixed.
      Python
      Apache License 2.0
      2801Updated Mar 12, 2025Mar 12, 2025
    • Python
      Creative Commons Attribution 4.0 International
      1000Updated Mar 6, 2025Mar 6, 2025
    • peft

      Public
      🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
      Python
      Apache License 2.0
      1.9k000Updated Mar 4, 2025Mar 4, 2025
    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      Apache License 2.0
      1.9k000Updated Mar 4, 2025Mar 4, 2025
    • Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it yourself.
      Python
      Apache License 2.0
      11121Updated Mar 1, 2025Mar 1, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      29k000Updated Feb 12, 2025Feb 12, 2025
    • optimum

      Public
      🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
      Python
      Apache License 2.0
      541100Updated Feb 7, 2025Feb 7, 2025