Deep Infra
- 109 followers
- United States of America
- https://deepinfra.com
- @DeepInfra
- company/deep-infra
- info@deepinfra.com
Popular repositories Loading
-
deepinfra-node
deepinfra-node PublicOfficial TypeScript wrapper for DeepInfra Inference API
-
text-generation-inference
text-generation-inference PublicForked from huggingface/text-generation-inference
Large Language Model Text Generation Inference
-
langchain
langchain PublicForked from langchain-ai/langchain
⚡ Building applications with LLMs through composability ⚡
Python 1
-
deepinfra-chat
deepinfra-chat PublicSample Next.js ai chat app using Deep Infra inference and Vercel ai sdk
Repositories
- TensorRT-LLM Public Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
deepinfra/TensorRT-LLM’s past year of commit activity - vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
deepinfra/vllm’s past year of commit activity - sglang Public Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
deepinfra/sglang’s past year of commit activity - kilocode Public Forked from Kilo-Org/kilocode
Open Source AI coding assistant for planning, building, and fixing code. We're a superset of Roo, Cline, and our own features. Follow us: kilocode.ai/social
deepinfra/kilocode’s past year of commit activity - ocr-tools Public
deepinfra/ocr-tools’s past year of commit activity - transformers Public Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
deepinfra/transformers’s past year of commit activity - Kokoro-FastAPI Public Forked from remsky/Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
deepinfra/Kokoro-FastAPI’s past year of commit activity - tensorrtllm_backend Public Forked from triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
deepinfra/tensorrtllm_backend’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…