Change the repository type filter
All
Repositories list
32 repositories
- 📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
- 📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
torchlm
Public💎An easy-to-use PyTorch library for face landmarks detection: training, evaluation, inference, and 100+ data augmentations.🎉lihang-notes
Public📚《统计学习方法-李航: 笔记》 200页PDF,公式细节讲解🎉.github
PublicHGEMM
Publicffpa-attn
Public⚡️FFPA: Extend FlashAttention-2 with Split-D, achieve ~O(1) SRAM complexity for large headdim, 1.8x~3x↑ vs SDPA.🎉xlite-cli
Publictutorial-template
Public templatenetron-vscode-extension
Publicyolov5face-toolkit
PublicYOLO5Face 2021 with MNN/NCNN/TNN/ONNXRuntimefsanet-toolkit
Publicscrfd-toolkit
Public