Lyken17 / hf-torrentLinks
☆39Updated last year
Alternatives and similar repositories for hf-torrent
Users that are interested in hf-torrent are comparing it to the libraries listed below
Sorting:
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year
- Accelerate LLM preference tuning via prefix sharing with a single line of code☆41Updated last month
- ☆31Updated last year
- A tiny, didactical implementation of LLAMA 3☆41Updated 6 months ago
- An simple pytorch implementation of Flash MultiHead Attention☆21Updated last year
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Updated 2 years ago
- Self Reproduction Code of Paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention (MIT CSAIL)☆16Updated last year
- Exploring Diffusion Transformer Designs via Grafting☆33Updated last week
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆40Updated last year
- Patch convolution to avoid large GPU memory usage of Conv2D☆88Updated 5 months ago
- Here we will test various linear attention designs.☆59Updated last year
- ☆37Updated 2 years ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆34Updated last year
- The implementation for MLSys 2023 paper: "Cuttlefish: Low-rank Model Training without All The Tuning"☆45Updated 2 years ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆44Updated 11 months ago
- ☆16Updated last year
- TVMScript kernel for deformable attention☆25Updated 3 years ago
- The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"☆35Updated last week
- ☆21Updated 2 months ago
- The reproduct of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction☆22Updated last year
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!☆40Updated 2 years ago
- Low-bit optimizers for PyTorch☆129Updated last year
- [ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear at…☆101Updated last year
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆44Updated 4 months ago
- Here we collect trick questions and failed tasks for open source LLMs to improve them.☆32Updated 2 years ago
- differentiable top-k operator☆21Updated 5 months ago
- Resa: Transparent Reasoning Models via SAEs☆36Updated 2 weeks ago
- ☆21Updated 3 months ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Updated last year