wjytt / bug-free-pancake
☆10Updated 4 months ago
Alternatives and similar repositories for bug-free-pancake
Users that are interested in bug-free-pancake are comparing it to the libraries listed below
Sorting:
- ☆10Updated 4 months ago
- ☆10Updated 4 months ago
- ☆10Updated 4 months ago
- ☆961Updated 3 months ago
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,254Updated last week
- ☆1,544Updated last year
- whisper.cpp bindings for python☆95Updated last year
- Visualize the intermediate output of Mistral 7B☆360Updated 3 months ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning☆653Updated 11 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆238Updated last year
- ☆543Updated 5 months ago
- ☆532Updated 6 months ago
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language☆305Updated 11 months ago
- Training LLMs with QLoRA + FSDP☆1,477Updated 6 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆238Updated 11 months ago
- LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. …☆817Updated 5 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,311Updated last year
- Ink Web App☆372Updated this week
- A library for making RepE control vectors☆587Updated 4 months ago
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".☆274Updated last year
- For releasing code related to compression methods for transformers, accompanying our publications☆427Updated 4 months ago
- Python bindings for whisper.cpp☆247Updated last week
- Port of Facebook's LLaMA model in C/C++☆11Updated this week
- Official implementation of Half-Quadratic Quantization (HQQ)☆810Updated this week
- ☆22Updated 3 years ago
- Inference of Mamba models in pure C☆188Updated last year
- ☆444Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆811Updated 6 months ago
- Stop messing around with finicky sampling parameters and just use DRµGS!☆349Updated 11 months ago
- A little(lil) Language Model (LM)☆48Updated 3 weeks ago