emrgnt-cmplxty / SmolTrainer
☆20 Updated last year
Alternatives and similar repositories for SmolTrainer:
Users who are interested in SmolTrainer are comparing it to the libraries listed below.
- ☆48 Updated 3 months ago
- Just a bunch of benchmark logs for different LLMs ☆119 Updated 6 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆82 Updated last year
- ☆74 Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆21 Updated 2 months ago
- ☆48 Updated last year
- Pre-training code for CrystalCoder 7B LLM ☆55 Updated 9 months ago
- ☆22 Updated last year
- QLoRA with Enhanced Multi GPU Support ☆36 Updated last year
- ☆38 Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆26 Updated 11 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated 8 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆64 Updated 3 months ago
- ☆52 Updated 8 months ago
- entropix style sampling + GUI ☆25 Updated 3 months ago
- Let's create synthetic textbooks together :) ☆73 Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆21 Updated last month
- QLoRA: Efficient Finetuning of Quantized LLMs ☆77 Updated 10 months ago
- Routing on Random Forest (RoRF) ☆112 Updated 4 months ago
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated 9 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆42 Updated last year
- The first dense retrieval model that can be prompted like an LM ☆64 Updated 4 months ago
- Evaluating LLMs with CommonGen-Lite ☆88 Updated 10 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆100 Updated 2 months ago
- Chat Markup Language conversation library ☆55 Updated last year
- ☆65 Updated 8 months ago