LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆76Updated 4 months ago
Related projects: ⓘ
- Evaluation and analysis code for LLM360☆75Updated 3 months ago
- Data preparation code for CrystalCoder 7B LLM☆42Updated 4 months ago
- Pre-training code for Amber 7B LLM☆148Updated 4 months ago
- Evaluating LLMs with CommonGen-Lite☆83Updated 5 months ago
- A pipeline for LLM knowledge distillation☆68Updated last month
- Pre-training code for CrystalCoder 7B LLM☆52Updated 4 months ago
- Small and Efficient Mathematical Reasoning LLMs☆69Updated 7 months ago
- Low-Rank adapter extraction for fine-tuned transformers model☆154Updated 4 months ago
- Expert Specialized Fine-Tuning☆129Updated last month
- ☆75Updated 3 weeks ago
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- ☆77Updated 3 weeks ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆123Updated 6 months ago
- Code repository for the c-BTM paper☆105Updated 11 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆117Updated 8 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆195Updated 3 months ago
- ☆73Updated 8 months ago
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆68Updated 2 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆217Updated 2 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆158Updated 2 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆72Updated 8 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆107Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users☆182Updated last month
- Codebase accompanying the Summary of a Haystack paper.☆65Updated 2 months ago
- A simple unified framework for evaluating LLMs☆121Updated this week
- ☆92Updated last year
- ☆35Updated last year
- experiments with inference on llama☆106Updated 3 months ago
- FuseAI Project☆75Updated 3 weeks ago