LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆84 Updated 8 months ago
Alternatives and similar repositories for amber-data-prep:
Users who are interested in amber-data-prep are comparing it to the libraries listed below.
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated 8 months ago
- Pre-training code for Amber 7B LLM ☆160 Updated 8 months ago
- Pre-training code for CrystalCoder 7B LLM ☆55 Updated 8 months ago
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs. ☆80 Updated last week
- Open Implementations of LLM Analyses ☆98 Updated 3 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM). ☆157 Updated 3 weeks ago
- Code repository for the c-BTM paper ☆105 Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters ☆249 Updated 6 months ago
- FuseAI Project ☆80 Updated this week
- Just a bunch of benchmark logs for different LLMs ☆117 Updated 6 months ago
- ☆31 Updated 7 months ago
- Experiments on speculative sampling with Llama models ☆123 Updated last year
- A pipeline for LLM knowledge distillation ☆84 Updated this week
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆34 Updated 9 months ago
- This is the official repository for Inheritune. ☆109 Updated 3 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆76 Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆66 Updated 3 months ago
- Train, tune, and infer Bamba model ☆80 Updated 2 weeks ago
- Evaluating LLMs with CommonGen-Lite ☆88 Updated 10 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆98 Updated 4 months ago
- ☆110 Updated 4 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated last year
- experiments with inference on llama ☆104 Updated 7 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline" ☆114 Updated 7 months ago
- My fork of Allen AI's OLMo for educational purposes. ☆30 Updated last month
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆131 Updated 3 months ago
- ☆47 Updated 5 months ago
- ☆74 Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆82 Updated last year