LLM360 / crystalcoder-train
Pre-training code for CrystalCoder 7B LLM
☆54 Updated last year
Alternatives and similar repositories for crystalcoder-train
Users who are interested in crystalcoder-train are comparing it to the repositories listed below.
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated last year
- Data preparation code for Amber 7B LLM ☆90 Updated last year
- Open Implementations of LLM Analyses ☆103 Updated 7 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆65 Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- Pre-training code for Amber 7B LLM ☆166 Updated last year
- ☆34 Updated 11 months ago
- Advanced Reasoning Benchmark Dataset for LLMs ☆45 Updated last year
- RepoQA: Evaluating Long-Context Code Understanding ☆108 Updated 7 months ago
- ☆49 Updated 6 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆69 Updated 2 weeks ago
- Code repository for the c-BTM paper ☆106 Updated last year
- ☆38 Updated 10 months ago
- ☆75 Updated 2 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆110 Updated 8 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆78 Updated last year
- evol augment any dataset online ☆59 Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models ☆58 Updated last year
- The official repo for "LLoCo: Learning Long Contexts Offline" ☆116 Updated 11 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 Updated 9 months ago
- Small, simple agent task environments for training and evaluation ☆18 Updated 7 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset ☆53 Updated 6 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆123 Updated last year
- Repository for analysis and experiments in the BigCode project. ☆118 Updated last year
- Evaluating LLMs with CommonGen-Lite ☆90 Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆36 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆103 Updated 5 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated last year
- Verifiers for LLM Reinforcement Learning ☆55 Updated last month