LLM360 / crystalcoder-data-prep

Data preparation code for CrystalCoder 7B LLM

☆45

Alternatives and similar repositories for crystalcoder-data-prep

Users who are interested in crystalcoder-data-prep compare it to the libraries listed below.

LLM360 / crystalcoder-train
Pre-training code for CrystalCoder 7B LLM
☆55 Updated last year
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆92 Updated last year
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆107 Updated last year
arcee-ai / DAM
☆55 Updated 11 months ago
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆39 Updated 11 months ago
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆72 Updated last year
Zyphra / Zyda_processing
☆39 Updated last year
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆80 Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.
☆35 Updated last year
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆115 Updated 8 months ago
18907305772 / FuseAI
FuseAI Project
☆87 Updated 8 months ago
TRI-ML / linear_open_lm
A repository for research on medium-sized language models.
☆78 Updated last year
allenai / CommonGen-Eval
Evaluating LLMs with CommonGen-Lite
☆91 Updated last year
du-nlp-lab / MLR-Copilot
☆67 Updated 6 months ago
dinobby / MAgICoRE
☆23 Updated last year
Tebmer / Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…
☆27 Updated 10 months ago
laramohan / wikillm
LLMs as Collaboratively Edited Knowledge Bases
☆45 Updated last year
VikParuchuri / classified
Score LLM pretraining data with classifiers
☆54 Updated last year
Zoeyyao27 / SirLLM
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆60 Updated last year
TheDuckAI / arb
Advanced Reasoning Benchmark Dataset for LLMs
☆46 Updated last year
PootieT / explain-then-translate
Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…
☆29 Updated last year
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22 Updated 10 months ago
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆76 Updated 6 months ago
bigcode-project / astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆62 Updated last year
nyunAI / PruneGPT
☆51 Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆109 Updated 10 months ago
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆44 Updated last year
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆56 Updated last week
thomasgauthier / LLM-self-play
Minimal implementation of the "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" paper (arXiv 2401.01335)
☆29 Updated last year
agokrani / distillKitPlus
Easy-to-use, high-performance knowledge distillation for LLMs
☆94 Updated 5 months ago