LLM360 / crystalcoder-trainLinks

Pre-training code for CrystalCoder 7B LLM

☆55

Alternatives and similar repositories for crystalcoder-train

Users that are interested in crystalcoder-train are comparing it to the libraries listed below

Sorting:

LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆91Updated last year
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆105Updated 9 months ago
allenai / CommonGen-Eval
Evaluating LLMs with CommonGen-Lite
☆90Updated last year
xingyaoww / LeTI
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆65Updated 2 years ago
LLM360 / amber-train
Pre-training code for Amber 7B LLM
☆167Updated last year
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated last year
Zyphra / Zyda_processing
☆37Updated last year
arcee-ai / DAM
☆53Updated 8 months ago
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆79Updated last year
TheDuckAI / arb
Advanced Reasoning Benchmark Dataset for LLMs
☆47Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 7 months ago
microsoft / ToolTalk
Evaluating tool-augmented LLMs in conversation settings
☆85Updated last year
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆168Updated last year
dust-tt / llama-ssp
Experiments on speculative sampling with Llama models
☆128Updated 2 years ago
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43Updated last year
Eureka6174 / LearnNLPlan
Learning to Program with Natural Language
☆6Updated last year
NL2Code / CodeM
☆44Updated last year
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆77Updated 9 months ago
togethercomputer / Llama-2-7B-32K-Instruct
☆84Updated last year
VITA-Group / ChainCoder
[ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …
☆40Updated last year
huu4ontocord / MDEL
Multi-Domain Expert Learning
☆67Updated last year
IBM / ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…
☆223Updated last year
bigcode-project / astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆59Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
kernelmachine / cbtm
Code repository for the c-BTM paper
☆107Updated last year
patronus-ai / Lynx-hallucination-detection
☆41Updated last year
emrgnt-cmplxty / zero-shot-replication
☆74Updated last year
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆135Updated last year