arcee-ai / EvolKitLinks

EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models (LLMs).

☆240

Alternatives and similar repositories for EvolKit

Users that are interested in EvolKit are comparing it to the libraries listed below

Sorting:

huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆273Updated last year
microsoft / FILM
Official repo for "Make Your LLM Fully Utilize the Context"
☆259Updated last year
golololologol / LLM-Distillery
A pipeline for LLM knowledge distillation
☆109Updated 6 months ago
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆138Updated 2 years ago
writer / writing-in-the-margins
☆119Updated last year
QuixiAI / spectrum
☆136Updated 2 months ago
Pints-AI / 1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
☆330Updated 6 months ago
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆202Updated last year
davanstrien / awesome-synthetic-datasets
awesome synthetic (text) datasets
☆302Updated 3 months ago
wang-research-lab / agentinstruct
Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"
☆116Updated this week
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆250Updated 11 months ago
swj0419 / detect-pretrain-code-contamination
☆77Updated last year
arcee-ai / PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
☆249Updated last year
xfactlab / orpo
Official repository for ORPO
☆463Updated last year
zai-org / ComplexFuncBench
Complex Function Calling Benchmark.
☆139Updated 9 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆172Updated 9 months ago
dwzhu-pku / PoSE
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆204Updated last year
lm-sys / llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆311Updated last year
ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆252Updated last week
tianyi-lab / Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
☆362Updated last year
jshuadvd / LongRoPE
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
☆150Updated last year
huggingface / cosmopedia
☆544Updated 11 months ago
Leeroo-AI / mergoo
A library for easily merging multiple LLM experts, and efficiently train the merged LLM.
☆494Updated last year
jondurbin / bagel
A bagel, with everything.
☆324Updated last year
booydar / babilong
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆215Updated last month
lamini-ai / Lamini-Memory-Tuning
Banishing LLM Hallucinations Requires Rethinking Generalization
☆275Updated last year
huggingface / data-is-better-together
Let's build better datasets, together!
☆262Updated 10 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆108Updated 7 months ago
PrimeIntellect-ai / genesys
☆135Updated 7 months ago
LLM360 / amber-train
Pre-training code for Amber 7B LLM
☆168Updated last year