reka-ai / rekaquantLinks

☆62

Alternatives and similar repositories for rekaquant

Users that are interested in rekaquant are comparing it to the libraries listed below

Sorting:

s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆59Updated last month
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆27Updated last year
IST-DASLab / gptq-gguf-toolkit
Efficient non-uniform quantization with GPTQ for GGUF
☆53Updated 2 months ago
attashe / ModifiedBeamSampler
Modified Beam Search with periodical restart
☆12Updated last year
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆99Updated 6 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆30Updated 6 months ago
QuixiAI / kraken
☆67Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 9 months ago
arcee-ai / DAM
☆55Updated last year
cg123 / bitnet
Modeling code for a BitNet b1.58 Llama-style model.
☆25Updated last year
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆96Updated 6 months ago
jukofyork / transplant-vocab
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆46Updated last month
cloneofsimo / auto_llm_codebase_analysis
☆26Updated last year
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆30Updated 11 months ago
kurakurai / Luth
Luth is a state-of-the-art series of fine-tuned LLMs for French
☆40Updated last month
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆66Updated last year
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆146Updated 9 months ago
OpenPipe / rl-experiments
OpenPipe Reinforcement Learning Experiments
☆32Updated 8 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆111Updated 7 months ago
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
nyunAI / PruneGPT
☆51Updated last year
QuixiAI / dolphin-utils
☆15Updated 4 months ago
slashml / awesome-finetuning
☆30Updated last year
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆103Updated 11 months ago
codelion / pts
Pivotal Token Search
☆131Updated 4 months ago
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆21Updated 6 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆52Updated 9 months ago
Zyphra / Zyda_processing
☆39Updated last year
collinear-ai / spider
Streamline on-policy/off-policy distillation workflows in a few lines of code
☆65Updated last week
severian42 / Computational-Model-for-Symbolic-Representations
Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …
☆54Updated 9 months ago