SakanaAI / CycleQDLinks

CycleQD is a framework for parameter space model merging.

☆44

Alternatives and similar repositories for CycleQD

Users that are interested in CycleQD are comparing it to the libraries listed below

Sorting:

luchris429 / DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆63Updated last year
SakanaAI / TAID
Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"
☆118Updated 3 weeks ago
Ino-Ichan / GIT-LLM
☆22Updated 2 years ago
Aratako / Task-Vector-Merge-Optimzier
☆14Updated last year
iwiwi / epochraft
Checkpointable dataset utilities for foundation model training
☆31Updated last year
kotoba-tech / kotomamba
Mamba training library developed by kotoba technologies
☆69Updated last year
foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆135Updated 4 months ago
ryokamoi / llm-self-correction-papers
List of papers on Self-Correction of LLMs.
☆80Updated 10 months ago
SakanaAI / ALE-Bench
The official repository of ALE-Bench
☆121Updated this week
swallow-llm / swallow-evaluation-instruct
Swallowプロジェクト事後学習済み大規模言語モデル評価フレームワーク
☆21Updated 2 weeks ago
okoge-kaz / llm-recipes
Ongoing Research Project for continaual pre-training LLM(dense mode)
☆42Updated 8 months ago
iwiwi / epochraft-hf-fsdp
Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP
☆11Updated last year
SakanaAI / Sudoku-Bench
An AI benchmark for creative, human-like problem solving using Sudoku variants
☆105Updated 3 months ago
kiddyboots216 / lottery-ticket-adaptation
Lottery Ticket Adaptation
☆40Updated 11 months ago
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
jdeschena / sdtt
[ICLR 2025] SDTT: a simple and effective distillation method for discrete diffusion models
☆41Updated last month
facebookresearch / MemoryMosaics
Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.
☆48Updated 9 months ago
JacksonCakes / vision-r1
☆12Updated 7 months ago
turingmotors / vlm-recipes
☆20Updated last year
Qualcomm-AI-research / codeit
☆27Updated last year
analokmaus / kaggle-aimo2-fast-math-r1
Kaggle AIMO2 solution with token-efficient reasoning LLM recipes
☆37Updated 2 months ago
shenao-zhang / BARL
Bayes-Adaptive RL for LLM Reasoning
☆40Updated 5 months ago
kyegomez / EvoVLM-JP
Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI
☆29Updated 11 months ago
CLAIRE-Labo / EvoTune
Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.
☆116Updated last week
llm-jp / llm-jp-sft
☆61Updated last year
kotoba-tech / kotoba-recipes
Support Continual pre-training & Instruction Tuning forked from llama-recipes
☆33Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated last year
epfml / schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆84Updated last year
flowersteam / LLM-Culture
Code for the "Cultural evolution in populations of Large Language Models" paper
☆31Updated last year
okoge-kaz / moe-recipes
Ongoing research training Mixture of Expert models.
☆21Updated last year