SonicCodes / subcloningLinks

implementation of https://arxiv.org/pdf/2312.09299

☆21

Alternatives and similar repositories for subcloning

Users that are interested in subcloning are comparing it to the libraries listed below

Sorting:

catid / lllm
Latent Large Language Models
☆18Updated 11 months ago
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13Updated last year
CERC-AAI / Robin
☆63Updated 10 months ago
cloneofsimo / project_RF
☆24Updated last year
euclaise / supertrainer2000
☆49Updated last year
cg123 / bitnet
Modeling code for a BitNet b1.58 Llama-style model.
☆25Updated last year
cloneofsimo / repa-rf
☆32Updated 9 months ago
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
zarakiquemparte / zaraki-tools
☆27Updated last year
fal-ai-community / llmdifftracker
Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)
☆33Updated 5 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 6 months ago
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆46Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago
zaydzuhri / flame
Fork of Flame repo for training of some new stuff in development
☆14Updated 2 weeks ago
Zyphra / zcookbook
Training hybrid models for dummies.
☆25Updated 6 months ago
reka-ai / rekaquant
☆58Updated 3 weeks ago
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆56Updated 11 months ago
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated 3 months ago
sekstini / basedxl
☆18Updated last year
1rgs / tokenwiz
A clone of OpenAI's Tokenizer page for HuggingFace Models
☆45Updated last year
kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆31Updated this week
cloneofsimo / infinite-fractal-stream
☆30Updated 9 months ago
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 10 months ago
cloneofsimo / zeroshampoo
☆34Updated 10 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 3 months ago
facebookresearch / DIG-In
This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.
☆20Updated last year
katzurik / Knowledge_Navigator
☆20Updated 5 months ago
diicellman / dynamite-dogs
BH hackathon
☆14Updated last year
Zyphra / Zyda_processing
☆37Updated last year
kubernetes-bad / reward-composer
Lego for GRPO
☆28Updated 2 months ago