macrocosm-os / finetuning
☆10 Updated last week
Alternatives and similar repositories for finetuning:
Users who are interested in finetuning are comparing it to the libraries listed below.
- ☆51 Updated last week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆91 Updated 3 weeks ago
- ☆126 Updated 7 months ago
- ☆114 Updated 10 months ago
- Video+code lecture on building nanoGPT from scratch ☆66 Updated 9 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆39 Updated last month
- look how they massacred my boy ☆63 Updated 5 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆63 Updated 4 months ago
- ☆111 Updated 3 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆32 Updated last month
- Pretraining ☆21 Updated this week
- ☆49 Updated last year
- An introduction to LLM Sampling ☆77 Updated 3 months ago
- Gradio UI for a Cog API ☆66 Updated 11 months ago
- Lego for GRPO ☆25 Updated 2 weeks ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX ☆50 Updated 4 months ago
- ☆66 Updated 10 months ago
- Simple GRPO scripts and configurations. ☆59 Updated last month
- ☆62 Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆42 Updated 10 months ago
- Community ComfyUI workflows running on fal.ai ☆57 Updated 7 months ago
- Focused on fast experimentation and simplicity ☆70 Updated 3 months ago
- Entropy Based Sampling and Parallel CoT Decoding ☆17 Updated 5 months ago
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 6 months ago
- ☆28 Updated last year
- entropix style sampling + GUI ☆25 Updated 5 months ago
- ☆99 Updated 7 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆48 Updated 4 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆45 Updated last year
- ☆52 Updated this week