macrocosm-os / finetuning
☆11 Updated 8 months ago
Alternatives and similar repositories for finetuning
Users who are interested in finetuning are comparing it to the libraries listed below.
- ☆55 Updated 5 months ago
- High-throughput tensor loading for PyTorch ☆221 Updated 3 weeks ago
- Video+code lecture on building nanoGPT from scratch ☆68 Updated last year
- ☆122 Updated last year
- ☆119 Updated last year
- An open-source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆110 Updated 11 months ago
- A pipeline parallel training script for LLMs. ☆166 Updated 9 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆34 Updated 11 months ago
- Gradio UI for a Cog API ☆70 Updated last year
- ☆27 Updated last year
- ☆68 Updated last year
- Testing LLM reasoning abilities with family relationship quizzes. ☆63 Updated last year
- ☆101 Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆61 Updated last year
- ☆137 Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 Updated 3 months ago
- An introduction to LLM Sampling ☆79 Updated last year
- Lightweight toolkit package to train and fine-tune 1.58-bit language models ☆112 Updated 8 months ago
- Recaption large (Web)Datasets with vLLM and save the artifacts. ☆52 Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated last year
- Implementation of https://arxiv.org/pdf/2312.09299 ☆21 Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆150 Updated last month
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆41 Updated last year
- ☆52 Updated 2 years ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆85 Updated 5 months ago
- ☆63 Updated 7 months ago
- BH hackathon ☆14 Updated last year
- ☆63 Updated last year
- Distributed inference for MLX LLMs ☆100 Updated last year
- Efficient non-uniform quantization with GPTQ for GGUF ☆58 Updated 4 months ago