evintunador / gpt-labLinks
cheap & easy LLM experiments for amateurs (alpha)
β25Updated last month
Alternatives and similar repositories for gpt-lab
Users that are interested in gpt-lab are comparing it to the libraries listed below
Sorting:
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuningβ37Updated 8 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" π€β76Updated last year
- Luth is a state-of-the-art series of fine-tuned LLMs for Frenchβ41Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optunaβ59Updated 3 months ago
- β105Updated 10 months ago
- Simple GRPO scripts and configurations.β59Updated 11 months ago
- Train your own SOTA deductive reasoning modelβ107Updated 10 months ago
- Collection of resources for RL and Reasoningβ27Updated 11 months ago
- β38Updated 5 months ago
- β68Updated 8 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.β112Updated 9 months ago
- β53Updated 11 months ago
- Train transformer language models with reinforcement learning.β19Updated 11 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β83Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.β67Updated this week
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ114Updated 9 months ago
- Build a Recommendation System Agent using LATS Agent Approachβ33Updated 11 months ago
- Learn the building blocks of how to build gpt-oss from scratchβ110Updated 4 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.β41Updated 9 months ago
- One click away from a locally downloaded, fine-tuned model, hosted on hugging face, with inference built in. In two hours.β23Updated 2 months ago
- Low memory full parameter finetuning of LLMsβ53Updated 6 months ago
- Streamline on-policy/off-policy distillation workflows in a few lines of codeβ93Updated this week
- The code repository of the paper: Competition and Attraction Improve Model Fusionβ168Updated 5 months ago
- β45Updated 8 months ago
- β26Updated last year
- β80Updated last year
- Simple repository for training small reasoning modelsβ48Updated 11 months ago
- π¦Ύπ»π distributed training & serverless inference at scale on RunPodβ19Updated last year
- Set of scripts to finetune LLMsβ38Updated last year
- Train LLM on Hugging Face infraβ67Updated 2 months ago