evintunador / gpt-lab
cheap & easy LLM experiments for amateurs (alpha)
★24 · Updated last week
Alternatives and similar repositories for gpt-lab
Users interested in gpt-lab are comparing it to the libraries listed below.
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ★36 · Updated 4 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" ★75 · Updated 10 months ago
- ★35 · Updated 2 months ago
- ★95 · Updated 6 months ago
- Simple GRPO scripts and configurations. ★59 · Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ★55 · Updated 8 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French ★33 · Updated last week
- Simple repository for training small reasoning models ★40 · Updated 8 months ago
- A framework for fine-tuning retrieval-augmented generation (RAG) systems. ★130 · Updated this week
- ★49 · Updated 7 months ago
- Low memory full parameter finetuning of LLMs ★53 · Updated 2 months ago
- Train your own SOTA deductive reasoning model ★107 · Updated 7 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ★78 · Updated 11 months ago
- Build Agentic workflows with function calling using open LLMs ★28 · Updated last month
- Truly flash implementation of DeBERTa disentangled attention mechanism. ★65 · Updated last week
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ★91 · Updated 8 months ago
- Build a Recommendation System Agent using LATS Agent Approach ★33 · Updated 7 months ago
- ★80 · Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ★49 · Updated last year
- ★68 · Updated 4 months ago
- Train LLM on Hugging Face infra ★55 · Updated 3 weeks ago
- ★25 · Updated 4 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond) ★128 · Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers ★72 · Updated 5 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ★111 · Updated 5 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think ★56 · Updated last week
- ★24 · Updated last year
- An alternative way to calculate self-attention ★18 · Updated last year
- Writing Blog Posts with Generative Feedback Loops! ★50 · Updated last year
- An introduction to LLM Sampling ★79 · Updated 9 months ago