Compiling useful links, papers, benchmarks, ideas, etc.
☆46Mar 16, 2025Updated last year
Alternatives and similar repositories for resources
Users that are interested in resources are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- simple grpo☆12May 28, 2025Updated 10 months ago
- A reading list of relevant papers and projects on foundation model annotation☆28Feb 27, 2025Updated last year
- Benchmark structured generation libraries☆31Oct 25, 2024Updated last year
- Our library for RL environments + evals☆3,986Updated this week
- Build your own visual reasoning model☆421Jan 13, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆121Apr 7, 2026Updated last week
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆70May 5, 2025Updated 11 months ago
- ☆48Aug 29, 2024Updated last year
- Verifiers for LLM Reinforcement Learning☆79Apr 15, 2025Updated 11 months ago
- could we make an ml stack in 100,000 lines of code?☆46Jul 17, 2024Updated last year
- a simple variational auto encoder with some exploration☆12Nov 22, 2024Updated last year
- No code solution for training tabular models☆35Jan 25, 2026Updated 2 months ago
- Testing paligemma2 finetuning on reasoning dataset☆18Dec 28, 2024Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Aug 20, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The dataset consists of public social media url pairs and the corresponding entailment label for an external conference (ACL 2021). Each …☆14Aug 16, 2021Updated 4 years ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- Automated Design of Agentic Systems☆10Sep 7, 2024Updated last year
- A lightweight, user-friendly data-plane for LLM training.☆38Sep 10, 2025Updated 7 months ago
- ☆38Feb 18, 2025Updated last year
- Structured output benchmarks comparing DSPy and BAML with different LLMs☆28Dec 23, 2025Updated 3 months ago
- Agentic RL Training at Scale☆1,292Updated this week
- A JupyterLite deployment to try JupyterLab, Jupyter Notebook and IPython in the browser☆13Jan 14, 2026Updated 3 months ago
- A Collection of 88x31px GIFs☆14Jun 21, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- inductive reasoning benchmark with subregular hierarchy for string-to-string transformation☆19Jun 27, 2025Updated 9 months ago
- Collection of resources for RL and Reasoning☆27Feb 3, 2025Updated last year
- Just some nice dice in Python☆21Jan 6, 2026Updated 3 months ago
- Use Hermes-2-Pro-Mistral-7B function calling with your OpenAI API compatible code.☆18May 7, 2024Updated last year
- Simple repository for training small reasoning models☆49Feb 17, 2026Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆60Oct 18, 2025Updated 5 months ago
- Code for☆28Dec 16, 2024Updated last year
- A fork of sqlite-utils with CLI etc removed☆17Apr 6, 2026Updated last week
- ☆23Aug 2, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A small collection of terminal colorschemes.☆13Dec 18, 2025Updated 3 months ago
- Structured Generation Evals☆14Sep 25, 2024Updated last year
- A fun PGM experience☆15May 19, 2025Updated 10 months ago
- ☆42Mar 11, 2026Updated last month
- Example code to create high-quality knowledge graphs using entity resolution with Kuzu and Senzing☆24Sep 17, 2025Updated 6 months ago
- Flash Attention in 300-500 lines of CUDA/C++☆36Aug 22, 2025Updated 7 months ago
- Explore training for quantized models☆26Jul 12, 2025Updated 9 months ago