Compiling useful links, papers, benchmarks, ideas, etc.
☆46Mar 16, 2025Updated last year
Alternatives and similar repositories for resources
Users that are interested in resources are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- simple grpo☆12May 28, 2025Updated 9 months ago
- A reading list of relevant papers and projects on foundation model annotation☆28Feb 27, 2025Updated last year
- Benchmark structured generation libraries☆31Oct 25, 2024Updated last year
- Our library for RL environments + evals☆3,918Updated this week
- Build your own visual reasoning model☆419Jan 13, 2026Updated 2 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆69May 5, 2025Updated 10 months ago
- ☆48Aug 29, 2024Updated last year
- a simple variational auto encoder with some exploration☆12Nov 22, 2024Updated last year
- No code solution for training tabular models☆35Jan 25, 2026Updated 2 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Aug 20, 2025Updated 7 months ago
- The dataset consists of public social media url pairs and the corresponding entailment label for an external conference (ACL 2021). Each …☆14Aug 16, 2021Updated 4 years ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- A lightweight, user-friendly data-plane for LLM training.☆38Sep 10, 2025Updated 6 months ago
- A minimalist Docker project to help people getting started with Node, WizardCoder, CTransformers, Python, Express and TypeScript. Ready t…☆14Jun 23, 2023Updated 2 years ago
- Multiple datasets for ARC (Abstraction and Reasoning Corpus)☆86Mar 28, 2025Updated 11 months ago
- ☆38Feb 18, 2025Updated last year
- Async RL Training at Scale☆1,176Updated this week
- Structured output benchmarks comparing DSPy and BAML with different LLMs☆27Dec 23, 2025Updated 3 months ago
- ☆12Jul 12, 2021Updated 4 years ago
- A JupyterLite deployment to try JupyterLab, Jupyter Notebook and IPython in the browser☆13Jan 14, 2026Updated 2 months ago
- Cases for UT47.2☆10Mar 13, 2022Updated 4 years ago
- xcb wm☆21Aug 21, 2020Updated 5 years ago
- Use Hermes-2-Pro-Mistral-7B function calling with your OpenAI API compatible code.☆18May 7, 2024Updated last year
- Simple repository for training small reasoning models☆49Feb 17, 2026Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 5 months ago
- A fork of sqlite-utils with CLI etc removed☆17Jan 29, 2026Updated last month
- A fun PGM experience☆15May 19, 2025Updated 10 months ago
- Structured Generation Evals☆14Sep 25, 2024Updated last year
- Open source target discovery for cancer.☆26Mar 23, 2025Updated last year
- Explore training for quantized models☆26Jul 12, 2025Updated 8 months ago
- Flash Attention in 300-500 lines of CUDA/C++☆36Aug 22, 2025Updated 7 months ago
- Cray-LM unified training and inference stack.☆22Jan 30, 2025Updated last year
- ☆20May 2, 2025Updated 10 months ago
- Exploring Applications of GRPO☆252Aug 25, 2025Updated 7 months ago
- An 38 key orthogonal Keyboard with 3d printable case☆12Mar 17, 2024Updated 2 years ago
- Minimal example of MCP for parsing llms.txt☆40Apr 8, 2025Updated 11 months ago
- gpt-3.5-turbo-instruct, prompted with PGN, vs Stockfish Level 4 on LiChess☆15Sep 19, 2023Updated 2 years ago
- Sequential Monte Carlo sampler for PyMC2 models.☆13Apr 4, 2018Updated 7 years ago
- A full fledged mistral+wandb☆13Aug 16, 2024Updated last year