Training tiny models to prove hard theorems
☆59 · Mar 5, 2026 · Updated last week
Alternatives and similar repositories for QED-Nano
Users who are interested in QED-Nano compare it to the repositories listed below.
- Embedding Inversion via Conditional Masked Diffusion: recover original text from embedding vectors using parallel denoising. Live demo + … ☆40 · Mar 7, 2026 · Updated last week
- UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation ☆37 · Nov 24, 2025 · Updated 3 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning" ☆37 · Oct 1, 2025 · Updated 5 months ago
- PeRL: Parameter-Efficient Reinforcement Learning ☆73 · Mar 10, 2026 · Updated last week
- Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations" ☆57 · Dec 26, 2025 · Updated 2 months ago
- ☆19 · Oct 2, 2023 · Updated 2 years ago
- Ludic – an LLM-RL library for the era of experience ☆61 · Jan 9, 2026 · Updated 2 months ago
- Load any CLIP model with a standardized interface ☆22 · Oct 20, 2025 · Updated 4 months ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex… ☆23 · Mar 2, 2026 · Updated 2 weeks ago
- ☆27 · Mar 10, 2026 · Updated last week
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi… ☆23 · Oct 1, 2025 · Updated 5 months ago
- Internal utility libraries for Pkl ☆16 · Mar 10, 2026 · Updated last week
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?" ☆64 · Dec 10, 2025 · Updated 3 months ago
- ☆85 · Updated this week
- Powdered Metal: High performance LLM fine-tuning framework for Apple Silicon ☆133 · Updated this week
- MLX Implementation of Recursive Reasoning with Tiny Networks ☆79 · Oct 11, 2025 · Updated 5 months ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs. ☆11 · Mar 1, 2023 · Updated 3 years ago
- ☆30 · Jan 15, 2026 · Updated 2 months ago
- Simple and Ideal Circuit Simulation ☆13 · Dec 4, 2017 · Updated 8 years ago
- Evaluate Transformers from the Hub 🔥 ☆14 · Nov 27, 2023 · Updated 2 years ago
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning ☆52 · Oct 23, 2025 · Updated 4 months ago
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes ☆44 · Aug 7, 2025 · Updated 7 months ago
- Official Implementation for NorMuon paper ☆61 · Mar 11, 2026 · Updated last week
- Official Repository of Native Parallel Reasoner ☆103 · Feb 5, 2026 · Updated last month
- TPU support for the fastai library ☆13 · Apr 15, 2021 · Updated 4 years ago
- ☆21 · Dec 3, 2025 · Updated 3 months ago
- A framework for building provenance-based intrusion detection systems with neural networks ☆84 · Mar 6, 2026 · Updated last week
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library. ☆13 · Jan 9, 2024 · Updated 2 years ago
- KV Cache & LoRA for minGPT ☆59 · Mar 4, 2026 · Updated 2 weeks ago
- The official repository of the first version of ACE-Brain foundation model. ☆62 · Updated this week
- The training codes of Jasper-Token-Compression-600M ☆19 · Nov 19, 2025 · Updated 4 months ago
- ☆12 · Nov 5, 2024 · Updated last year
- Visualize any repo or codebase into diagram or animation ☆22 · Oct 14, 2024 · Updated last year
- Forcing Diffuse Distributions out of Language Models ☆18 · Sep 10, 2024 · Updated last year
- Convert MathML to LaTeX for OneNote to Markdown ☆12 · Jul 27, 2022 · Updated 3 years ago
- Code for the main RoboTutor app. Many sound and image assets not included. ☆14 · Nov 5, 2019 · Updated 6 years ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations ☆18 · Oct 18, 2025 · Updated 5 months ago
- The raw UserRL repo under construction ☆97 · Sep 25, 2025 · Updated 5 months ago
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards ☆20 · Mar 5, 2026 · Updated last week