Training tiny models to prove hard theorems
☆77Mar 5, 2026Updated last month
Alternatives and similar repositories for QED-Nano
Users that are interested in QED-Nano are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation☆39Nov 24, 2025Updated 5 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆38Oct 1, 2025Updated 6 months ago
- PeRL: Parameter-Efficient Reinforcement Learning☆74Apr 21, 2026Updated last week
- Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations"☆58Apr 13, 2026Updated 2 weeks ago
- Load any clip model with a standardized interface☆22Oct 20, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆26Apr 4, 2026Updated 3 weeks ago
- ☆27Mar 10, 2026Updated last month
- ☆78Feb 18, 2026Updated 2 months ago
- Official repository Flash Local Linear Attention☆23Updated this week
- ☆30Apr 1, 2025Updated last year
- ☆35Nov 11, 2025Updated 5 months ago
- Internal utility libraries for Pkl☆16Updated this week
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆65Dec 10, 2025Updated 4 months ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆63Nov 12, 2025Updated 5 months ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 6 months ago
- Simple and Ideal Circuit Simulation☆13Dec 4, 2017Updated 8 years ago
- Multilingual and Multiculture Benchmark and LLM☆33Apr 21, 2026Updated last week
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning☆54Oct 23, 2025Updated 6 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆42Apr 4, 2025Updated last year
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 6 months ago
- ☆100Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- TPU support for the fastai library☆13Apr 15, 2021Updated 5 years ago
- Official Repository of Native Parallel Reasoner☆107Feb 5, 2026Updated 2 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆52Mar 2, 2026Updated last month
- ☆21Dec 3, 2025Updated 4 months ago
- ☆35Apr 21, 2026Updated last week
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture☆26Feb 3, 2026Updated 2 months ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 5 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Visualize any repo or codebase into diagram or animation☆23Oct 14, 2024Updated last year
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!☆34Nov 8, 2025Updated 5 months ago
- ☆33Oct 23, 2025Updated 6 months ago
- Training Proactive and Personalized LLM Agents☆107Jan 20, 2026Updated 3 months ago
- A starter kit for evaluating benchmarks on the 🤗 Hub☆16Apr 8, 2026Updated 2 weeks ago
- 🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…☆16Oct 7, 2024Updated last year
- Code for the main RoboTutor app. Many sound and image assets not included.☆14Nov 5, 2019Updated 6 years ago