thinking-machines-lab/tinker-project-ideas

thinking-machines-lab / tinker-project-ideas

Ideas for projects related to Tinker

☆191

Alternatives and similar repositories for tinker-project-ideas

Users who are interested in tinker-project-ideas are comparing it to the libraries listed below.

MaoZiming / papers
Paper-reading notes for Berkeley OS prelim exam.
☆14 · Aug 28, 2024 · Updated last year
thinking-machines-lab / tinker-cookbook
Post-training with Tinker
☆3,869 · Updated this week
thinking-machines-lab / tinker
Training API and CLI
☆637 · Updated this week
MoonshotAI / WorldVQA
☆119 · Feb 4, 2026 · Updated 5 months ago
Infini-AI-Lab / vortex_torch
Vortex: Programmable Sparse Attention for Agents as Algorithm Designers
☆68 · Jun 24, 2026 · Updated 3 weeks ago
Zhiyuan-Zeng / RLVE
[ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
☆225 · Apr 30, 2026 · Updated 2 months ago
wenzhe-li / Self-MoA
☆17 · Feb 4, 2025 · Updated last year
open-tinker / OpenTinker
OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆676 · Mar 21, 2026 · Updated 4 months ago
data-for-agents / insta
Official Repo for InSTA: Towards Internet-Scale Training For Agents
☆56 · Jul 11, 2025 · Updated last year
axon-rl / gem
A Gym for Agentic LLMs
☆502 · Jan 21, 2026 · Updated 6 months ago
GAIR-NLP / BeHonest
BeHonest: Benchmarking Honesty in Large Language Models
☆35 · Aug 15, 2024 · Updated last year
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,085 · Updated this week
hkust-nlp / Toolathlon
[ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
☆432 · Updated this week
maiush / OpenCharacterTraining
Open Character Training
☆92 · Apr 4, 2026 · Updated 3 months ago
klei30 / tuner-ui
An open-source research interface for interactive LLM tuning workflows with Tinker: dataset registration, recipe selection, run monitorin…
☆33 · Jun 18, 2026 · Updated last month
radixark / miles
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
☆1,761 · Updated this week
open-thought / reasoning-gym
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
☆1,463 · Apr 17, 2026 · Updated 3 months ago
ars22 / e3
☆20 · Sep 16, 2025 · Updated 10 months ago
mert-cemri / autoevolve
☆24 · Dec 6, 2025 · Updated 7 months ago
gso-bench / gso
[NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents
☆87 · Jul 12, 2026 · Updated last week
feyzaakyurek / rl4f
Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.
☆63 · Nov 27, 2024 · Updated last year
thinking-machines-lab / batch_invariant_ops
☆1,046 · Nov 4, 2025 · Updated 8 months ago
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189 · Jul 23, 2025 · Updated 11 months ago
sdan / continualcode
pip install continualcode
☆43 · Feb 10, 2026 · Updated 5 months ago
hao-ai-lab / DistCA
Efficient Long-context Language Model Training by Core Attention Disaggregation
☆106 · Apr 7, 2026 · Updated 3 months ago
aisa-group / PostTrainBench
Measuring how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours
☆463 · Updated this week
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆7,569 · Updated this week
GMISWE / tinker-cloud
Tinkering RL
☆26 · Jul 15, 2026 · Updated last week
sail-sg / odc
On demand communication
☆34 · Apr 16, 2026 · Updated 3 months ago
Infini-AI-Lab / Kinetics
Kinetics: Rethinking Test-Time Scaling Laws
☆87 · Jul 11, 2025 · Updated last year
Timothyxxx / TestTimeTrainingPapers
☆59 · Apr 13, 2026 · Updated 3 months ago
MoonshotAI / Kimina-Prover-Preview
Technical report of Kimina-Prover Preview.
☆373 · Jul 10, 2025 · Updated last year
yaof20 / Flash-RL
Implementation for FP8/INT8 Rollout for RL training without performance drop.
☆306 · Nov 7, 2025 · Updated 8 months ago
facebookresearch / meta-agents-research-environments
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…
☆528 · Jun 20, 2026 · Updated last month
Trinity-data-store / Trinity
EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"
☆18 · Mar 8, 2025 · Updated last year
DAMO-NLP-SG / RemeMo
[EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning
☆17 · Oct 31, 2023 · Updated 2 years ago
Terra-Flux / PolyRL
[NSDI'26] PolyRL is a reinforcement learning framework for LLMs that harvests spot instances on the cloud to reduce cost.
☆19 · Mar 30, 2026 · Updated 3 months ago
princeton-nlp / PTP
Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073
☆32 · Jul 9, 2024 · Updated 2 years ago
SakanaAI / fast-weight-product-key-memory
Code for Fast-weight Product Key Memory (FwPKM)
☆19 · Mar 18, 2026 · Updated 4 months ago