Ideas for projects related to Tinker
☆171Nov 6, 2025Updated 4 months ago
Alternatives and similar repositories for tinker-project-ideas
Users that are interested in tinker-project-ideas are comparing it to the libraries listed below
Sorting:
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆56Jul 11, 2025Updated 7 months ago
- [NeurIPS 2023] Learning Transformer Programs☆163May 21, 2024Updated last year
- ☆12Mar 7, 2024Updated 2 years ago
- Implementation for the paper "Learning Invariant Representation for Continual Learning" in PyTorch.☆12Jan 31, 2021Updated 5 years ago
- ☆136Jan 26, 2026Updated last month
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆64Nov 27, 2024Updated last year
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆50Dec 25, 2025Updated 2 months ago
- Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows"☆38Nov 10, 2025Updated 3 months ago
- Official PyTorch implementation of "Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming" (ICML'23)☆13Jul 11, 2024Updated last year
- ☆19Oct 2, 2023Updated 2 years ago
- Class Incremental learning, Task Incremental Learning☆17Dec 19, 2022Updated 3 years ago
- ☆42Feb 12, 2026Updated 3 weeks ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Feb 27, 2025Updated last year
- ☆105Dec 5, 2025Updated 3 months ago
- ☆27Sep 22, 2025Updated 5 months ago
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions☆25Aug 8, 2024Updated last year
- A high-performance reinforcement learning library in jax specialized for robotic learning☆22Sep 4, 2023Updated 2 years ago
- Code for NeurIPS 2020 Paper --- Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks☆21Oct 24, 2022Updated 3 years ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆103Mar 15, 2023Updated 2 years ago
- [ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution☆235Feb 28, 2026Updated last week
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆585Aug 10, 2025Updated 6 months ago
- Measuring how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours☆166Updated this week
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆45Nov 6, 2025Updated 4 months ago
- Sandbox environment for generalizable agent research☆27Aug 19, 2022Updated 3 years ago
- Open source machine learning for graph-structured data☆30May 9, 2019Updated 6 years ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆71Jun 19, 2024Updated last year
- A set of examples based on verl for end-to-end RL training recipes.☆194Mar 2, 2026Updated last week
- Security gateway for AI agents - credential-isolated API proxying and policy-gated remote execution (conclaves). Reduce the blast radius!☆109Feb 27, 2026Updated last week
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆78Aug 17, 2024Updated last year
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆373Feb 26, 2026Updated last week
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆22Feb 11, 2026Updated 3 weeks ago
- A Claude Code plugin that solves the same problems as community frameworks (GSD, BMAD, Ralph, Agent OS) — but using the tool's native arc…☆28Mar 1, 2026Updated last week
- A collection of awesome think with videos papers.☆91Dec 1, 2025Updated 3 months ago
- ☆56Apr 11, 2024Updated last year
- [CVPR 2026] Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆84Feb 13, 2026Updated 3 weeks ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆248Jul 13, 2025Updated 7 months ago
- P1: Mastering Physics Olympiads with Reinforcement Learning☆76Dec 29, 2025Updated 2 months ago
- The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".☆85Oct 31, 2022Updated 3 years ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 4 months ago