Claw-R1: Empowering OpenClaw with Advanced Agentic RL.
☆187Jun 9, 2026Updated last week
Alternatives and similar repositories for Claw-R1
Users that are interested in Claw-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Nov 13, 2025Updated 7 months ago
- ☆25Nov 11, 2025Updated 7 months ago
- [ICML 2025] Official PyTorch implementation of the paper: 🎯 TimeDART: A Diffusion Autoregressive Transformer for Self-Supervised Time S…☆152Dec 17, 2025Updated 6 months ago
- ☆22Dec 12, 2024Updated last year
- ☆39Nov 20, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding☆49Feb 28, 2026Updated 3 months ago
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆29Nov 11, 2025Updated 7 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,470Jun 10, 2026Updated last week
- ☆26Sep 16, 2025Updated 9 months ago
- ☆58Jan 19, 2025Updated last year
- ☆35Mar 23, 2026Updated 2 months ago
- SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving☆69Feb 28, 2026Updated 3 months ago
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆32Jul 16, 2025Updated 11 months ago
- [ACL 2026] Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration☆24Apr 11, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆391Mar 30, 2026Updated 2 months ago
- ☆24Oct 13, 2024Updated last year
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re…☆40Sep 22, 2024Updated last year
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- ☆18Mar 3, 2025Updated last year
- 📊 A simple command-line utility for querying and monitoring GPU status☆14Aug 3, 2023Updated 2 years ago
- [ICLR2026] The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆31Oct 14, 2025Updated 8 months ago
- ☆11Oct 2, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Revisiting Character-level Adversarial Attacks for Language Models, ICML 2024☆19Feb 12, 2025Updated last year
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆62May 9, 2023Updated 3 years ago
- ☆17Apr 28, 2022Updated 4 years ago
- 🎉 TrustJudge is accepted to ICLR 2026!☆47Sep 27, 2025Updated 8 months ago
- ☆89Mar 19, 2025Updated last year
- A Mac app used to convert MP3 tags.☆14May 19, 2014Updated 12 years ago
- ☆14Apr 1, 2024Updated 2 years ago
- The source code for "Improving Knowledge-aware Recommendation with Multi-level Interactive Contrastive Learning".☆30Aug 25, 2022Updated 3 years ago
- ArxivDaily☆13Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Feb 12, 2024Updated 2 years ago
- 从零快速使用Ubuntu,搭建深度学习环境,持续更新中☆12Apr 18, 2023Updated 3 years ago
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding☆20Nov 16, 2022Updated 3 years ago
- [CVPR 2026] HiconAgent: History Context-aware Policy Optimization for GUI Agents☆30Mar 9, 2026Updated 3 months ago
- Course project. A implementation of Graph Wavelet Neural Network (ICLR 2019)☆11Jan 6, 2020Updated 6 years ago
- ☆13May 26, 2025Updated last year
- [ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1…☆18Jul 22, 2025Updated 10 months ago