Official code for "Self-Distilled Agentic Reinforcement Learning"
☆77May 15, 2026Updated this week
Alternatives and similar repositories for SDAR
Users that are interested in SDAR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"☆265Updated this week
- Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning☆74Apr 20, 2026Updated 3 weeks ago
- MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models☆43Jan 28, 2026Updated 3 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 2 months ago
- ☆37Oct 9, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 6 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆48Oct 20, 2025Updated 6 months ago
- ☆32Aug 11, 2025Updated 9 months ago
- Recursive Self-Aggregation evals on ARC-AGI☆36Jan 26, 2026Updated 3 months ago
- FPGA Low latency 10GBASE-R PCS☆13May 23, 2023Updated 2 years ago
- ☆19Aug 23, 2025Updated 8 months ago
- Official Implementation for Optimus-3: Dual-Router Aligned Mixture-of-Experts Agent with Dual-Granularity Reasoning-Aware Policy Optimiza…☆64Apr 14, 2026Updated last month
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆40Sep 30, 2025Updated 7 months ago
- [CVPR 2023] Better “CMOS” Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution☆10Mar 19, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- WBSR: Rethinking Imbalance in Image Super-Resolution for Efficient Inference☆13Oct 8, 2024Updated last year
- Does Diffusion Beat GAN in Image Super Resolution?☆12May 27, 2024Updated last year
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year
- ☆40Mar 26, 2026Updated last month
- Benchmarking agent reasoning capabilities in physical interactions, tool usage, and multi-agent coordination.☆45Aug 10, 2025Updated 9 months ago
- Official code for "KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation"☆67May 4, 2026Updated 2 weeks ago
- [AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615☆64Nov 8, 2025Updated 6 months ago
- ☆12Dec 9, 2022Updated 3 years ago
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR2023] Practical Network Acceleration with Tiny Sets☆13Jul 28, 2023Updated 2 years ago
- Official Repository of LatentSeek☆82Jun 6, 2025Updated 11 months ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆33Mar 26, 2026Updated last month
- This repository contains the dataset of the paper ARGUS: Context-Based Detection of Stealthy IoT Infiltration Attacks☆13Apr 28, 2023Updated 3 years ago
- libtpms / swtpm software emulation of a Trusted Platform Module (TPM 1.2 and TPM 2.0) compile script☆13Sep 16, 2020Updated 5 years ago
- ☆21Dec 9, 2025Updated 5 months ago
- ICLR 2026☆42May 12, 2026Updated last week
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆22Sep 5, 2025Updated 8 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆88May 30, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Oct 3, 2024Updated last year
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations☆22Dec 24, 2025Updated 4 months ago
- Metadata Editor user and practice guide☆18May 8, 2026Updated last week
- Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments (ACL 2026)☆206May 11, 2026Updated last week
- Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning☆13Aug 12, 2021Updated 4 years ago
- Authenticated independently verifiable agent delegation.☆33Dec 17, 2025Updated 5 months ago
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆37Feb 21, 2026Updated 2 months ago