A project implementing various agentic RL based on the Slime post-training framework
☆90Apr 3, 2026Updated this week
Alternatives and similar repositories for slime-agentic
Users that are interested in slime-agentic are comparing it to the libraries listed below.
- Research papers on Proof-of-Concepts☆109Feb 3, 2026Updated 2 months ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated last year
- ☆104Oct 8, 2025Updated 6 months ago
- 2025 CCF International AIOps Challenge | Track 1: Microservice Root Cause Localization Based on Large Model Agents | "男团910" Solution · T…☆220Jan 14, 2026Updated 2 months ago
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re…☆39Sep 22, 2024Updated last year
- Logic-in-frames: Dynamic keyframe search via visual semantic-logical verification for long video understanding☆59Nov 17, 2025Updated 4 months ago
- Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"☆235Oct 19, 2025Updated 5 months ago
- MIO: A Foundation Model on Multimodal Tokens☆34Dec 13, 2024Updated last year
- [USENIX Security'25] THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models☆108Aug 13, 2025Updated 7 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆59Apr 20, 2024Updated last year
- Source code for ICLR2025 paper "NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation".☆91Aug 23, 2025Updated 7 months ago
- The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…☆47May 15, 2025Updated 10 months ago
- GAIIC2024 dual-light object detection from a UAV perspective - Rank 6 solution☆12Jun 17, 2024Updated last year
- ☆12Nov 2, 2025Updated 5 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆127Nov 19, 2025Updated 4 months ago
- [ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆17Apr 3, 2025Updated last year
- Your efficient and accurate answer verification system for RL training.☆41Jun 23, 2025Updated 9 months ago
- 🧬 Python code that implements the active-finite-Voronoi (AFV) model.☆20Mar 19, 2026Updated 3 weeks ago
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆375Mar 30, 2026Updated last week
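- A modern remote sensing image analysis platform based on React + OpenLayers + GeoScene Enterprise, built specifically for high-precision remote sensing data visualization and analysis of the Haoping Town area. The system uses GeoScene Enterprise online services to implement river and lake water quality monitoring based on Gaofen-2 (GF-2) satellite imagery, …☆24Oct 25, 2025Updated 5 months ago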
- PRSA: Prompt Stealing Attacks against Real-World Prompt Services (USENIX Security '25)☆26Dec 25, 2025Updated 3 months ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆36Feb 9, 2026Updated 2 months ago
- Standardizing environment infrastructure with Strands Agents — step, observe, reward.☆44Updated this week
- Control your Mac with natural language by converting intent into executable action sequences, with planning, retries, and verifiable outc…☆34Feb 8, 2026Updated 2 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆32Apr 1, 2025Updated last year
- ☆15May 9, 2024Updated last year
- UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models☆110Oct 30, 2025Updated 5 months ago
- ☆32Nov 11, 2025Updated 4 months ago
- Personal-use code examples for remotely managing the vSphere platform with the pyvmomi library☆66Jan 7, 2025Updated last year
- The summary of code and paper for unified model towards context-dependent (CD) concept segmentation.☆119Aug 27, 2025Updated 7 months ago
- ☆232Nov 5, 2025Updated 5 months ago
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning☆53Oct 23, 2025Updated 5 months ago
- An iOS network debugging library. It can monitor HTTP requests within the app and display information related to each request.☆15Apr 17, 2017Updated 8 years ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆20Mar 8, 2026Updated last month
- Official implementation of "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".☆50Oct 30, 2025Updated 5 months ago
- Pytorch Codes for LA-Net☆15Dec 14, 2023Updated 2 years ago
- [ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.☆57May 2, 2025Updated 11 months ago
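- This project proposes a defense mechanism against jailbreak attacks on multimodal large language models that combines Chain of Thought reasoning with intent analysis, achieving significant improvements in defense effectiveness across a variety of attack scenarios.☆20Jun 17, 2025Updated 9 months ago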
- Toolathlon-Gym for testing AI agents real-world tool-use capabilities across diverse MCP servers.☆104Apr 2, 2026Updated last week