LMIS-ORG/slime-agentic

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LMIS-ORG/slime-agentic)

LMIS-ORG / slime-agentic

A project implementing various agentic RL based on the Slime post-training framework

☆509

Alternatives and similar repositories for slime-agentic

Users that are interested in slime-agentic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tanglumy / Finance-Bro
View on GitHub
your finance bro Agent for trading and investing
☆111Nov 8, 2025Updated 8 months ago
sinberCS / switch2ai
View on GitHub
switch2ai - A JetBrains IDE plugin enabling seamless collaboration between JetBrains IDEs and various AI agents (Cursor, Qoder, Claude co…
☆173Nov 11, 2025Updated 8 months ago
zhangyulin-space / ChatFerry
View on GitHub
☆104Oct 8, 2025Updated 9 months ago
MarkLee131 / PoC-Research-Papers
View on GitHub
Research papers on Proot-of-Concepts
☆114Feb 3, 2026Updated 5 months ago
Jinxhy / THEMIS
View on GitHub
[USENIX Security'25] THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models
☆108Aug 13, 2025Updated 11 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
aoda-zhang / PawHaven-FullStack-React-NodeJS
View on GitHub
🐱 PawHaven — an open-source platform that helps volunteers, shelters, and adopters report, track, and share stray animal rescue cases (f…
☆90Updated this week
ECNU-SII / Continual-NExT
View on GitHub
☆235Jun 27, 2026Updated last month
DEFENSE-SEU / RobustFlow
View on GitHub
Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"
☆238Oct 19, 2025Updated 9 months ago
ant-research / AvatarArtist
View on GitHub
[CVPR'25] Official PyTorch implementation of AvatarArtist: Open-Domain 4D Avatarization.
☆280Jun 14, 2025Updated last year
gulucaptain / DynamiCtrl
View on GitHub
[TMM'26] Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.
☆142May 23, 2025Updated last year
zzhuang94 / gin-vue-web
View on GitHub
一个基于 Gin 和 Vue 的企业级全栈 Web 开发框架，专为快速构建现代化管理平台而生。采用前后端分离架构，通过约定优于配置的设计理念，将传统 CRUD 开发效率提升 10 倍以上。 A minimal MVC web framework built with Gin…
☆31May 27, 2026Updated 2 months ago
AIR-DISCOVER / FreeAskWorld
View on GitHub
[AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…
☆229Jul 3, 2026Updated 3 weeks ago
liufanfanlff / C3-Context-Cascade-Compression
View on GitHub
Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression
☆313Jan 27, 2026Updated 6 months ago
serendipity800 / open-motion-apis
View on GitHub
☆80Mar 5, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Tsinghua-dhy / UR2
View on GitHub
UR2: Unify RAG and Reasoning through Reinforcement Learning
☆131May 26, 2026Updated 2 months ago
damo-cv / JCo-MVTON
View on GitHub
☆124Aug 29, 2025Updated 11 months ago
tangpan360 / MicroRCA-Agent
View on GitHub
2025 CCF International AIOps Challenge | Track 1: Microservice Root Cause Localization Based on Large Model Agents | "男团910" Solution · T…
☆259Jan 14, 2026Updated 6 months ago
wwang721 / pyafv
View on GitHub
🧬 Python code that implements the active finite Voronoi (AFV) model.
☆22Updated this week
HKUDS / LightReasoner
View on GitHub
[ACL 2026 Oral] "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"
☆604May 22, 2026Updated 2 months ago
Victor20082018 / -Optimized-Aquatic-Target-Recognition-Model
View on GitHub
The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…
☆48May 15, 2025Updated last year
yunbeizhang / Awesome-Visual-Prompt-Tuning
View on GitHub
[TMLR] A curated list of awesome papers, resources, and tools for Visual Prompt Tuning (VPT).
☆115Feb 22, 2026Updated 5 months ago
EDAPINENUT / ExplicitShortCut
View on GitHub
Official implementation of the paper <On the Design of One-Step Diffusion via Shortcutting Flow Paths>
☆287Apr 1, 2026Updated 3 months ago
zwt-cmd / zksync-era-go-indexer
View on GitHub
Beginner-friendly Web3 indexer for zkSync Era, written in Go. Easy to run and extend, syncing blocks/txs/logs and DeFi/DEX events into My…
☆21Dec 10, 2025Updated 7 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
bcmi / Object-Reflection-Generation-Dataset-DEROBA
View on GitHub
The dataset, code, and model for our paper "Reflection Generation for Composite Image Using Diffusion Model", ICME, 2026.
☆58Apr 4, 2026Updated 3 months ago
bcmi / OSInsert-Image-Composition
View on GitHub
☆62Jun 28, 2026Updated last month
gwh22 / UniVoice
View on GitHub
UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models
☆115Oct 30, 2025Updated 8 months ago
THUDM / slime
View on GitHub
slime is an LLM post-training framework for RL Scaling.
☆7,679Updated this week
kand-ta / kand
View on GitHub
Kand: Blazing-Fast, Modern Technical Analysis in Rust, Python, and WASM.
☆564Jan 22, 2026Updated 6 months ago
wguo-ai / SSV2A
View on GitHub
Gotta Hear Them All: Towards Sound Source Aware Audio Generation.
☆69Nov 15, 2025Updated 8 months ago
RLHFlow / Reinforce-Ada
View on GitHub
[COLM 2026] An adaptive sampling framework for Reinforce-style LLM post training.
☆97Nov 29, 2025Updated 8 months ago
ByteDance-Seed / DAComp
View on GitHub
[ICLR 2026] DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle
☆433Jul 10, 2026Updated 2 weeks ago
Harrydirk41 / ProTDyn
View on GitHub
Generative Protein Emulator
☆69Sep 25, 2025Updated 10 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
BinaryFroggy / Hopet
View on GitHub
A macOS desktop AI pet that mirrors your Claude Code & Codex CLI session state in real time. 支持Claude Code和Codex CLI 的像素风AI桌面宠物，会话状态实时映射，…
☆61Updated this week
THUDM / INFTY
View on GitHub
INFTY Engine: An Optimization Toolkit to Support Continual AI
☆573Jun 8, 2026Updated last month
jiaweizzhao / InRank
View on GitHub
☆153Jan 2, 2024Updated 2 years ago
YOUNG-bit / OpenGS-Fusion
View on GitHub
[IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding
☆77Aug 2, 2025Updated 11 months ago
AlenjandroWang / UniReason
View on GitHub
☆144May 21, 2026Updated 2 months ago
MarkLee131 / Hypervisor-Testing-Survey
View on GitHub
A collection of research papers on hypervisor testing.
☆65May 21, 2026Updated 2 months ago
Pony-Unicorn / tiny-client-only
View on GitHub
A lightweight React component that renders its children only on the client side, helping avoid SSR hydration errors in frameworks like Ne…
☆31Jul 10, 2026Updated 2 weeks ago