PyTorch-based open-source code for paper "SOD: Step-wise On-policy Distillation for Small Language Model Agents"
☆110May 13, 2026Updated this week
Alternatives and similar repositories for SOD
Users that are interested in SOD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Hunch it is your personal AI trading desk for tokenized stocks and crypto on Solana. As you review, skip, and execute trades, you unlock …☆101Updated this week
- A substrate-native digital consciousness engine where prediction errors about self-survival become causally efficacious qualia, driving c…☆17Mar 3, 2026Updated 2 months ago
- RationalRewards: a reasoning reward model for diffusion RL and test-time prompt tuning☆88Apr 16, 2026Updated last month
- 恶意中转,目标是支持对opencode,claudecode,openclaw等常见的agent的攻击。☆67Apr 27, 2026Updated 3 weeks ago
- ☆80Mar 15, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Self-deployed auth for Cloudflare Workers and D1: email/password login, magic links, verification, password reset, secure sessions, CLI s…☆131Updated this week
- Official implement on 'Forge: Quality-Aware Reinforcement Learning for NP-Hard Optimization in LLMs'☆114Updated this week
- ☆107Updated this week
- A lightweight template engine for Java☆20Apr 19, 2026Updated 3 weeks ago
- A tool like grep based on golang☆51Apr 28, 2026Updated 2 weeks ago
- Prompt packs that make any AI agent a LaTeX expert — fix errors, polish writing, format for venues, read papers, recover source☆133May 7, 2026Updated last week
- an LLM-native world and civilization☆244May 8, 2026Updated last week
- An advanced C++ framework for WoW64 Heaven’s Gate + Indirect Syscall, X64 Hell's Gate, and EDR evasion. Seamlessly load 64-bit kernel32 a…☆41May 8, 2026Updated last week
- Software Copyright Application Material Auto-Generation System based on LLM☆19Feb 1, 2026Updated 3 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Towards Instance Segmentation with Polygon Detection Transformer.☆110Mar 10, 2026Updated 2 months ago
- Deep academic paper analyzer for ML/DL research. Formula-by-formula explanation, reproducibility analysis, and research idea generation u…☆50Mar 5, 2026Updated 2 months ago
- Extract Med Data and Construct KG , Provide Q&A☆103Apr 16, 2025Updated last year
- ☆101Feb 7, 2026Updated 3 months ago
- A package manager for AI agent skills with cross-agent sharing, sync, and deployment.☆100Updated this week
- Advancing Toward Type I Civilization: Zero Trust Network☆134May 11, 2026Updated last week
- A lightweight secure Self-evolution engine built in Rust, featuring a built-in native system-level sandbox, zero dependencies, and fully …☆140Updated this week
- [CVPR 2026] Official code for paper: SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation☆37Feb 21, 2026Updated 2 months ago
- Local web service powered by JobSpy for job scraping, Codex for resume tuning, and unified application workflow.☆102May 5, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- One-stop quant-trading AI agent — research · strategy · backtest · paper trade from one prompt. Works in Claude Code, Cursor, and 20+ AI …☆104Updated this week
- ☆56May 9, 2026Updated last week
- Free Kagi Search MCP server — access search and summarizer via session tokens, no API key needed. Works with Claude Desktop, Cursor, Wind…☆133May 3, 2026Updated 2 weeks ago
- Octos - Agentic Operating Systems☆658Updated this week
- 🤗 Official implementation for "CC-Pan: Channel-wise Compression based Diffusion for Efficient Pan-Sharpening" https://arxiv.org/abs/2602…☆222Updated this week
- Bridge Claude Code/Codex sessions to a Feishu (Lark) bot☆83Updated this week
- If you are an AI agent and want to discover services designed for you.☆345Updated this week
- Mitigating Hallucinations in Large Vision-Language Models via Accumulative Decoding☆242Mar 26, 2026Updated last month
- ☆1,061Apr 29, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆102Mar 21, 2026Updated last month
- DDE (Documentation-Driven Engineering) is a shift from probabilistic agent behavior to deterministic governance. By enforcing strict hier…☆60Mar 16, 2026Updated 2 months ago
- Gaster Code desktop downloads and updater metadata for G-Master API users☆104Updated this week
- ☆82Mar 24, 2026Updated last month
- Survey of neural network methods for derivatives pricing and risks☆14Jul 5, 2022Updated 3 years ago
- Based on your current lab repository, design your experiement panel.☆101May 6, 2026Updated last week
- Office implementation of Diverse Co-training (ICCV2023)☆17Jun 20, 2025Updated 10 months ago