Qwen3-14B Orchestrator Agent Reinforcement Learning. **Achieved 160% improvement** on Stanford's TerminalBench
☆101Nov 3, 2025Updated 6 months ago
Alternatives and similar repositories for Orca-Agent-RL
Users that are interested in Orca-Agent-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A meta-repo that watches karpathy/autoresearch and adjacent systems, distills portable patterns for bounded agent-verifier research lo…☆43May 8, 2026Updated 3 weeks ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 6 months ago
- Claude Code Remote: Remote approvals (Discord), quota-aware auto-continuation, and quota scheduling☆48May 20, 2026Updated last week
- Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288☆20Oct 30, 2024Updated last year
- ☆13Mar 5, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Tree-based speculative decoding for Apple Silicon (MLX). ~10-15% faster than DFlash on code, ~1.5x over autoregressive. First MLX port wi…☆140Apr 15, 2026Updated last month
- Workflows built for my patreon page!☆84May 4, 2026Updated 3 weeks ago
- Minimal Claude Code alternative powered by MLX☆46Jan 11, 2026Updated 4 months ago
- The repository for the paper "Predicting in-hospital mortality by combining clinical notes with time-series data"☆12May 23, 2021Updated 5 years ago
- ☆20May 14, 2025Updated last year
- ☆12Mar 3, 2023Updated 3 years ago
- Local lightning-fast semantic code search built for agents☆42Mar 16, 2026Updated 2 months ago
- ☆15Oct 4, 2024Updated last year
- OpenAI's human-eval sampling benchmark☆13Jan 29, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- tuimorphic choose-your-own-adventure story game☆20Apr 30, 2026Updated last month
- ☆148Mar 31, 2026Updated last month
- lol☆10Mar 12, 2021Updated 5 years ago
- ☆14Dec 13, 2022Updated 3 years ago
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 4 years ago
- A standard language for machine-readable code comments☆131Mar 17, 2026Updated 2 months ago
- ☆28Oct 30, 2025Updated 7 months ago
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍☆14Apr 15, 2025Updated last year
- A Windows .mobileconfig generator for installing fonts on iOS device. 一个 Windows 上的 .mobileconfig 配置生成器,用于给 iOS 设备安装字体。☆13Feb 26, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Dec 4, 2024Updated last year
- Matlab code for face recognition (CS229 Course Project).☆13Jun 17, 2014Updated 11 years ago
- ☆21Oct 2, 2025Updated 7 months ago
- 一个聚合AI相关节目的播客rss☆21May 21, 2026Updated last week
- ktx is an executable context layer for data and analytics agents 🐙 Allow Claude Code, Codex, and any AI agent to query data accurately t…☆509Updated this week
- Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily☆10May 18, 2026Updated last week
- Code release for "TempLM: Distilling Language Models into Template-Based Generators"☆14Jul 21, 2022Updated 3 years ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆103Feb 26, 2026Updated 3 months ago
- ☆10Jan 20, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AI Scientist by Chicago Human+AI Lab☆129May 8, 2026Updated 3 weeks ago
- An LLM council that reviews your coding agent's every move☆96Apr 28, 2026Updated last month
- Claude Code Agent Monitoring & Observability on VSCode☆94May 8, 2026Updated 3 weeks ago
- 🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3"☆13Mar 26, 2024Updated 2 years ago
- The official public dataset for Famelack.☆45Updated this week
- 💬A curated list of incredible amount of publications related to Dialogue Systems especially Chatbots and Chit-chat Systems☆10Dec 5, 2019Updated 6 years ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation☆11Mar 7, 2023Updated 3 years ago