Container-free RL framework for training software engineering agents
☆54Mar 4, 2026Updated 2 months ago
Alternatives and similar repositories for SWE-MiniSandbox
Users that are interested in SWE-MiniSandbox are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Short RL☆18Apr 16, 2026Updated 2 weeks ago
- MCP server & Claude Code skills for 100+ AI services (LLMs, image/video gen, TTS). One API key, OpenAI-compatible.☆53Apr 15, 2026Updated 2 weeks ago
- An asynchronous streaming data management module for efficient post-training.☆63Apr 27, 2026Updated last week
- ☆14Mar 11, 2025Updated last year
- ☆14Aug 10, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 此项目是我个人对MIT 6.5940 课程作业的答案,学习笔记和心得。☆15Mar 1, 2024Updated 2 years ago
- Spectral Sphere Optimizer☆114Mar 23, 2026Updated last month
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing".☆14Sep 30, 2023Updated 2 years ago
- ☆29Mar 24, 2025Updated last year
- This is an official implementation of "DeformableTST: Transformer for Time Series Forecasting without Over-reliance on Patching" (NeurIPS…☆22Oct 30, 2024Updated last year
- ☆15Aug 5, 2024Updated last year
- Safety-J: Evaluating Safety with Critique☆16Jul 28, 2024Updated last year
- ☆38Mar 6, 2026Updated last month
- [ACL 2025] Research code for the paper "OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents"☆21Jun 19, 2025Updated 10 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This is an open-source project that provides an efficient memory layer for autonomous AI agents, helping AI agents better manage and util…☆19Dec 13, 2024Updated last year
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph☆18Oct 13, 2024Updated last year
- RND1: Scaling Diffusion Language Models☆180Feb 22, 2026Updated 2 months ago
- ☆17Apr 13, 2026Updated 3 weeks ago
- GitHub actions to build wheels for nogil Python☆14Apr 8, 2024Updated 2 years ago
- ☆13Aug 26, 2024Updated last year
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆25Feb 15, 2023Updated 3 years ago
- Chainer and PyTorch implementation of GAN with gradient reversal layer☆10Mar 19, 2022Updated 4 years ago
- 云原生社区可观察性 SIG。☆11Mar 22, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆48Sep 19, 2025Updated 7 months ago
- sppringboot demo☆12Jul 22, 2023Updated 2 years ago
- ☆12Mar 27, 2026Updated last month
- Material for Ray Connect 2024 Conference☆12Oct 23, 2024Updated last year
- Flexible and Pluggable Serving Engine for Diffusion LLMs☆69Apr 26, 2026Updated last week
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆15Jun 21, 2024Updated last year
- RESTful Pattern Recognition (R3) for Apache SkyWalking AI pipeline☆13Apr 21, 2026Updated last week
- ☆12May 14, 2021Updated 4 years ago
- A THU beamer template based on PKU beamer template☆30Aug 26, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- mc-brpc: A more convenient rpc framwork based on brpc to create rpc service☆15Feb 21, 2025Updated last year
- The client implementation for SkyWalking BanyanDB in Java☆21Jan 26, 2026Updated 3 months ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning (EMNLP 2025)☆59Oct 10, 2025Updated 6 months ago
- A powerful streaming log template miner based on the Drain algorithm in golang☆16Oct 23, 2024Updated last year
- Efficient Pandas and Ray Kafka Producer for python using actor model.☆19Jan 18, 2024Updated 2 years ago
- Machine Learning with the Elastic Stack, Published by Packt☆17Jan 30, 2023Updated 3 years ago
- ☆43Aug 20, 2025Updated 8 months ago