Container-free RL framework for training software engineering agents
☆50Mar 4, 2026Updated last month
Alternatives and similar repositories for SWE-MiniSandbox
Users that are interested in SWE-MiniSandbox are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Short RL☆18May 26, 2025Updated 10 months ago
- Official implementation for "Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk" (IJCAI 2022)☆27Aug 29, 2024Updated last year
- An asynchronous streaming data management module for efficient post-training.☆42Updated this week
- Source code, datasets and models of the paper "Efficient White-box Fairness Testing through Gradient Search" by Lingfeng Zhang, Yueling Z…☆11Jul 24, 2021Updated 4 years ago
- ☆14Mar 11, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Jul 14, 2024Updated last year
- ☆19Feb 18, 2025Updated last year
- 利用区块链(以太坊)发放营业执照。Issue business license with blockchain.☆11Nov 30, 2025Updated 4 months ago
- some tutorials for blog: simonjisu.github.io☆23Mar 25, 2021Updated 5 years ago
- ☆29Mar 24, 2025Updated last year
- This is an official implementation of "DeformableTST: Transformer for Time Series Forecasting without Over-reliance on Patching" (NeurIPS…☆22Oct 30, 2024Updated last year
- ☆15Aug 5, 2024Updated last year
- Safety-J: Evaluating Safety with Critique☆16Jul 28, 2024Updated last year
- ☆35Mar 6, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ACL 2025] Research code for the paper "OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents"☆20Jun 19, 2025Updated 9 months ago
- This is an open-source project that provides an efficient memory layer for autonomous AI agents, helping AI agents better manage and util…☆19Dec 13, 2024Updated last year
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph☆18Oct 13, 2024Updated last year
- RND1: Scaling Diffusion Language Models☆179Feb 22, 2026Updated last month
- Spark (PySpark) script that applies dynamic time warping to Energy usage data (using the python fastdtw package)☆15Oct 22, 2016Updated 9 years ago
- ☆17Feb 9, 2026Updated 2 months ago
- ☆36Feb 21, 2025Updated last year
- ☆13Aug 26, 2024Updated last year
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆24Feb 15, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Chainer and PyTorch implementation of GAN with gradient reversal layer☆10Mar 19, 2022Updated 4 years ago
- 云原生社区可观察性 SIG。☆11Mar 22, 2021Updated 5 years ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆46Sep 19, 2025Updated 6 months ago
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"☆40Apr 24, 2025Updated 11 months ago
- [Likelihood Lab Project 2024] Official Repository for The Technical Report, Label Unbalance in High-frequency Trading☆30Mar 20, 2025Updated last year
- Material for Ray Connect 2024 Conference☆12Oct 23, 2024Updated last year
- Flexible and Pluggable Serving Engine for Diffusion LLMs☆68Updated this week
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆15Jun 21, 2024Updated last year
- RESTful Pattern Recognition (R3) for Apache SkyWalking AI pipeline☆13Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- mc-brpc: A more convenient rpc framwork based on brpc to create rpc service☆15Feb 21, 2025Updated last year
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning (EMNLP 2025)☆58Oct 10, 2025Updated 6 months ago
- The client implementation for SkyWalking BanyanDB in Java☆21Jan 26, 2026Updated 2 months ago
- Machine Learning with the Elastic Stack, Published by Packt☆17Jan 30, 2023Updated 3 years ago
- ☆42Aug 20, 2025Updated 7 months ago
- Testing various methods of moving Arrow data between processes☆16Mar 29, 2023Updated 3 years ago
- ☆47Apr 9, 2025Updated last year