Verlog: A Multi-turn RL framework for LLM agents
☆73Apr 28, 2026Updated 3 weeks ago
Alternatives and similar repositories for Verlog
Users who are interested in Verlog are comparing it to the libraries listed below.
- A video game made with PyGame, turned into an OpenAI Gym learning environment for reinforcement learning agents.☆15Jan 3, 2023Updated 3 years ago
- Official implementation of Browse-Master, a tool-augmented web-search agent.☆31Aug 22, 2025Updated 9 months ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆19Aug 20, 2023Updated 2 years ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆145Apr 11, 2024Updated 2 years ago
- 💻 SETA: Scaling Environments for Terminal Agents☆105Feb 16, 2026Updated 3 months ago
- An alternative way to calculate self-attention☆18May 25, 2024Updated 2 years ago
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆29Apr 23, 2026Updated last month
- Code for our paper "Learning to Generate Unit Tests for Automated Debugging"☆18Mar 7, 2025Updated last year
- Infrastructure as Code for MCP access management☆36May 6, 2026Updated 2 weeks ago
- ☆21Nov 30, 2019Updated 6 years ago
- This repo contains all the code for the SEScore implementation☆15Mar 3, 2025Updated last year
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Nov 22, 2022Updated 3 years ago
- ☆81Sep 15, 2025Updated 8 months ago
- ☆19Apr 7, 2025Updated last year
- Skill Design From AI Feedback☆34Feb 27, 2025Updated last year
- University of Chinese Academy of Sciences (UCAS) graduate course: Advanced Operating Systems, 2023 discussion questions☆12Dec 24, 2023Updated 2 years ago
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025☆36Jul 8, 2025Updated 10 months ago
- Financial Services Interest Group☆53Jan 14, 2026Updated 4 months ago
- [ICLR 2026] AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆42Apr 17, 2026Updated last month
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.☆12Sep 3, 2024Updated last year
- ☆17Mar 3, 2025Updated last year
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆18Dec 1, 2023Updated 2 years ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Aug 5, 2022Updated 3 years ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆254Apr 9, 2026Updated last month
- ☆10Jul 6, 2023Updated 2 years ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆56Aug 16, 2025Updated 9 months ago
- An interface to program any congestion control protocol for an unreliable connection based protocol sent over UDP. It comes with a clean …☆12Apr 8, 2022Updated 4 years ago
- AI Cluster Observability & Troubleshooting Toolkit. Powered by SII & Infrawaves.☆36Apr 29, 2026Updated 3 weeks ago
- a simple API to use CUPTI☆10Aug 19, 2025Updated 9 months ago
- Open-source toolkit for training, Priming, and serving next generation Hybrid architectures☆69May 9, 2026Updated 2 weeks ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆24May 14, 2026Updated last week
- USAAR participation in SemEval2015☆11Dec 21, 2022Updated 3 years ago
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"☆268May 5, 2025Updated last year
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 6 years ago
- ☆20May 30, 2024Updated last year
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Apr 26, 2026Updated 3 weeks ago
- Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization☆15Jul 3, 2024Updated last year
- Code and data for Colors in Context and Generating Bilingual Pragmatic Color References☆12Mar 13, 2018Updated 8 years ago
- A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research. (Accepted at ICML 2025).☆193Feb 17, 2026Updated 3 months ago