Official repository of the paper: Continual Harness: Online Adaptation for Self-Improving Foundation Agents and PokeAgent Speedrun Track 2
☆232May 13, 2026Updated last month
Alternatives and similar repositories for continual-harness
Users that are interested in continual-harness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pokémon Showdown RL Agents and Datasets☆113Jun 19, 2026Updated last week
- Official repository of the spotlight ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent.☆174Mar 11, 2026Updated 3 months ago
- ☆94Jun 30, 2025Updated 11 months ago
- Playing Pokemon Red with Reinforcement Learning☆21Jul 28, 2025Updated 11 months ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This is the code of paper "RawFormer: An Efficient Vision Transformer for Low-Light RAW Image Enhancement"☆17Apr 14, 2023Updated 3 years ago
- Code and Data for GlitchBench☆13Feb 27, 2024Updated 2 years ago
- [ICML 2025] Adaptive Self-improvement LLM Agentic System for ML Library Development☆17Jan 6, 2026Updated 5 months ago
- Code for Scalable Offline Model-Based RL with Action chunking☆29Feb 20, 2026Updated 4 months ago
- ☆23Aug 26, 2023Updated 2 years ago
- [Roadmap] Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling☆120Jun 9, 2026Updated 2 weeks ago
- Official repo of paper LM2☆48Feb 13, 2025Updated last year
- Accelerating RL for LLM Reasoning with Optimal Advantage Regression☆41May 30, 2025Updated last year
- Simple rules based grapheme to phoneme in Python☆11Sep 2, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025☆37Jul 8, 2025Updated 11 months ago
- [NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)☆13Oct 30, 2023Updated 2 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Aug 20, 2024Updated last year
- CAPE using text-graphs☆29Apr 7, 2025Updated last year
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆39Oct 1, 2025Updated 8 months ago
- Minimal example to apply Decision Transformer in Atari Pong☆15Feb 1, 2025Updated last year
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning☆69Oct 31, 2025Updated 7 months ago
- COMMS Software for UPSat☆12Dec 17, 2018Updated 7 years ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆101Jun 17, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository contains the official implementation (PyTorch) of "Multimodal Forgery Detection Using Ensemble Learning" proposed in APSI…☆10Jan 4, 2023Updated 3 years ago
- [ECCV 2024] MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance☆20May 2, 2026Updated last month
- Official PyTorch codebase for the Modeling Caption Diversity in ContrastiveVision-Language Pretraining paper.☆18Mar 28, 2025Updated last year
- Official implementation of TBA for async LLM post-training.☆31Nov 5, 2025Updated 7 months ago
- ☆10May 21, 2023Updated 3 years ago
- ☆46Jan 24, 2024Updated 2 years ago
- ☆10Apr 2, 2023Updated 3 years ago
- PyTorch reimplementation for "LO-Net: Deep Real-time Lidar Odometry" https://arxiv.org/abs/1904.08242☆16Jan 8, 2022Updated 4 years ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆204Sep 13, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆29Oct 14, 2025Updated 8 months ago
- Toolbox of functions using the ar_track_alvar package.☆12Jul 2, 2021Updated 4 years ago
- ☆10Dec 4, 2024Updated last year
- ☆22Sep 27, 2024Updated last year
- Official Implementation of MultiWorld: Scalable Multi-Agent Multi-View Video World Models☆236May 12, 2026Updated last month
- ☆30Sep 5, 2024Updated last year
- [ICLR'24] Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition☆55May 14, 2024Updated 2 years ago