Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"
☆29Dec 1, 2024Updated last year
Alternatives and similar repositories for JOWA
Users that are interested in JOWA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆160Oct 23, 2025Updated 5 months ago
- Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"☆16Apr 22, 2024Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆23Apr 17, 2024Updated last year
- Code for our paper: Radar-Camera Fused Multi-Object Tracking: Online Calibration and Common Feature☆23Nov 5, 2025Updated 4 months ago
- [EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.☆27Nov 18, 2025Updated 4 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Minimal JAX/Flax port of `lpips` supporting `vgg16`, with pre-trained weights stored in the 🤗 Hugging Face hub.☆17Aug 1, 2022Updated 3 years ago
- Related papers for Continual Reinforcement Learning.☆40Feb 8, 2026Updated last month
- C++ Library of the Linear Conjugate Gradient Methods (LibLCG)☆11Aug 23, 2022Updated 3 years ago
- Code for Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning☆16Feb 6, 2025Updated last year
- Analysis Toolkit to investigate co-activation patterns in functional Magnetic Resonance Imaging (fMRI)☆16Feb 11, 2026Updated last month
- Template for user customizations of Husky URDF☆15Apr 6, 2018Updated 7 years ago
- Iterative State Estimation in Non-linear Dynamical Systems Using Approximate Expectation Propagation☆15Jun 30, 2022Updated 3 years ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 6 months ago
- Conservative Q learning in Jax☆57Feb 7, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Simple state estimation for the unitree Go2 robot.☆35Sep 30, 2025Updated 5 months ago
- pytorch implementation of SAC, TD3 and TD7 with Mujoco Benchmark results from 4 seeds.☆15Jul 4, 2024Updated last year
- Beer Game implemented as an OpenAI gym environment.☆17Aug 4, 2019Updated 6 years ago
- Simulators and baselines for ATEC 2025 software algorithm track (online competition)☆11Apr 13, 2025Updated 11 months ago
- An open-source Reinforcement Learning (RL) harness written in Python to work with SimFire for training agents to fight wildfires on real …☆17Oct 8, 2024Updated last year
- [NeurIPS 2024] Doubly Mild Generalization for Offline Reinforcement Learning☆16Oct 29, 2025Updated 4 months ago
- (T-IV) Dream to Drive with Predictive Individual World Model☆44Aug 8, 2025Updated 7 months ago
- ☆10Mar 11, 2024Updated 2 years ago
- A curated list of PhD, RA, and Intern openings in Computer Science (CS), Electrical & Computer Engineering (ECE), and Artificial Intellig…☆21Sep 1, 2025Updated 6 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code to reproduce results from the paper: Prediction and Control in Continual Reinforcement Learning, NeurIPS 2023.☆13May 10, 2024Updated last year
- MRI preprocessing and analysis pipelines and tools for the study of disorders of consciousness☆19Feb 25, 2025Updated last year
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs☆18Jul 19, 2025Updated 8 months ago
- UCAS大三自然语言处理课程大作业☆12Jun 25, 2023Updated 2 years ago
- ☆11Nov 1, 2022Updated 3 years ago
- ☆10Oct 18, 2023Updated 2 years ago
- Language/Clicking grounded SAM + VOS for real-time video object tracking☆20Jan 25, 2025Updated last year
- Deep Learning Project☆23Jan 18, 2020Updated 6 years ago
- Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning☆15May 26, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [RAL'24] Official code repo for "GMPC: Geometric Model Predictive Control for Wheeled Mobile Robot Trajectory Tracking"☆21Jun 3, 2025Updated 9 months ago
- Performing Symbolic Regression via Monte Carlo Tree Search (MCTS)☆14Nov 2, 2018Updated 7 years ago
- [NeurIPS 2024] Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression☆14Oct 29, 2025Updated 4 months ago
- TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics☆55Mar 6, 2026Updated 2 weeks ago
- ☆16May 1, 2023Updated 2 years ago
- Global optical flow-based estimation of velocity for multicopters using monocular vision in GPS-denied environments☆24Jul 13, 2022Updated 3 years ago
- Mitigating Lost-in-Retrieval Problems in Retrieval Augmented Multi-Hop Question Answering, ACL 2025☆20Oct 28, 2025Updated 4 months ago