OPARL (Optimistic and Pessimistic Actor in RL)
☆18Jan 26, 2024Updated 2 years ago
Alternatives and similar repositories for OPARL
Users that are interested in OPARL are comparing it to the libraries listed below.
- Here is a list of papers related to causal reinforcement learning, and I hope you can submit relevant missing papers in the issue.☆19Jan 23, 2024Updated 2 years ago
- ☆12May 23, 2024Updated 2 years ago
- [ICLR 2026 Oral] 🎉Hallucination Begins Where Saliency Drops☆58Feb 12, 2026Updated 3 months ago
- ☆11Oct 8, 2022Updated 3 years ago
- ☆11Feb 28, 2024Updated 2 years ago
- [ECCV 2024] BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models☆86Aug 19, 2024Updated last year
- personal notes (30k loc)☆13Updated this week
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- AI-based analytical tools for the analysis of STEM images.☆12Oct 22, 2024Updated last year
- ☆11Nov 13, 2025Updated 6 months ago
- Integrated software for comprehensive BMS strategy validation, SOC accuracy estimation, cell boundary definition, and battery data analys…☆15Jul 31, 2024Updated last year
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Official Github of "Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework"☆19Jan 4, 2026Updated 4 months ago
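- Big-tech AI mock interviewer Skill - covers Alibaba/Tencent/ByteDance/Baidu/Meituan/Huawei and others, generating 10 tailored interview questions from a JD and résumé☆95Updated this week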
- A Fast, Portable Deep Reinforcement Learning Library for Continuous Control☆13Jul 26, 2023Updated 2 years ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 3 years ago
- [ICLR 2025] SimXRD-4M: Big Simulated X-ray Diffraction Data and Crystalline Symmetry Classification Benchmark☆28May 24, 2026Updated last week
- ☆16Jun 1, 2023Updated 3 years ago
- [ACM TOMM] Official implementation of "TextCoT: Zoom-In for Enhanced Multimodal Text-Rich Image Understanding"☆45Feb 27, 2026Updated 3 months ago
- A Data Science pipeline for Algorithmic Trading: A comparative study in applications to Finance and cryptoeconomics☆14Jul 1, 2022Updated 3 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- ☆17Dec 19, 2024Updated last year
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆14Jan 27, 2026Updated 4 months ago
- NEGA: idiomatic spoken-language assistant☆103Feb 3, 2026Updated 3 months ago
- Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"☆11Aug 7, 2023Updated 2 years ago
- Code for optimal execution☆12Oct 29, 2020Updated 5 years ago
- Implementing DQNClipped and DQNReg Algorithms☆10Mar 2, 2021Updated 5 years ago
- Pytorch implementation of DeepLOB-ATT and DeepLOB-Seq2Seq from Multi Horizon Forecasting for Limit Order Books☆14Feb 4, 2023Updated 3 years ago
- ☆11Jul 10, 2025Updated 10 months ago
- ☆18Jan 30, 2025Updated last year
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- Model-based Hindsight Experience Replay☆10Jun 8, 2022Updated 3 years ago
- Code for NeurIPS2021 submission "A Surrogate Objective Framework for Prediction+Programming with Soft Constraints"☆13Aug 30, 2021Updated 4 years ago
- code for the paper Offline Prioritized Experience Replay☆12Jun 13, 2023Updated 2 years ago
- State of the art time series forecasting method that has the FFORMA ensemble learn from the ESRNN hybrid model and others.☆13Sep 7, 2022Updated 3 years ago
- ☆11Sep 5, 2024Updated last year
- A paper replication project for Time-driven feature-aware jointly deep reinforcement learning☆11Mar 12, 2021Updated 5 years ago
- ☆11Oct 3, 2022Updated 3 years ago