Official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR 2022.
☆13 Mar 13, 2022 Updated 4 years ago
Alternatives and similar repositories for offline_neural_bandits
Users that are interested in offline_neural_bandits are comparing it to the libraries listed below.
- ☆23 Jul 30, 2019 Updated 6 years ago
- A collection of PyTorch implementations of neural bandit algorithms, including NeuralUCB (Neural Contextual Bandits with UCB-based Explora… ☆28 Jul 15, 2025 Updated 9 months ago
- ☆38 Mar 28, 2022 Updated 4 years ago
- ☆13 Jul 26, 2023 Updated 2 years ago
- Estimate intrinsic Permanent Magnet Synchronous Motor temperatures with deep recurrent and convolutional neural networks. ☆18 Oct 8, 2019 Updated 6 years ago
- Codes for the paper "Data-Driven Sample Average Approximation with Covariate Information" ☆12 Aug 13, 2022 Updated 3 years ago
- Trajectory-ranked Reward EXtrapolation (T-REX) for Inverse Reinforcement Learning - A Tensorflow implementation trained on OpenAI Gym env… ☆19 Jul 4, 2019 Updated 6 years ago
- MIE424 Group Project: smart_predict_optimize ☆14 Apr 27, 2021 Updated 4 years ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga… ☆11 Dec 1, 2022 Updated 3 years ago
- ☆10 Feb 4, 2021 Updated 5 years ago
- ☆10 Apr 26, 2023 Updated 2 years ago
- ☆10 Apr 23, 2021 Updated 4 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning ☆29 Feb 21, 2022 Updated 4 years ago
- Data-Driven operations management - https://d3group.github.io/ddop ☆17 Jun 17, 2024 Updated last year
- Source code for the Joint Shapley values: a measure of joint feature importance ☆12 Sep 14, 2021 Updated 4 years ago
- Analyzes and adjusts the volume of MP3 files ☆12 Apr 7, 2019 Updated 7 years ago
- ☆14 Nov 17, 2023 Updated 2 years ago
- Source code for EMSE 2023 paper "Zero-Shot Code Representation Learning via Prompt Tuning" ☆13 Feb 15, 2023 Updated 3 years ago
- A project to assess the costs of flexible vehicle routing strategies ☆20 Apr 24, 2023 Updated 2 years ago
- LibAFL documentation book, Simplified Chinese edition ☆17 Mar 16, 2022 Updated 4 years ago
- Implementation of "Interior Point Solving for LP-based prediction+optimisation" paper in Neurips 2020. ☆20 May 16, 2024 Updated last year
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3 ☆44 Updated this week
- ☆16 Feb 19, 2025 Updated last year
- ☆50 Jul 4, 2020 Updated 5 years ago
- AndroidSlicer is a dynamic slicing tool, useful for a variety of tasks, from testing to debugging to security. ☆14 Jul 28, 2019 Updated 6 years ago
- Code to study the generalisability of benchmark models on non-stationary EHRs. ☆15 Aug 7, 2019 Updated 6 years ago
- Edge-weighted online bipartite matching (JACM 2022) ☆12 Jun 18, 2023 Updated 2 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021 ☆13 Nov 3, 2021 Updated 4 years ago
- Funding arbitrage screener for Binance, OKX, ByBit, Mexc ☆14 Sep 25, 2024 Updated last year
- ☆14 Apr 18, 2024 Updated 2 years ago
- ☆12 Nov 22, 2022 Updated 3 years ago
- Replication Code for Paper "Stochastic Optimization Forests". ☆22 Nov 5, 2021 Updated 4 years ago
- PhysioNet 2019 Challenge: Early Prediction of Sepsis from Clinical Data ☆12 May 19, 2019 Updated 6 years ago
- Thompson Sampling for Bandits using UCB policy ☆10 Jul 29, 2017 Updated 8 years ago
- A comprehensive Python library implementing a variety of contextual and non-contextual multi-armed bandit algorithms—including LinUCB, Ep… ☆13 Dec 31, 2024 Updated last year
- DDQN for DFJSP DATA SET ☆12 Mar 11, 2022 Updated 4 years ago
- Safe Reinforcement Learning with Natural Language Constraints ☆15 Oct 24, 2021 Updated 4 years ago
- online learning for time series prediction ☆13 May 17, 2014 Updated 11 years ago
- This project enables hyperledger fabric to evolve the iov energy trading ☆10 Apr 29, 2022 Updated 3 years ago