☆16 · Apr 12, 2023 · Updated 2 years ago
Alternatives and similar repositories for BOME
Users that are interested in BOME are comparing it to the libraries listed below.
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms" ☆50 · Dec 28, 2021 · Updated 4 years ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods. ☆19 · Feb 13, 2023 · Updated 3 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆34 · Dec 14, 2023 · Updated 2 years ago
- ☆13 · Jul 2, 2025 · Updated 8 months ago
- Code for Global Convergence of Block Coordinate Descent in Deep Learning (ICML 2019) ☆37 · Jun 18, 2019 · Updated 6 years ago
- Minimum work examples for "Linear Precoding based on Polynomial Expansion" ☆13 · Oct 27, 2017 · Updated 8 years ago
- [AAAI 2024 (Oral)] Safety-MuJoCo Environments. ☆11 · Jun 4, 2024 · Updated last year
- Code for PolyTask: Learning Unified Policies through Behavior Distillation ☆11 · Oct 13, 2023 · Updated 2 years ago
- Risk-sensitive Inverse Reinforcement Learning ☆11 · Sep 11, 2019 · Updated 6 years ago
- Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?" ☆23 · Sep 7, 2025 · Updated 6 months ago
- A pytorch implementation of Graph Neural Networks-Based User Pairing in Wireless Communication Systems ☆12 · Sep 16, 2024 · Updated last year
- Simulation code for “Hardware Distortion Correlation Has Negligible Impact on UL Massive MIMO Spectral Efficiency” by Emil Björnson, Luca… ☆11 · Nov 7, 2018 · Updated 7 years ago
- ☆17 · May 14, 2024 · Updated last year
- The codes to reproduce the simulation results for the work "An Unsupervised Deep Unrolling Framework for Constrained Optimization Problem… ☆12 · Nov 19, 2023 · Updated 2 years ago
- ☆11 · Jun 20, 2023 · Updated 2 years ago
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R… ☆15 · Dec 19, 2022 · Updated 3 years ago
- Convex optimization based antenna selection for Massive MIMO ☆11 · Nov 15, 2019 · Updated 6 years ago
- ☆15 · Dec 31, 2020 · Updated 5 years ago
- ☆12 · Mar 18, 2024 · Updated 2 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration ☆28 · Feb 10, 2022 · Updated 4 years ago
- Simulation code for paper on "Electromagnetic Based Communication Model for Dynamic Metasurface Antennas". ☆13 · Apr 25, 2022 · Updated 3 years ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing ☆13 · Dec 6, 2022 · Updated 3 years ago
- Robust Reinforcement Learning Benchmark ☆12 · Sep 22, 2024 · Updated last year
- Implementation of CrossLoco, currently lite version ☆14 · May 12, 2024 · Updated last year
- [RAL 2025] MTIL: Encoding Full History with Mamba for Temporal Imitation Learning ☆27 · Nov 17, 2025 · Updated 4 months ago
- ☆11 · Feb 29, 2024 · Updated 2 years ago
- ☆22 · Dec 19, 2025 · Updated 3 months ago
- RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning ☆18 · May 24, 2023 · Updated 2 years ago
- Manifold-based-algorithm to solve problems with constant modulus constraints. ☆15 · Jan 2, 2020 · Updated 6 years ago
- ☆11 · Oct 21, 2023 · Updated 2 years ago
- Code for ICML 2023 paper named "Averaged Method of Multipliers for Bi-Level Optimization without Lower-Level Strong Convexity" ☆14 · Jan 14, 2025 · Updated last year
- Partial implementation of ODE-GAN technique from the paper Training Generative Adversarial Networks by Solving Ordinary Differential Equa… ☆16 · Nov 12, 2020 · Updated 5 years ago
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs ☆39 · Mar 13, 2026 · Updated last week
- ☆16 · Apr 26, 2023 · Updated 2 years ago
- Scalable Monotonic Neural Networks ☆12 · Mar 14, 2024 · Updated 2 years ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat… ☆11 · Apr 3, 2019 · Updated 6 years ago
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it. ☆12 · Jun 20, 2017 · Updated 8 years ago
- Code to accompany "Conformal Prediction as Bayesian Quadrature" by Jake Snell & Tom Griffiths (ICML 2025 Outstanding Paper) ☆23 · Jul 14, 2025 · Updated 8 months ago
- "proving-contest"-backends for several theorem provers ☆13 · Oct 15, 2024 · Updated last year