A Python Package for Non-stationary Online Learning (PyNOL)
☆35Apr 5, 2024Updated 2 years ago
Alternatives and similar repositories for PyNOL
Users who are interested in PyNOL are comparing it to the libraries listed below.
- ☆18Oct 25, 2022Updated 3 years ago
- Performant, differentiable reinforcement learning☆23Jun 16, 2023Updated 2 years ago
- Don't just regulate gradients like in Muon, regulate the weights too☆32Jul 30, 2025Updated 9 months ago
- Distributed multi-agent average consensus☆11Mar 17, 2020Updated 6 years ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Nov 14, 2019Updated 6 years ago
- ☆10Apr 26, 2023Updated 3 years ago
- ☆10Apr 23, 2021Updated 5 years ago
- Google AI Princeton control framework☆39Nov 2, 2020Updated 5 years ago
- ☆13Mar 25, 2023Updated 3 years ago
- The official code of Multi-player Nash Preference Optimization [ICLR 2026]☆35Feb 4, 2026Updated 3 months ago
- PyTorch Implementation of Variance Reduced Optimization Algorithms -- SARAH and SVRG.☆15Jul 11, 2021Updated 4 years ago
- ☆16Feb 10, 2023Updated 3 years ago
- This repository is for the "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation".☆17Nov 18, 2025Updated 5 months ago
- A squad movement planning library for StarCraft AI using Monte Carlo Tree Search and Negamax☆14Jan 1, 2019Updated 7 years ago
- A simple Python implementation of basic Wavelet denoising algorithms☆17Dec 8, 2023Updated 2 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆11Oct 21, 2024Updated last year
- machine learning on edge (fog) computing☆13Dec 5, 2018Updated 7 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Nov 3, 2021Updated 4 years ago
- ☆13Nov 22, 2022Updated 3 years ago
- ☆13Jan 9, 2018Updated 8 years ago
- Mitigating Routing Update Overhead for Traffic Engineering by Combining Destination-based Routing with Reinforcement Learning☆15Oct 16, 2022Updated 3 years ago
- Nanjing University undergraduate thesis template☆13Jun 1, 2016Updated 9 years ago
- Safe Reinforcement Learning with Natural Language Constraints☆15Oct 24, 2021Updated 4 years ago
- source code of the paper "[CIKM 2023] Task-Difficulty-Aware Meta-Learning with Adaptive Update Strategies for User Cold-Start Recommendat…☆10Oct 27, 2023Updated 2 years ago
- ☆37Apr 16, 2021Updated 5 years ago
- ☆19Jul 18, 2021Updated 4 years ago
- ☆16Apr 21, 2022Updated 4 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆13Mar 13, 2022Updated 4 years ago
- Code repo for the NeurIPS 2021 paper "Online Adaptation to Label Distribution Shift".☆15Feb 15, 2023Updated 3 years ago
- A general-purpose Monte Carlo tree search AI in JavaScript☆17Jan 4, 2023Updated 3 years ago
- Generate a Deep Neural Network for Equalization of Optical channels☆16Aug 12, 2021Updated 4 years ago
- Code for "Nonlinear stochastic modeling with Langevin regression" J. L. Callaham, J.-C. Loiseau, G. Rigas, and S. L. Brunton☆26Feb 24, 2022Updated 4 years ago
- Tutorial on Multi-Objective Recommender Systems @ KDD 2021☆19Dec 4, 2022Updated 3 years ago
- Prose for a painting source code☆12Oct 8, 2019Updated 6 years ago
- A2C for GVG-AI☆22Nov 7, 2018Updated 7 years ago
- Code and real data for "Counterfactual Temporal Point Processes", NeurIPS 2022☆16Sep 26, 2022Updated 3 years ago
- Models Supported: DenseNet121, DenseNet161, DenseNet169, DenseNet201 and DenseNet264 (1D and 2D version with DEMO for Classification and …☆16Nov 25, 2021Updated 4 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆22Oct 3, 2022Updated 3 years ago