eyounx / RACOS
A theoretically-grounded derivative-free optimization method, born from a statistical view of evolutionary algorithms
☆66Updated 4 years ago
Related projects: ⓘ
- ☆57Updated this week
- ☆9Updated 5 years ago
- A MATLAB project containing many popular / existing constrained clustering algorithms☆27Updated 6 years ago
- Implementation of Scheduled Policy Optimization for task-oriented language grouding☆29Updated 6 years ago
- dlADMM: Deep Learning Optimization via Alternating Direction Method of Multipliers☆153Updated last year
- ☆53Updated 7 years ago
- Lasso with ADMM in Python/MPI☆27Updated 8 years ago
- Proceedings of ICML 2018☆39Updated last year
- ☆17Updated 4 years ago
- Examples/code for the alternating direction method of multipliers (ADMM)☆100Updated 9 years ago
- Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"☆39Updated 6 years ago
- Code for the paper "Let’s Make Block Coordinate Descent Go Fast"☆44Updated last year
- A branch-and-bound ILP solver☆26Updated 5 years ago
- Deep Reinforcement Learning with pytorch & visdom (the branch for A3C continuous control)☆24Updated 6 years ago
- Multi-task learning via Structural Regularization☆133Updated 3 years ago
- Library for Online Learning algorithms☆68Updated 9 years ago
- A generic optimization method for any integer programming problem☆88Updated 3 years ago
- Multi Task Learning Package for Matlab☆21Updated 5 years ago
- An introduction to variational Bayesian☆24Updated 5 years ago
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆56Updated 6 years ago
- Stochastic Variance Reduction Policy Gradient Estimation☆11Updated 5 years ago
- A demo for VR-SGD(Comparing to some state-of-the-art algorithms).☆13Updated 6 years ago
- ☆39Updated 11 years ago
- [Code] Deep Multi-task Representation Learning: A Tensor Factorisation Approach☆57Updated 7 years ago
- 2048 playing agent using deep Q-learning in Matlab.☆38Updated 8 years ago
- Codes for Stackelberg GAN☆12Updated 5 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 7 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago
- Collaborative Deep Reinforcement Learning☆32Updated 7 years ago
- Reinforcement Learning in Python☆107Updated 4 years ago