phoenix1917 / CASIA-Thesis
自动化所硕博论文模板
☆38Updated 6 years ago
Alternatives and similar repositories for CASIA-Thesis:
Users that are interested in CASIA-Thesis are comparing it to the libraries listed below
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆56Updated 6 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Updated 6 years ago
- homework for CS294 Fall 2017☆168Updated 6 years ago
- Code for Continual Learning of Context-dependent Processing in Neural Networks☆179Updated 3 years ago
- Code for the paper Adaptive Auxiliary Task Weighting for Reinforcement Learning☆25Updated 4 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- Simple pytorch implmentation of reinforcement learning algorithms☆25Updated 5 years ago
- Tensorflow code for ICML 2019 paper: LGM-Net: Learning to Generate Matching Networks for Few-Shot Learning☆84Updated 4 years ago
- Assignments for CS294-112 Fall2018 in Pytorch☆63Updated 6 years ago
- An implementation of ICML 2017 paper <<Neural Optimizer Search with Reinforcement Learning>> https://arxiv.org/abs/1709.07417☆15Updated 6 years ago
- ☆86Updated 2 years ago
- A large-scale multi-modal pre-trained model☆129Updated last year
- Neat and flexible implementation of MAML in pytorch: https://arxiv.org/abs/1703.03400☆59Updated 3 years ago
- Official implementation for our CVPR19 paper, AOGNets: Compositional Grammatical Architectures for Deep Learning☆66Updated 5 years ago
- Meta-SGD experiment on Omniglot classification compared with MAML☆79Updated 7 years ago
- Proximal Policy Optimization(PPO) Algorithm and its distributed implementation in Pytorch☆15Updated 7 years ago
- PCGrad pytorch sample code [not official]☆30Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- Implementation of Random Expert Distillation☆29Updated 5 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 7 years ago
- ☆17Updated 4 years ago
- ☆152Updated 5 years ago
- ☆43Updated 8 months ago
- The implementation of "Self-Supervised Generalisation with Meta Auxiliary Learning" [NeurIPS 2019].☆173Updated 3 years ago
- ☆66Updated 4 years ago
- ☆33Updated 7 years ago
- A PyTorch implementation of SSINet.☆16Updated 4 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago