tianbingsz/SVRG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tianbingsz/SVRG)

tianbingsz / SVRG

Stochastic Variance Reduction Policy Gradient Estimation

☆11

Alternatives and similar repositories for SVRG

Users that are interested in SVRG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tianbingsz / WALL-E
View on GitHub
Codebase for Efficient yet simple Reinforcement Learning Research Framework
☆28Jan 14, 2023Updated 3 years ago
yfhanhust / MiniBatchSpectralClustering
View on GitHub
☆13Mar 3, 2017Updated 9 years ago
idlrl / flare
View on GitHub
RL framework for embodied agents based on PyTorch
☆11Apr 11, 2019Updated 7 years ago
JuliaPOMDP / TabularTDLearning.jl
View on GitHub
Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA
☆12Nov 16, 2025Updated 8 months ago
flint-xf-fan / MLDA-Workshop
View on GitHub
ML/DL training workshops for EEE undergrads
☆13Jan 16, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xiecong / Simple-Implementation-of-ML-Algorithms
View on GitHub
My simplest implementations of common ML algorithms
☆20Jul 23, 2023Updated 3 years ago
kgorman / ocean
View on GitHub
Ocean sensor data from the NOAA CO-OPS API
☆14Jul 14, 2016Updated 10 years ago
yueqiw / OptML-SVRG-PyTorch
View on GitHub
Implementation of SVRG for training neural networks
☆24Nov 24, 2019Updated 6 years ago
annieyan / Bandits-using-UCB-algorithm
View on GitHub
Thompson Sampling for Bandits using UCB policy
☆10Jul 29, 2017Updated 9 years ago
Zhaoxian-Wu / Byrd-SAGA
View on GitHub
Code for paper "Byzantine-Resilient Distributed Finite-Sum Optimization over Networks"
☆18Nov 5, 2020Updated 5 years ago
igsor / HDPy
View on GitHub
Heuristic Dynamic Programming with Python
☆14Jul 28, 2014Updated 12 years ago
FutureComputing4AI / Reverse-Engineering-Function-Search
View on GitHub
☆13Oct 31, 2024Updated last year
abhinavgupta / Modbus-protocol-RS485
View on GitHub
Implementation of Modbus protocol for a data logger project back that I worked back in my 3rd year of college. This is a low overhead wir…
☆14Jan 26, 2012Updated 14 years ago
silencesmile / pyecharts
View on GitHub
Python画图超级模块：pyecharts 功能大全
☆11Oct 12, 2019Updated 6 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
microsoft / bonsai-anylogic
View on GitHub
AnyLogic connector for Microsoft Bonsai and sample models
☆29Dec 27, 2022Updated 3 years ago
mewmew / float
View on GitHub
Binary floating-point formats in Go (IEEE 754 half and quadruple precision, x86 extended precision and PowerPC quadruple precision with d…
☆23Dec 12, 2021Updated 4 years ago
BigBayes / SGMCMC.jl
View on GitHub
Stochastic Gradient Markov Chain Monte Carlo and Optimisation
☆17Mar 21, 2017Updated 9 years ago
flint-xf-fan / Federated-RLHF
View on GitHub
[AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The Code contains experiments for training multiple i…
☆16Apr 16, 2025Updated last year
Sthing / Nick-Gammon-RS485
View on GitHub
Python implementation of Nick Gammon's RS485 library for the Arduino.
☆11Oct 29, 2015Updated 10 years ago
fbora / tic-tac-GO_ZERO
View on GitHub
Implementation of Alpha Go Zero algorithm for the game of tic-tac-toe
☆16Nov 4, 2017Updated 8 years ago
feidieufo / homework
View on GitHub
Assignments for CS294-112.
☆30Sep 11, 2019Updated 6 years ago
covert-labs / covert-labs.github.io
View on GitHub
Covert.io blog
☆12Feb 3, 2024Updated 2 years ago
zalanborsos / online-variance-reduction
View on GitHub
Online Variance Reduction
☆15May 9, 2019Updated 7 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
mohakbhardwaj / SaIL
View on GitHub
☆17May 16, 2018Updated 8 years ago
hongyanz / Stackelberg-GAN
View on GitHub
Codes for Stackelberg GAN
☆15Apr 23, 2019Updated 7 years ago
yashpatel5400 / neuropath
View on GitHub
A neural branch predictor tested using CPU emulator, testing both supervised learning and reinforcement learning (for COS 583: Great Mome…
☆15May 17, 2017Updated 9 years ago
lns / memoire
View on GitHub
☆18Apr 17, 2019Updated 7 years ago
andyliu42 / Counterfactual_Regret_Minimization_Python
View on GitHub
Counterfactual Regret Minimization (CFR) sample code in Python
☆14Apr 16, 2019Updated 7 years ago
inhere / md-site-reader
View on GitHub
a very lightweight markdown docs site reader
☆18Oct 9, 2017Updated 8 years ago
sinagolara / VRP
View on GitHub
Adaptive Heuristic Method Based on SA and LNS for Solving Vehicle Routing Problem
☆13Oct 9, 2017Updated 8 years ago
SUBER-Team / SUBER
View on GitHub
This repository accompanies our research paper titled "An LLM-based Recommender System Environment".
☆17Jul 15, 2024Updated 2 years ago
flint-xf-fan / Byzantine-Federated-RL
View on GitHub
[NeurIPS2021] Federated Reinforcement Learning with Theoretical Guarantees. The repo contains code and experiments for our Federated Poli…
☆106Apr 16, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ermongroup / best-arm-delayed
View on GitHub
Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.
☆20Apr 3, 2018Updated 8 years ago
RobRomijnders / bandit
View on GitHub
Implementation of Counterfactual risk minimization
☆26Apr 13, 2017Updated 9 years ago
facebookresearch / reward-estimator-corl
View on GitHub
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆23Oct 26, 2018Updated 7 years ago
huchunxu / TodoList
View on GitHub
python+django练习，一个简单的todolist
☆11Apr 20, 2014Updated 12 years ago
aliemamalinezhad / machine-learning
View on GitHub
android-malware-classification using machine learning algorithms
☆11Aug 17, 2020Updated 5 years ago
aliemamalinezhad / Android-Malware-Detection
View on GitHub
Android malware classification using both .java files and .so files
☆11Jan 19, 2019Updated 7 years ago
iPhaeton / car_identification
View on GitHub
☆24Jan 25, 2019Updated 7 years ago