maxreciprocate/offline

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/maxreciprocate/offline)

maxreciprocate / offline

Offline RL experiments

☆15

Alternatives and similar repositories for offline

Users that are interested in offline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mansimov / chatgpt_cli
View on GitHub
Lightweight wrapper of the official ChatGPT API in your terminal
☆42Mar 10, 2023Updated 3 years ago
CarperAI / Algorithm-Distillation-RLHF
View on GitHub
☆35Jan 29, 2023Updated 3 years ago
vikashplus / unitree_sim
View on GitHub
MuJoCo models for Unitree Robots
☆12Nov 24, 2021Updated 4 years ago
Asap7772 / PTR
View on GitHub
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…
☆32Oct 26, 2022Updated 3 years ago
XueFuzhao / HowToRunScenic
View on GitHub
☆14Nov 28, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
young-geng / SimpleSAC
View on GitHub
A simple and easy to use implementation of the soft actor-critic algorithm.
☆15Sep 2, 2022Updated 3 years ago
brentyi / transformer-exercises-jax
View on GitHub
☆18Apr 17, 2026Updated 3 months ago
JusperLee / Look2hear
View on GitHub
A toolkit for researchers in the multimodal sound separation.
☆16Oct 20, 2023Updated 2 years ago
henry-prior / multimodal-rl
View on GitHub
Solving reinforcement learning tasks which require language and vision
☆33Apr 4, 2023Updated 3 years ago
stas00 / python-tools
View on GitHub
Python tools
☆14Oct 22, 2023Updated 2 years ago
kevinzakka / dm_env_wrappers
View on GitHub
Standalone library of frequently-used wrappers for dm_env environments.
☆19Jul 9, 2024Updated 2 years ago
rovle / gpt3-in-context-fitting
View on GitHub
Experiments on GPT-3's ability to fit numerical models in-context.
☆14Aug 11, 2022Updated 3 years ago
maxwells-daemons / caltech-cs11-tensorflow
View on GitHub
Repository for a TensorFlow class I taught at Caltech.
☆14Jan 9, 2020Updated 6 years ago
yobibyte / amorpheus
View on GitHub
My Body Is A Cage
☆41Apr 13, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ikostrikov / dmcgym
View on GitHub
☆23Aug 19, 2022Updated 3 years ago
LHAC / dac
View on GitHub
Distributed lbfgs on Apache Spark
☆10Sep 25, 2020Updated 5 years ago
sisl / pomdpland
View on GitHub
A tour of Pomdpland
☆10Aug 10, 2022Updated 3 years ago
Sea-Snell / MLLibCpp
View on GitHub
A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, ect...) on a GPU. It makes use of auto-di…
☆10Aug 28, 2018Updated 7 years ago
TheDuckAI / prm
View on GitHub
☆12Jan 17, 2025Updated last year
yobibyte / iclr-viewer
View on GitHub
Go through the list of accepted papers for ICLR in terminal and add them to your reading list.
☆13Jan 30, 2021Updated 5 years ago
scottemmons / rvs
View on GitHub
Reinforcement Learning via Supervised Learning
☆72May 16, 2022Updated 4 years ago
Dahoas / QDSyntheticData
View on GitHub
☆14Aug 15, 2024Updated last year
EleutherAI / radioactive-lab
View on GitHub
Adapting the "Radioactive Data" paper to work for text models
☆13Dec 23, 2020Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
yobibyte / unitary-scalarization-dmtl
View on GitHub
In Defense of the Unitary Scalarization for Deep Multi-Task Learning
☆22Mar 8, 2023Updated 3 years ago
thkkk / FCNet
View on GitHub
Fourier Controller Networks (FCNet) for Real-Time Decision-Making in Embodied Learning, ICML 2024
☆32Jan 2, 2025Updated last year
Zeta611 / golpy
View on GitHub
Efficient Conway's Game of Life implemented in Python using NumPy.
☆14May 1, 2024Updated 2 years ago
prajjwal1 / rl_paradigm
View on GitHub
Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"
☆17Jan 31, 2024Updated 2 years ago
halcy / tpuddim
View on GitHub
☆22May 3, 2022Updated 4 years ago
HarleyCoops / smolThinker-.5B
View on GitHub
A Qwen .5B reasoning model trained on OpenR1-Math-220k
☆14Updated this week
itaicaspi / inception-v4.torch
View on GitHub
GoogLeNet Inception arhitecture v4 implementation on torch
☆11Mar 18, 2016Updated 10 years ago
thu-ml / Efficient-Diffusion-Alignment
View on GitHub
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆15Oct 29, 2024Updated last year
kngwyu / mujoco-maze
View on GitHub
Simple maze environments using mujoco-py
☆61Dec 27, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
wut0n9 / Wechat_Stat
View on GitHub
统计微信朋友圈送出的赞票与得到的赞票人员比例
☆11May 3, 2016Updated 10 years ago
zhaoyi11 / adaptive_bc
View on GitHub
☆15Jul 4, 2022Updated 4 years ago
facebookresearch / learning-audio-visual-dereverberation
View on GitHub
Code for paper Learning Audio-Visual Dereverberation
☆32Aug 10, 2022Updated 3 years ago
soumith / mltrain-nips-2017
View on GitHub
This repository contains all the material for the MLTrain NIPS workshop
☆10Dec 9, 2017Updated 8 years ago
JusperLee / DANet-For-Speech-Separation
View on GitHub
Pytorch implement of DANet For Speech Separation
☆21Jan 9, 2020Updated 6 years ago
ryanxhr / BEAR
View on GitHub
Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11Oct 29, 2019Updated 6 years ago
young-geng / mlxu
View on GitHub
Machine Learning eXperiment Utilities
☆48Jul 29, 2025Updated 11 months ago