sdan/vlm-gym

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sdan/vlm-gym)

sdan / vlm-gym

RL gym for vision language models written in JAX

☆148

Alternatives and similar repositories for vlm-gym

Users that are interested in vlm-gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

brendanhogan / completion_tree_view
View on GitHub
☆15Apr 26, 2025Updated last year
shahanneda / rust-gs
View on GitHub
Gaussian Splat Viewer made from scratch using Rust and WebGL.
☆18Updated this week
willccbb / localchat
View on GitHub
☆13Apr 16, 2025Updated last year
okarthikb / DPO
View on GitHub
Implementation of Direct Preference Optimization
☆17Jul 17, 2023Updated 2 years ago
ivanfioravanti / asitop
View on GitHub
Perf monitoring CLI tool for Apple Silicon
☆16Jan 1, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
stockeh / mlx-grokking
View on GitHub
Grokking on modular arithmetic in less than 150 epochs in MLX
☆15Oct 24, 2024Updated last year
danielgross / embedland
View on GitHub
A collection of text embedding experiments
☆55Feb 27, 2023Updated 3 years ago
quackduck / bankCLI
View on GitHub
Hack Club Bank CLI
☆10Jul 25, 2022Updated 3 years ago
kywch / brax-trainer
View on GitHub
Brax + Pufferlib + CARBS for gpu-accelerated robotics RL
☆12Jun 12, 2025Updated last year
vwxyzjn / gym-pysc2
View on GitHub
Gym wrapper for pysc2
☆10Sep 16, 2022Updated 3 years ago
jasonappah / wordcounter
View on GitHub
Next.js app to display the word count of given text.
☆11Mar 5, 2023Updated 3 years ago
Farama-Foundation / Procgen-Staging
View on GitHub
Procgen2: A community maintained fork of procgen
☆12Aug 25, 2022Updated 3 years ago
HumanCompatibleAI / interpreting-rewards
View on GitHub
Experiments in applying interpretability techniques to learned reward functions.
☆10Dec 11, 2020Updated 5 years ago
koulanurag / dream-and-search
View on GitHub
Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"
☆12Jul 12, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cosmicoptima / birdwatch
View on GitHub
Twitter scraping library
☆11Nov 11, 2021Updated 4 years ago
devtooligan / MyBeaconChain
View on GitHub
Beacon Chain from scratch
☆44Feb 9, 2023Updated 3 years ago
BingSu12 / Log-Polar-Space-Convolution
View on GitHub
Log-Polar Space Convolution for Convolutional Neural Networks
☆13Dec 12, 2022Updated 3 years ago
smearle / pcgrl-jax
View on GitHub
A JAX-accelerated implementation of the Procedural Content Generation via Reinforcement Learning (PCGRL) framework. We train RL agents to…
☆15Nov 26, 2025Updated 7 months ago
hcmlab / GANterfactual-RL
View on GitHub
Counterfactual explanations for Reinforcement Learning agents on Atari
☆12Apr 3, 2023Updated 3 years ago
yunfeixie233 / ViGaL
View on GitHub
☆70Feb 4, 2026Updated 5 months ago
cicero225 / llm_pokemon_scaffold
View on GitHub
☆34May 31, 2025Updated last year
klightz / splitting
View on GitHub
Offical Repo for Splitting Steepest Descent for Growing Neural Architectures
☆13May 12, 2021Updated 5 years ago
dilinwang820 / fast-energy-aware-splitting
View on GitHub
Energy-Aware Neural Architecture Optimization with Fast Splitting Steepest Descent
☆14Feb 6, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
03ladr / xevm
View on GitHub
evm implementation in rust
☆16May 26, 2022Updated 4 years ago
scarlet0703 / LoRA-Sub-DRS
View on GitHub
Official PyTorch implementation of our CVPR 2025 paper, "LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning."
☆18Mar 28, 2025Updated last year
yz93 / Learn-to-Interpret-Atari-Agents
View on GitHub
☆11Feb 20, 2020Updated 6 years ago
luchris429 / discovered-policy-optimisation
View on GitHub
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆12Jun 15, 2023Updated 3 years ago
hiverge / cifar10-speedrun
View on GitHub
CIFAR-10 speedrun: Trains to 94% accuracy in 1.98 seconds on a single NVIDIA A100 GPU.
☆79Oct 17, 2025Updated 8 months ago
BurakGurbuz97 / NICE
View on GitHub
NICE: Neurogenesis Inspired Contextual Encoding for Replay-free Class Incremental Learning
☆29Jul 28, 2024Updated last year
iKora128 / stop-ai-slop-jp
View on GitHub
日本語の文章からAI臭を取り除く Claude Skill
☆319Jun 11, 2026Updated 3 weeks ago
refcell / evm.rs
View on GitHub
Barebones Rust EVM Implementation
☆12Feb 9, 2022Updated 4 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
zocomputer / selectron
View on GitHub
AI web parser library + CLI
☆49May 5, 2025Updated last year
AnjieCheng / Fast-ImageNet-Dataloader
View on GitHub
A fast data loader for ImageNet on PyTorch.
☆18Mar 17, 2019Updated 7 years ago
jphacks / SD_1702
View on GitHub
☆14Nov 21, 2017Updated 8 years ago
Algomancer / Minimal-Drifting-Models
View on GitHub
A minimal implementation of Drifting Models for 2D toy data. Unlike diffusion/flow models that iterate at inference, drifting models evo…
☆72Feb 13, 2026Updated 4 months ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
bespokelabsai / verifiers
View on GitHub
Verifiers for LLM Reinforcement Learning
☆81Updated this week
kadenzipfel / huff-tools
View on GitHub
A set of tools for use with the huff language.
☆21Jun 24, 2022Updated 4 years ago