msaroufim / vscode-pytorch-extension
☆12 · Updated 2 years ago
Alternatives and similar repositories for vscode-pytorch-extension
Users interested in vscode-pytorch-extension are comparing it to the repositories listed below.
- ☆97 · Updated last week
- Functional local implementations of main model parallelism approaches ☆96 · Updated 2 years ago
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO ☆62 · Updated 10 months ago
- Solve puzzles. Learn CUDA. ☆64 · Updated last year
- seqax = sequence modeling + JAX ☆165 · Updated 3 weeks ago
- ML/DL Math and Method notes ☆63 · Updated last year
- A puzzle to learn about prompting ☆132 · Updated 2 years ago
- Graph neural networks in JAX. ☆67 · Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆129 · Updated 2 weeks ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r… ☆124 · Updated this week
- Gradient Boosting Reinforcement Learning (GBRL) ☆118 · Updated 3 weeks ago
- ☆45 · Updated last year
- Flax (JAX) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face. ☆22 · Updated 5 months ago
- Machine Learning eXperiment Utilities ☆46 · Updated 2 weeks ago
- Cost-aware hyperparameter tuning algorithm ☆167 · Updated last year
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok … ☆24 · Updated last month
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022) ☆105 · Updated 2 years ago
- ☆98 · Updated last week
- Clean RL implementation using MLX ☆32 · Updated last year
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism. ☆135 · Updated 4 months ago
- JAX implementation of the Llama 2 model ☆219 · Updated last year
- A JAX-based library for building transformers; includes implementations of GPT, Gemma, LLaMA, Mixtral, Whisper, Swin, ViT, and more. ☆290 · Updated 11 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback ☆131 · Updated last year
- Large-scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)* ☆87 · Updated last year
- Minimal (400 LOC) implementation of maximum (multi-node, FSDP) GPT training ☆130 · Updated last year
- A set of Python scripts that make your experience on TPU better ☆54 · Updated last year
- Learn online intrinsic rewards from LLM feedback ☆43 · Updated 7 months ago
- ☆88 · Updated last year
- A reinforcement learning framework based on MLX. ☆235 · Updated 5 months ago
- Simple repository for training small reasoning models ☆32 · Updated 6 months ago