hengyuan-hu / jax-vs-pytorchLinks
☆13Updated 10 months ago
Alternatives and similar repositories for jax-vs-pytorch
Users that are interested in jax-vs-pytorch are comparing it to the libraries listed below
Sorting:
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆28Updated 4 years ago
- ☆10Updated 3 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated 6 months ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Updated last year
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 3 years ago
- An adaptive training algorithm for residual network☆17Updated 5 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆22Updated last year
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models☆47Updated 2 years ago
- [NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts☆62Updated 3 years ago
- ☆15Updated 3 years ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16Updated 2 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆28Updated 4 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆57Updated last year
- Cross-Domain Imitation Learning via Optimal Transport☆25Updated 3 years ago
- ☆14Updated 3 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆25Updated 10 months ago
- Deep Networks Grok All the Time and Here is Why☆38Updated last year
- Generalised UDRL☆37Updated 3 years ago
- Official code for the paper "Attention as a Hypernetwork"☆46Updated last year
- GPT implementation in Flax☆18Updated 3 years ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated 2 years ago
- ☆44Updated last year
- ☆12Updated last year
- ☆52Updated last year
- Code for ICML2021 paper 'Commutative Lie Group VAE for Disentanglement Learning'.☆23Updated 3 years ago
- Code associated with our paper "Learning Group Structure and Disentangled Representations of Dynamical Environments"☆15Updated 3 years ago
- Neural Optimal Transport with Lagrangian Costs☆60Updated 7 months ago
- Meta Optimal Transport☆105Updated 2 years ago
- Beyond Straight-Through☆105Updated 2 years ago