davidar / eigenGPT
Minimal C++ implementation of GPT2
☆40 Updated 2 years ago
Alternatives and similar repositories for eigenGPT
Users who are interested in eigenGPT are comparing it to the libraries listed below.
- Fast reinforcement learning 💨 ☆28 Updated 3 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C ☆48 Updated 7 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆57 Updated 9 months ago
- Loop Nest - Linear algebra compiler and code generator. ☆21 Updated 3 years ago
- LLM training in simple, raw C/CUDA ☆18 Updated last year
- FlexAttention w/ FlashAttention3 Support ☆27 Updated last year
- A really tiny autograd engine ☆96 Updated 5 months ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection ☆46 Updated 2 years ago
- ☆53 Updated last year
- Exploration into the Firefly algorithm in Pytorch ☆41 Updated 8 months ago
- throwaway GPT inference ☆140 Updated last year
- Clean RL implementation using MLX ☆33 Updated last year
- ☆19 Updated 3 years ago
- Galactic Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second ☆86 Updated 2 years ago
- Implementations of Curious Replay for model-based adaptation. ☆42 Updated 2 years ago
- Experimental scripts for researching data adaptive learning rate scheduling. ☆22 Updated 2 years ago
- ☆51 Updated 3 months ago
- Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without New Demonstrations", from USC / Amazon Robotics ☆35 Updated 2 months ago
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019) ☆37 Updated 2 years ago
- ☆89 Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 Updated 2 years ago
- Multi-agent simulator in Jax for research and teaching in AI & ALife ☆29 Updated 3 weeks ago
- Efficiently send large arrays across machines ☆16 Updated last year
- Tools and Utils for Experiments (TUX) ☆15 Updated 9 months ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024. ☆11 Updated last year
- Make triton easier ☆48 Updated last year
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient" ☆29 Updated 5 years ago
- ☆144 Updated 2 years ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆113 Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" ☆38 Updated 4 months ago