davidar / eigenGPT
Minimal C++ implementation of GPT2
☆40Updated last year
Alternatives and similar repositories for eigenGPT:
Users that are interested in eigenGPT are comparing it to the libraries listed below
- throwaway GPT inference☆140Updated 8 months ago
- Jax like function transformation engine but micro, microjax☆30Updated 3 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆42Updated 3 weeks ago
- LLM training in simple, raw C/CUDA☆18Updated 9 months ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated last year
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆31Updated 3 years ago
- ☆32Updated 8 months ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Official repository for the paper "Automating Continual Learning"☆12Updated 10 months ago
- Efficiently send large arrays across machines☆15Updated 6 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆26Updated 3 months ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆11Updated 8 months ago
- ☆30Updated 2 months ago
- A really tiny autograd engine☆89Updated 10 months ago
- Implementation of BC-IRL and other IRL baselines☆25Updated last year
- Learn online intrinsic rewards from LLM feedback☆34Updated 2 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- ☆70Updated 5 months ago
- Fast modular code to create and train cutting edge LLMs☆65Updated 9 months ago
- GPT implementation in Flax☆18Updated 3 years ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆36Updated last year
- A fork of llama3.c used to do some R&D on inferencing☆18Updated last month
- Exploration into the Firefly algorithm in Pytorch☆35Updated this week
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆92Updated 4 months ago
- FlexAttention w/ FlashAttention3 Support☆26Updated 4 months ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated last month
- Make triton easier☆44Updated 8 months ago
- Galactic Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second☆83Updated last year
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆65Updated 5 months ago