davidar / eigenGPTLinks
Minimal C++ implementation of GPT2
☆40Updated 2 years ago
Alternatives and similar repositories for eigenGPT
Users that are interested in eigenGPT are comparing it to the libraries listed below
Sorting:
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆45Updated 2 years ago
- Exploration into the Firefly algorithm in Pytorch☆40Updated 6 months ago
- ☆53Updated last year
- Multi-agent simulator in Jax for research and teaching in AI & ALife☆29Updated this week
- OMNI: Open-endedness via Models of human Notions of Interestingness☆55Updated 7 months ago
- Make triton easier☆47Updated last year
- Clean RL implementation using MLX☆32Updated last year
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- LLM training in simple, raw C/CUDA☆18Updated last year
- Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention.☆26Updated 9 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- Fast reinforcement learning 💨☆26Updated last month
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆122Updated 3 weeks ago
- GPT implementation in Flax☆18Updated 3 years ago
- ☆44Updated last month
- Lightweight Llama 3 8B Inference Engine in CUDA C☆48Updated 5 months ago
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)☆37Updated 2 years ago
- Utilities for Training Very Large Models☆58Updated 11 months ago
- throwaway GPT inference☆140Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Implementations of Curious Replay for model-based adaptation.☆41Updated 2 years ago
- Efficiently send large arrays across machines☆16Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Updated 10 months ago
- Code for "Goal-Guided Neural Cellular Automata: Learning to Control Self-Organising Systems"☆56Updated 3 years ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆107Updated 11 months ago
- FastFeedForward Networks☆20Updated last year
- A really tiny autograd engine☆95Updated 3 months ago
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆18Updated last year
- Inference of Mamba models in pure C☆191Updated last year
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆29Updated 5 years ago