juraam / snake-diffusion
Implementation snake game based on Diffusion model
☆90Updated 3 months ago
Alternatives and similar repositories for snake-diffusion:
Users that are interested in snake-diffusion are comparing it to the libraries listed below
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆159Updated 3 months ago
- Documented and Unit Tested educational Deep Learning framework with Autograd from scratch.☆110Updated last year
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆33Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆96Updated last month
- Focused on fast experimentation and simplicity☆71Updated 3 months ago
- working implimention of deepseek MLA☆40Updated 3 months ago
- ☆142Updated last month
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆42Updated 10 months ago
- ☆129Updated 8 months ago
- σ-GPT: A New Approach to Autoregressive Models☆62Updated 8 months ago
- realtime latent world model inference demo☆44Updated 5 months ago
- Teaching transformers to play chess☆121Updated 2 months ago
- Getting crystal-like representations with harmonic loss☆182Updated 2 weeks ago
- Video+code lecture on building nanoGPT from scratch☆66Updated 10 months ago
- ☆30Updated 6 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆61Updated 2 weeks ago
- Collection of autoregressive model implementation☆85Updated 2 months ago
- An introduction to LLM Sampling☆77Updated 4 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated 2 months ago
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆116Updated 10 months ago
- ☆49Updated last year
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI☆279Updated last month
- ☆77Updated 9 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆180Updated 7 months ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆21Updated 2 months ago
- ☆94Updated 3 months ago
- ☆44Updated last month
- look how they massacred my boy☆63Updated 6 months ago
- Torch-activation, a collection of activation functions for PyTorch library☆24Updated 3 weeks ago