MaxRobinsonTheGreat / mandelbrotnn
Torturing neural networks by forcing them to learn the Mandelbrot set.
☆122Updated last year
Related projects: ⓘ
- ☆65Updated last year
- A simplistic linear and multiprocessed approach to sentiment analysis using Gzip Normalized Compression Distances with k nearest neighbor…☆142Updated last year
- Visualizing some of the internals of a neural network during training and inference.☆69Updated 7 months ago
- The boundary of neural network trainability is fractal☆155Updated 7 months ago
- Material for the Systems and Cognitive NeuroScience online course☆115Updated 2 years ago
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆105Updated 3 months ago
- This repository's goal is to precompile all past presentations of the Huggingface reading group☆42Updated 3 weeks ago
- Code used in creating YouTube videos☆88Updated last year
- Some helpers and examples for creating an LLM fine-tuning dataset☆60Updated 6 months ago
- All of the code shown in my YouTube tutorials. Files are grouped by Playlist, then Video Title.☆29Updated 4 months ago
- ☆159Updated 5 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆72Updated last year
- A package for defining deep learning models using categorical algebraic expressions.☆53Updated last month
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆100Updated 2 years ago
- ☆104Updated 2 months ago
- Repo where I recreate some popular machine learning models from scratch in Python☆88Updated last week
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆159Updated last year
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆488Updated 2 months ago
- ☆130Updated 10 months ago
- Reweight GPT - a simple neural network using transformer architecture for next character prediction☆48Updated last year
- ☆16Updated 4 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆106Updated last week
- GPT-2 (124M) quality in 5B tokens☆227Updated last week
- This is the source code for the animations in the series "Visualizing Deep Learning"☆190Updated 3 months ago
- ☆32Updated 2 months ago
- Machine Learning library for educational purpose.☆275Updated 3 months ago
- ☆37Updated 11 months ago
- A c/c++ implementation of micrograd: a tiny autograd engine with neural net on top.☆60Updated last year
- customizable template GPT code designed for easy novel architecture experimentation☆23Updated this week
- ☆28Updated 3 months ago