hkproj / kan-notes
☆19Updated 11 months ago
Alternatives and similar repositories for kan-notes:
Users that are interested in kan-notes are comparing it to the libraries listed below
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆116Updated 11 months ago
- Visualizing some of the internals of a neural network during training and inference.☆75Updated last year
- documentation for content creation☆194Updated 2 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆121Updated 2 weeks ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆101Updated last year
- Variations of Kolmogorov-Arnold Networks☆114Updated 11 months ago
- From scratch implementation of a vision language model in pure PyTorch☆213Updated 11 months ago
- SaLSa Optimizer implementation (No learning rates needed)☆29Updated 2 weeks ago
- Collection of tests performed during the study of the new Kolmogorov-Arnold Neural Networks (KAN)☆40Updated 2 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆90Updated 6 months ago
- Notebooks for fine tuning pali gemma☆100Updated last week
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆130Updated 11 months ago
- This repository contains a better implementation of Kolmogorov-Arnold networks☆61Updated 11 months ago
- ☆129Updated 8 months ago
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆112Updated 6 months ago
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆370Updated 11 months ago
- ☆43Updated this week
- making the official triton tutorials actually comprehensible☆26Updated last month
- Getting crystal-like representations with harmonic loss☆182Updated 3 weeks ago
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆178Updated 5 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 10 months ago
- ☆45Updated 3 weeks ago
- Rebuild the Stable Diffusion Model in a single python script. Tutorial for Harvard ML from Scratch Series☆204Updated 3 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆162Updated this week
- The boundary of neural network trainability is fractal☆198Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆42Updated 11 months ago
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆269Updated last year
- Reference implementation of Mistral AI 7B v0.1 model.☆28Updated last year
- a simplified version of Meta's Llama 3 model to be used for learning☆41Updated 11 months ago
- This is the code that went into our practical dive using mamba as information extraction☆54Updated last year