catswe / LinearKANLinks
LinearKAN: A very fast implementation of Kolmogorov-Arnold Networks
☆18Updated 2 months ago
Alternatives and similar repositories for LinearKAN
Users that are interested in LinearKAN are comparing it to the libraries listed below
Sorting:
- all the materials for cs140e winter 2026☆33Updated this week
- making the official triton tutorials actually comprehensible☆111Updated 5 months ago
- a Jax quantization library☆90Updated this week
- Dion optimizer algorithm☆431Updated 3 weeks ago
- ☆544Updated 6 months ago
- 🧱 Modula software package☆322Updated 5 months ago
- ☆291Updated last year
- ☆236Updated last year
- ☆89Updated 2 months ago
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆843Updated 2 weeks ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆198Updated 8 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆306Updated last year
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆86Updated 4 months ago
- BabyTorch is a minimalist deep-learning framework with a similar API to PyTorch. This minimalist design encourages learners explore and u…☆26Updated 8 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆352Updated 2 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆457Updated 11 months ago
- ☆16Updated last year
- Solve puzzles to improve your tinygrad skills!☆178Updated 3 months ago
- Learning about CUDA by writing PTX code.☆152Updated last year
- ☆492Updated last year
- Simple and readable code for training and sampling from diffusion models☆696Updated 7 months ago
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI☆293Updated 8 months ago
- ☆246Updated last year
- Minimal yet performant LLM examples in pure JAX☆240Updated 3 weeks ago
- coding CUDA everyday!☆73Updated this week
- Official Implementation of Dynamic erf (Derf).☆127Updated last month
- For optimization algorithm research and development.☆558Updated 3 weeks ago
- ☆28Updated 4 months ago
- GPU Kernels☆220Updated 9 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆276Updated last year