waefrebeorn / bytropix
WuBu Nesting Playground, Inspired by XJDR Entropy, Now Hyperbolic Math Focused
☆25Updated 3 weeks ago
Alternatives and similar repositories for bytropix
Users that are interested in bytropix are comparing it to the libraries listed below
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Updated 7 months ago
- ☆40Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆71Updated this week
- ☆57Updated 5 months ago
- ☆28Updated last year
- look how they massacred my boy☆63Updated last year
- Because it's there.☆16Updated last year
- Approximating the joint distribution of language models via MCTS☆22Updated last year
- Simple repository for training small reasoning models☆47Updated 10 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆121Updated 2 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 9 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆108Updated 9 months ago
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆27Updated 6 months ago
- ☆24Updated 6 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- webgpu autograd library☆33Updated 6 months ago
- NanoGPT (124M) quality in 2.67B tokens☆28Updated 3 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 7 months ago
- MoE training for Me and You and maybe other people☆239Updated this week
- Extensive introductory writeup on Zig language functionalities☆10Updated last year
- ☆55Updated last year
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated last week
- EXO Gym is an open-source Python toolkit that facilitates distributed AI research.☆87Updated 2 weeks ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Updated last year
- Samples of good AI generated CUDA kernels☆94Updated 6 months ago
- ☆11Updated last year
- Simple Transformer in Jax☆140Updated last year
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆49Updated last year
- A graph visualization of attention☆57Updated 7 months ago
- working implementation of DeepSeek MLA☆45Updated 11 months ago