hitorilabs / naviLinks
compute, storage, and networking infra at home
☆65Updated last year
Alternatives and similar repositories for navi
Users that are interested in navi are comparing it to the libraries listed below
Sorting:
- Simple Transformer in Jax☆139Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated last year
- Simplex Random Feature attention, in PyTorch☆74Updated last year
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆156Updated 2 years ago
- papers.day☆91Updated last year
- Stream of my favorite papers and links☆42Updated 5 months ago
- A really tiny autograd engine☆95Updated 3 months ago
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge)☆46Updated last year
- Cerule - A Tiny Mighty Vision Model☆66Updated 11 months ago
- inference code for mixtral-8x7b-32kseqlen☆101Updated last year
- Helpers and such for working with Lambda Cloud☆51Updated last year
- Run GGML models with Kubernetes.☆174Updated last year
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆194Updated last year
- ☆144Updated 2 years ago
- ☆22Updated 2 years ago
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames☆24Updated last year
- ☆61Updated last year
- ☆39Updated last year
- A graph visualization of attention☆57Updated 3 months ago
- PageRank for LLMs☆44Updated 4 months ago
- ☆166Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Full finetuning of large language models without large memory requirements☆94Updated last year
- look how they massacred my boy☆64Updated 10 months ago
- Chat Markup Language conversation library☆55Updated last year
- Tools to help me easily add scripts to make my unix workflow faster☆39Updated 4 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- run paligemma in real time☆131Updated last year
- An introduction to LLM Sampling☆79Updated 8 months ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆100Updated 2 years ago