tanmaysachan / splitcomputeLinks
Split model weights and execute partially
☆4Updated 10 months ago
Alternatives and similar repositories for splitcompute
Users that are interested in splitcompute are comparing it to the libraries listed below
Sorting:
- Stream of my favorite papers and links☆41Updated 2 months ago
- Because it's there.☆16Updated 8 months ago
- Verbosity control for AI agents☆63Updated last year
- SIMD quantization kernels☆70Updated this week
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆74Updated last week
- ☆59Updated 2 weeks ago
- A miniature version of Modal☆20Updated 11 months ago
- ☆29Updated 6 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆29Updated last month
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆58Updated 2 weeks ago
- Using modal.com to process FineWeb-edu data☆20Updated 2 months ago
- ☆38Updated 10 months ago
- Approximating the joint distribution of language models via MCTS☆21Updated 7 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 3 months ago
- look how they massacred my boy☆63Updated 7 months ago
- Sphynx Hallucination Induction☆54Updated 4 months ago
- rl from zero pretrain, can it be done? we'll see.☆24Updated this week
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated last year
- Simple repository for training small reasoning models☆31Updated 4 months ago
- Benchmark structured generation libraries☆27Updated 7 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆38Updated last month
- Lightweight tools for quick and easy LLM demo's☆27Updated 8 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆66Updated last month
- Compiling useful links, papers, benchmarks, ideas, etc.☆46Updated 2 months ago
- ☆20Updated 2 months ago
- Latent Large Language Models☆18Updated 9 months ago
- NLP with Rust for Python 🦀🐍☆62Updated 3 weeks ago
- Official repo for Learning to Reason for Long-Form Story Generation☆60Updated last month
- Simple orchestration for EC2 spot containers☆19Updated 8 months ago
- ☆48Updated last year