tanmaysachan / splitcomputeLinks
Split model weights and execute partially
☆4Updated last year
Alternatives and similar repositories for splitcompute
Users that are interested in splitcompute are comparing it to the libraries listed below
Sorting:
- Sphynx Hallucination Induction☆53Updated 6 months ago
- ☆213Updated last month
- Storing long contexts in tiny caches with self-study☆124Updated this week
- ☆31Updated 8 months ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆150Updated last month
- A miniature version of Modal☆20Updated last year
- Verbosity control for AI agents☆64Updated last year
- Inference-time scaling for LLMs-as-a-judge.☆272Updated 3 weeks ago
- Long context evaluation for large language models☆220Updated 5 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆95Updated 3 weeks ago
- ☆96Updated 2 weeks ago
- A framework for optimizing DSPy programs with RL☆96Updated this week
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- ☆41Updated 6 months ago
- Stream of my favorite papers and links☆42Updated 4 months ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆37Updated last year
- ☆65Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 6 months ago
- ☆14Updated 3 months ago
- Modded vLLM to run pipeline parallelism over public networks☆37Updated 2 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 9 months ago
- Cerule - A Tiny Mighty Vision Model☆66Updated 11 months ago
- An introduction to LLM Sampling☆79Updated 7 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆45Updated 3 months ago
- Because it's there.☆16Updated 10 months ago
- ☆64Updated last month
- ☆22Updated last year
- ☆130Updated 4 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- ☆47Updated last year