tanmaysachan / splitcomputeLinks

Split model weights and execute partially

☆4

Alternatives and similar repositories for splitcompute

Users that are interested in splitcompute are comparing it to the libraries listed below

Sorting:

BBischof / yapping
Verbosity control for AI agents
☆64Updated last year
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32Updated 4 months ago
haizelabs / sphynx
Sphynx Hallucination Induction
☆53Updated 5 months ago
Ziems / arbor
A framework for optimizing DSPy programs with RL
☆89Updated this week
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20Updated 3 months ago
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆45Updated 4 months ago
brendanhogan / picoDeepResearch
☆64Updated last month
QasimWani / gct
Graphical Code Tracer (GCT): Visualize code at lightning speed
☆53Updated last year
PrimeIntellect-ai / prime-cli
The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers
☆29Updated last month
teknium1 / transformers-gptq-quant
☆47Updated last year
doomslide / autoloom
Approximating the joint distribution of language models via MCTS
☆21Updated 8 months ago
omkaark / spotty
Simple orchestration for EC2 spot containers
☆19Updated 9 months ago
jlewi / foyle
Foyle is a copilot to help developers deploy and operate their applications.
☆131Updated 4 months ago
ishan0102 / rsrch.space
Stream of my favorite papers and links
☆42Updated 3 months ago
charlesfrye / cuda-substrings
Because it's there.
☆16Updated 9 months ago
xjdr-alt / muzero_sketch
☆38Updated 11 months ago
DeployQL / LintDB
Vector Database with support for late interaction and token level embeddings.
☆55Updated 3 weeks ago
philipp-eisen / modal-mcp-toolbox
A collection of tools for your LLMs that run on Modal
☆21Updated 4 months ago
Nearcyan / papers.day
papers.day
☆91Updated last year
ivanleomk / modal-grpo
☆20Updated 4 months ago
MaximeRivest / funnydspy
Vanilla-Python ergonomics on top of DSPy
☆33Updated last month
kyegomez / Exa
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…
☆26Updated 8 months ago
virevolai / logos-shift-client
Replace expensive LLM calls with finetunes automatically
☆65Updated last year
Extensible-AI / DAGent
Build AI Agents with Your Existing Python Code!
☆61Updated 8 months ago
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆63Updated 2 months ago
Columbia-NLP-Lab / PAPILLON
Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles
☆52Updated 2 months ago
charlesfrye / minimodal
A miniature version of Modal
☆20Updated last year
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆91Updated last month
taylorai / onnx_embedding_models
utilities for loading and running text embeddings with onnx
☆44Updated 11 months ago
PrimeIntellect-ai / pi-quant
SIMD quantization kernels
☆73Updated this week