tanmaysachan / splitcompute
Split model weights and execute partially
☆2Updated last month
Related projects: ⓘ
- Question answering on codebase☆22Updated 3 months ago
- Stream of my favorite papers and links☆34Updated 2 weeks ago
- Using modal.com to process FineWeb-edu data☆18Updated 2 weeks ago
- A miniature version of Modal☆18Updated 3 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆23Updated 10 months ago
- ☆17Updated last month
- ☆58Updated 3 weeks ago
- NLP with Rust for Python 🦀🐍☆57Updated 3 months ago
- Tools to make language models a bit easier to use☆22Updated last week
- Simple orchestration for EC2 spot containers☆19Updated last week
- Sphynx Hallucination Induction☆44Updated last month
- utilities for loading and running text embeddings with onnx☆39Updated last month
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆18Updated 5 months ago
- Verbosity control for AI agents☆55Updated 3 months ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆42Updated last year
- ☆48Updated 11 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- Vector Database with support for late interaction and token level embeddings.☆51Updated last week
- ☆38Updated this week
- A new benchmark for measuring LLM's capability to detect bugs in large codebase.☆23Updated 3 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆18Updated 2 months ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆63Updated last week
- Simple Transformer in Jax☆100Updated 2 months ago
- alternative way to calculating self attention☆18Updated 3 months ago
- An attribution library for LLMs☆31Updated this week
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆42Updated 7 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated last month
- Retrieve the source code for any model made available on replicate.com!☆33Updated 7 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆26Updated last year
- Cerule - A Tiny Mighty Vision Model☆67Updated 2 weeks ago