kir-gadjello / zipslicer
A library for incremental loading of large PyTorch checkpoints
☆56Updated last year
Alternatives and similar repositories for zipslicer:
Users that are interested in zipslicer are comparing it to the libraries listed below
- Implement recursion using English as the programming language and an LLM as the runtime.☆136Updated last year
- Extensible AI assistant platform that bridges LLMs to tasks and actions☆38Updated last year
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆47Updated last year
- Tiny inference-only implementation of LLaMA☆92Updated 10 months ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆101Updated last year
- A playground to make it easy to try crazy things☆33Updated this week
- ☆40Updated last year
- A fork of llama3.c used to do some R&D on inferencing☆18Updated 2 months ago
- utilities for loading and running text embeddings with onnx☆44Updated 6 months ago
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser?☆89Updated 9 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- ☆126Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆66Updated last year
- ☆34Updated last year
- An easily-trained baby GPT that can stand in for the real thing. Based on Andrej Karpathy's makemore, but set up to mimic a llama-cpp ser…☆27Updated last year
- Generates grammer files from typescript for LLM generation☆36Updated last year
- assign color hues to a collection of text fragments based on embeddings☆20Updated 8 months ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆51Updated 6 months ago
- Embedding models from Jina AI☆58Updated last year
- Image Generation API Server - Similar to https://text-generator.io but for images☆50Updated 2 months ago
- Quantized inference code for LLaMA models☆13Updated last year
- A CLI to manage install and configure llama inference implemenation in multiple languages☆65Updated last year
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or in the GPU via CUDA.☆75Updated 2 years ago
- Revealing example of self-attention, the building block of transformer AI models☆130Updated last year
- Local Startup Advisor Chatbot☆31Updated last year
- tinygrad port of the RWKV large language model.☆44Updated 8 months ago
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Updated last year
- ☆163Updated 8 months ago
- Praetor is a lightweight finetuning data and prompt management tool☆67Updated 3 months ago
- Efficiently computing & storing token n-grams from large corpora☆18Updated 4 months ago