kir-gadjello / zipslicer
A library for incremental loading of large PyTorch checkpoints
☆56Updated 2 years ago
Alternatives and similar repositories for zipslicer
Users who are interested in zipslicer are comparing it to the libraries listed below.
- A playground to make it easy to try crazy things☆33Updated 2 weeks ago
- Revealing example of self-attention, the building block of transformer AI models☆130Updated 2 years ago
- ☆127Updated 2 years ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆53Updated last year
- Tiny inference-only implementation of LLaMA☆92Updated last year
- PyTorch script hot swap: change code without unloading your LLM from VRAM☆124Updated 6 months ago
- ☆254Updated 2 years ago
- ☆40Updated 2 years ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆104Updated last year
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or in the GPU via CUDA.☆74Updated 2 years ago
- Run AI models anywhere. https://muna.ai/explore☆68Updated this week
- WebGPU LLM inference tuned by hand☆150Updated 2 years ago
- Implement recursion using English as the programming language and an LLM as the runtime.☆236Updated 2 years ago
- Testing various image matching algorithms' performance on the Pinecone vector DB☆43Updated 2 years ago
- Utilities for loading and running text embeddings with ONNX☆44Updated 2 months ago
- A CLI to manage, install, and configure llama inference implementations in multiple languages☆65Updated last year
- Python notebook to run OpenAI's Whisper model with speaker identification☆80Updated 2 years ago
- A star for organising blocks and playing with transformers.☆23Updated last year
- ☆35Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated last year
- Nearly a thousand bash and python scripts I've written over the years.☆123Updated 8 months ago
- Praetor is a lightweight finetuning data and prompt management tool☆67Updated 11 months ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆74Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆99Updated 2 years ago
- Tool to create a dataset of semantic segmentation on website screenshots from their DOM☆89Updated 2 years ago
- Enforce structured output from LLMs 100% of the time☆248Updated last year
- GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.☆58Updated last year
- The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- GPT Takes the Bar Exam☆142Updated 2 years ago
- Iterate quickly with llama.cpp hot reloading. Use the llama.cpp bindings with bun.sh☆50Updated last year