kir-gadjello / zipslicer
A library for incremental loading of large PyTorch checkpoints
☆56 Updated 2 years ago
Alternatives and similar repositories for zipslicer:
Users who are interested in zipslicer are comparing it to the libraries listed below:
- A playground to make it easy to try crazy things ☆33 Updated last week
- Implement recursion using English as the programming language and an LLM as the runtime. ☆137 Updated last year
- A fork of llama3.c used to do some R&D on inferencing ☆19 Updated 3 months ago
- ☆40 Updated last year
- Tiny inference-only implementation of LLaMA ☆92 Updated 11 months ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh ☆48 Updated last year
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering. ☆52 Updated 7 months ago
- utilities for loading and running text embeddings with onnx ☆44 Updated 7 months ago
- A CLI to manage, install, and configure llama inference implementations in multiple languages ☆65 Updated last year
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more. ☆104 Updated last year
- Web browser version of StarCoder.cpp ☆44 Updated last year
- Converts JSON-Schema to GBNF grammar to use with llama.cpp ☆52 Updated last year
- GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation. ☆54 Updated last year
- ☆34 Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 Updated last year
- Image Generation API Server - Similar to https://text-generator.io but for images ☆50 Updated 3 months ago
- ☆126 Updated last year
- Testing various image matching algorithms' performance on the Pinecone vector DB ☆43 Updated last year
- The repository provides code for training the SegmentAnything Model (SAM) for predicting frame polygons in comic books ☆50 Updated last year
- Extensible AI assistant platform that bridges LLMs to tasks and actions ☆38 Updated last year
- ☆30 Updated last year
- tinygrad port of the RWKV large language model. ☆44 Updated 2 weeks ago
- ☆163 Updated 9 months ago
- GGML implementation of BERT model with Python bindings and quantization. ☆56 Updated last year
- A star for organising blocks and playing with transformers. ☆23 Updated 10 months ago
- A web-app to explore topics using LLM (less typing and more clicks) ☆66 Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations ☆33 Updated last year
- Drop in replacement for OpenAI, but with Open models. ☆153 Updated last year
- Gather directory contents into a single upload for ChatGPT ☆22 Updated last year