kir-gadjello / zipslicerLinks
A library for incremental loading of large PyTorch checkpoints
☆56Updated 2 years ago
Alternatives and similar repositories for zipslicer
Users that are interested in zipslicer are comparing it to the libraries listed below
Sorting:
- A playground to make it easy to try crazy things☆33Updated last week
- ☆40Updated 2 years ago
- ☆126Updated 2 years ago
- Tiny inference-only implementation of LLaMA☆93Updated last year
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Updated last year
- ☆35Updated 2 years ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆49Updated last year
- Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore☆64Updated this week
- GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.☆58Updated last year
- A fork of llama3.c used to do some R&D on inferencing☆22Updated 6 months ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆105Updated last year
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 2 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- Extensible AI assistant platform that bridges LLMs to tasks and actions☆38Updated 2 years ago
- Web browser version of StarCoder.cpp☆45Updated last year
- A star for organising blocks and playing with transformers.☆23Updated last year
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated last year
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆52Updated 10 months ago
- utilities for loading and running text embeddings with onnx☆44Updated 10 months ago
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or in the GPU via CUDA.☆75Updated 2 years ago
- Plug n Play GBNF Compiler for llama.cpp☆25Updated last year
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Experimental fork of Facebooks LLaMa model which runs it with GPU acceleration on Apple Silicon M1/M2☆86Updated last year
- Praetor is a lightweight finetuning data and prompt management tool☆67Updated 7 months ago
- A super simple web interface to perform blind tests on LLM outputs.☆28Updated last year
- A CLI to manage install and configure llama inference implemenation in multiple languages☆67Updated last year
- Testing various image matching algorithms' performance on the Pinecone vector DB☆43Updated last year
- Grow virtual creatures in static and physics simulated environments.☆53Updated last year
- Generates grammer files from typescript for LLM generation☆38Updated last year