kir-gadjello / zipslicerLinks
A library for incremental loading of large PyTorch checkpoints
☆56Updated 2 years ago
Alternatives and similar repositories for zipslicer
Users that are interested in zipslicer are comparing it to the libraries listed below
Sorting:
- A playground to make it easy to try crazy things☆33Updated 3 months ago
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- ☆126Updated 2 years ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆104Updated last year
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆53Updated last year
- Implement recursion using English as the programming language and an LLM as the runtime.☆238Updated 2 years ago
- Tiny inference-only implementation of LLaMA☆93Updated last year
- A CLI to manage install and configure llama inference implemenation in multiple languages☆67Updated last year
- ☆254Updated 2 years ago
- Testing various image matching algorithms' performance on the Pinecone vector DB☆43Updated 2 years ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆51Updated last year
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆124Updated 5 months ago
- ☆40Updated 2 years ago
- GPT Takes the Bar Exam☆142Updated 2 years ago
- An easily-trained baby GPT that can stand in for the real thing. Based on Andrej Karpathy's makemore, but set up to mimic a llama-cpp ser…☆28Updated last year
- A JPEG Image Compression Service using Part Homomorphic Encryption.☆31Updated 6 months ago
- utilities for loading and running text embeddings with onnx☆44Updated last month
- WebGPU LLM inference tuned by hand☆151Updated 2 years ago
- ☆35Updated 2 years ago
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or in the GPU via CUDA.☆75Updated 2 years ago
- PILF: A IPWT-inspired bionic continual learning experiment focus on mitigate catastrophic forgetting with Surprise-gated Mixture of Exper…☆36Updated 2 months ago
- Web browser version of StarCoder.cpp☆45Updated 2 years ago
- A star for organising blocks and playing with transformers.☆23Updated last year
- ☆163Updated last year
- Deepmark AI enables a unique testing environment for language models (LLM) assessment on task-specific metrics and on your own data so yo…☆104Updated last year
- Text generator prompting with Boolean operators☆179Updated 3 weeks ago
- Advanced Python Function Debugging with MCP Integration.☆57Updated 3 months ago
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated last year
- Heirarchical Navigable Small Worlds☆101Updated last month
- A web-app to explore topics using LLM (less typing and more clicks)☆67Updated last year