kir-gadjello / zipslicerLinks
A library for incremental loading of large PyTorch checkpoints
☆56Updated 2 years ago
Alternatives and similar repositories for zipslicer
Users that are interested in zipslicer are comparing it to the libraries listed below
Sorting:
- A playground to make it easy to try crazy things☆33Updated last month
- ☆126Updated 2 years ago
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- utilities for loading and running text embeddings with onnx☆44Updated last year
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 3 months ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆53Updated last year
- ☆252Updated 2 years ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆105Updated last year
- Tiny inference-only implementation of LLaMA☆93Updated last year
- A star for organising blocks and playing with transformers.☆23Updated last year
- Implement recursion using English as the programming language and an LLM as the runtime.☆239Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- Drop in replacement for OpenAI, but with Open models.☆152Updated 2 years ago
- ☆40Updated 2 years ago
- Praetor is a lightweight finetuning data and prompt management tool☆67Updated 8 months ago
- Mistral7B playing DOOM☆133Updated last year
- WebGPU LLM inference tuned by hand☆151Updated 2 years ago
- A web-app to explore topics using LLM (less typing and more clicks)☆67Updated last year
- ☆163Updated last year
- Codebase topic modeling using GNNs(Node aggregation and clustering)☆61Updated 2 years ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆50Updated last year
- GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.☆58Updated last year
- A CLI to manage install and configure llama inference implemenation in multiple languages☆67Updated last year
- Text generator prompting with Boolean operators☆178Updated 2 years ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆22Updated 4 months ago
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated last year
- Generates grammer files from typescript for LLM generation☆38Updated last year
- PILF: A IPWT-inspired bionic continual learning experiment focus on mitigate catastrophic forgetting with Surprise-gated Mixture of Exper…☆36Updated 3 weeks ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Tool to create a dataset of semantic segmentation on website screenshots from their DOM☆89Updated 2 years ago