kir-gadjello / zipslicerLinks
A library for incremental loading of large PyTorch checkpoints
☆56Updated 2 years ago
Alternatives and similar repositories for zipslicer
Users that are interested in zipslicer are comparing it to the libraries listed below
Sorting:
- A playground to make it easy to try crazy things☆33Updated 3 weeks ago
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆54Updated last year
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆105Updated 2 years ago
- ☆127Updated 2 years ago
- ☆255Updated 2 years ago
- Tokenflood is a load testing framework for simulating arbitary loads on instruction-tuned LLMs☆43Updated last week
- utilities for loading and running text embeddings with onnx☆44Updated 4 months ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆50Updated 2 years ago
- A CLI to manage install and configure llama inference implemenation in multiple languages☆65Updated last year
- Nearly a thousand bash and python scripts I've written over the years.☆124Updated 10 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆125Updated 8 months ago
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated 2 years ago
- Codebase topic modeling using GNNs(Node aggregation and clustering)☆61Updated 2 years ago
- Text generator prompting with Boolean operators☆181Updated last month
- Tiny inference-only implementation of LLaMA☆92Updated last year
- ☆35Updated 2 years ago
- ☆31Updated 2 years ago
- Web browser version of StarCoder.cpp☆45Updated 2 years ago
- Deepmark AI enables a unique testing environment for language models (LLM) assessment on task-specific metrics and on your own data so yo…☆104Updated 2 years ago
- Implement recursion using English as the programming language and an LLM as the runtime.☆237Updated 2 years ago
- Praetor is a lightweight finetuning data and prompt management tool☆67Updated last year
- Testing various image matching algorithms' performance on the Pinecone vector DB☆43Updated 2 years ago
- Test prompts for GPT-J-6B and the resulting AI-generated texts☆53Updated 4 years ago
- A star for organising blocks and playing with transformers.☆23Updated last year
- ☆40Updated 2 years ago
- GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.☆57Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆67Updated last year
- Full finetuning of large language models without large memory requirements☆94Updated 3 months ago
- LLM plugin for clustering embeddings☆82Updated last year