kir-gadjello / zipslicerLinks

A library for incremental loading of large PyTorch checkpoints

☆56

Alternatives and similar repositories for zipslicer

Users that are interested in zipslicer are comparing it to the libraries listed below

Sorting:

jmward01 / lmplay
A playground to make it easy to try crazy things
☆33Updated last month
fullthom / chat-gpt-quine
☆126Updated 2 years ago
jostmey / NakedAttention
Revealing example of self-attention, the building block of transformer AI models
☆131Updated 2 years ago
taylorai / onnx_embedding_models
utilities for loading and running text embeddings with onnx
☆44Updated last year
valine / training-hot-swap
Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆126Updated 3 months ago
AugmendTech / treeseg
Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.
☆53Updated last year
Futrell / ziplm
☆252Updated 2 years ago
xetdata / onnx-models
A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.
☆105Updated last year
recmo / cria
Tiny inference-only implementation of LLaMA
☆93Updated last year
rayking99 / BlockStar
A star for organising blocks and playing with transformers.
☆23Updated last year
andyk / recursive_llm
Implement recursion using English as the programming language and an LLM as the runtime.
☆239Updated 2 years ago
FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆66Updated last year
closedai-project / closedai
Drop in replacement for OpenAI, but with Open models.
☆152Updated 2 years ago
lachlansneff / sparsellama
☆40Updated 2 years ago
goodreasonai / praetor-data
Praetor is a lightweight finetuning data and prompt management tool
☆67Updated 8 months ago
umuthopeyildirim / DOOM-Mistral
Mistral7B playing DOOM
☆133Updated last year
kayvr / token-hawk
WebGPU LLM inference tuned by hand
☆151Updated 2 years ago
charstorm / llmbinge
A web-app to explore topics using LLM (less typing and more clicks)
☆67Updated last year
carsonpo / haystackdb
☆163Updated last year
danielpatrickhug / GitModel
Codebase topic modeling using GNNs(Node aggregation and clustering)
☆61Updated 2 years ago
spirobel / bunny-llama
iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh
☆50Updated last year
Const-me / Cgml
GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.
☆58Updated last year
mikepapadim / llama-shepherd-cli
A CLI to manage install and configure llama inference implemenation in multiple languages
☆67Updated last year
jeffbinder / promptarray
Text generator prompting with Boolean operators
☆178Updated 2 years ago
Dicklesworthstone / llm_introspective_compression_and_metacognition
A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…
☆22Updated 4 months ago
adrienbrault / json-schema-to-gbnf
Converts JSON-Schema to GBNF grammar to use with llama.cpp
☆55Updated last year
IntrinsicLabsAI / grammar-builder
Generates grammer files from typescript for LLM generation
☆38Updated last year
dmf-archive / PILF
PILF: A IPWT-inspired bionic continual learning experiment focus on mitigate catastrophic forgetting with Surprise-gated Mixture of Exper…
☆36Updated 3 weeks ago
geov-ai / geov
The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121Updated 2 years ago
dmvaldman / html_semantic_seg
Tool to create a dataset of semantic segmentation on website screenshots from their DOM
☆89Updated 2 years ago