AlexBuz / llama-zip
LLM-powered lossless compression tool
☆274Updated 7 months ago
Alternatives and similar repositories for llama-zip:
Users that are interested in llama-zip are comparing it to the libraries listed below
- llama.cpp fork with additional SOTA quants and improved performance☆217Updated this week
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆148Updated 10 months ago
- Dynamically structure language models to produce outputs that adhere to specific requirements without sacrificing their creative capabili…☆119Updated this week
- ☆81Updated 3 months ago
- Falcon LLM ggml framework with CPU and GPU support☆246Updated last year
- A fast batching API to serve LLM models☆182Updated 10 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆192Updated 10 months ago
- LLaVA server (llama.cpp).☆178Updated last year
- Experimental adventure game with AI-generated content☆109Updated last year
- Fast parallel LLM inference for MLX☆174Updated 8 months ago
- ☆273Updated last month
- Web UI for ExLlamaV2☆487Updated last month
- LLM-based code completion engine☆181Updated 2 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆233Updated 9 months ago
- automatically quant GGUF models☆163Updated this week
- ☆152Updated 8 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆224Updated 10 months ago
- An implementation of bucketMul LLM inference☆215Updated 8 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆71Updated 6 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated last month
- Low-Rank adapter extraction for fine-tuned transformers models☆171Updated 10 months ago
- Mistral7B playing DOOM☆130Updated 8 months ago
- Python bindings for ggml☆140Updated 6 months ago
- Inference of Mamba models in pure C☆186Updated last year
- A multimodal, function calling powered LLM webui.☆215Updated 6 months ago
- Train your own small bitnet model☆65Updated 5 months ago
- GPT-2 small trained on phi-like data☆65Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆237Updated 2 weeks ago
- Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.☆380Updated last year
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…☆59Updated 8 months ago