oKatanaaa / lima-guiLinks
A simple GUI utility for gathering LIMA-like chat data.
☆22Updated last month
Alternatives and similar repositories for lima-gui
Users that are interested in lima-gui are comparing it to the libraries listed below
Sorting:
- 5X faster 60% less memory QLoRA finetuning☆21Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆178Updated last year
- ☆67Updated last year
- ☆31Updated last year
- ☆116Updated 11 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆111Updated last year
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆145Updated last month
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated last year
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆84Updated last year
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆44Updated 3 weeks ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆98Updated 4 months ago
- entropix style sampling + GUI☆27Updated last year
- Distributed Inference for mlx LLm☆99Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆145Updated 9 months ago
- Complex RAG backend☆29Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- Simple examples using Argilla tools to build AI☆56Updated last year
- Easily view and modify JSON datasets for large language models☆84Updated 6 months ago
- Let's create synthetic textbooks together :)☆75Updated last year
- The DPAB-α Benchmark☆30Updated 10 months ago
- run ollama & gguf easily with a single command☆52Updated last year
- Experimental sampler to make LLMs more creative☆31Updated 2 years ago
- A high performance batching router optimises max throughput for text inference workload☆16Updated 2 years ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- Enhancing LLMs with LoRA☆176Updated 3 weeks ago
- ☆24Updated 9 months ago
- ☆62Updated 4 months ago
- An unsupervised model merging algorithm for Transformers-based language models.☆106Updated last year
- ☆49Updated last year
- ☆163Updated 3 months ago