silphendio / sliced_llama
Simple LLM inference server
☆20 Updated last year
Alternatives and similar repositories for sliced_llama
Users who are interested in sliced_llama are comparing it to the libraries listed below.
- Experimental sampler to make LLMs more creative ☆31 Updated 2 years ago
- Yet Another (LLM) Web UI, made with Gemini ☆12 Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆52 Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆17 Updated last year
- 5X faster 60% less memory QLoRA finetuning ☆21 Updated last year
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io ☆39 Updated this week
- implementation of https://arxiv.org/pdf/2312.09299 ☆21 Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆43 Updated last year
- BH hackathon ☆14 Updated last year
- Cog wrapper for collabora/WhisperSpeech ☆25 Updated last year
- ☆40 Updated last year
- ☆27 Updated 2 years ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have" ☆47 Updated last year
- ☆62 Updated 6 months ago
- Port of Facebook's LLaMA model in C/C++ ☆22 Updated 2 years ago
- Modified Beam Search with periodical restart ☆12 Updated last year
- Attend - to what matters. ☆17 Updated 10 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆37 Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆25 Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated last year
- run ollama & gguf easily with a single command ☆52 Updated last year
- Gradio UI for a Cog API ☆71 Updated last year
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI … ☆56 Updated 11 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G… ☆25 Updated 9 months ago
- ☆68 Updated last year
- OpenAI GPT hosted Agent Framework for Windows and MacOS ☆36 Updated last year
- LLM Chat is an open-source serverless alternative to ChatGPT. ☆35 Updated last year
- Apps that run on modal.com ☆12 Updated 3 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes. ☆38 Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆45 Updated 2 years ago