silphendio / sliced_llama
Simple LLM inference server
☆20Updated 9 months ago
Alternatives and similar repositories for sliced_llama:
Users that are interested in sliced_llama are comparing it to the libraries listed below
- Experimental sampler to make LLMs more creative☆30Updated last year
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆21Updated 3 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆15Updated 6 months ago
- 5X faster 60% less memory QLoRA finetuning☆21Updated 9 months ago
- Tools for formatting large language model prompts.☆12Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- BH hackathon☆14Updated 11 months ago
- Large-Language-Model to Machine Interface project.☆18Updated last year
- ☆27Updated 6 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 11 months ago
- Yet Another (LLM) Web UI, made with Gemini☆11Updated 2 months ago
- Using modal.com to process FineWeb-edu data☆20Updated 2 weeks ago
- run ollama & gguf easily with a single command☆49Updated 10 months ago
- Complex RAG backend☆28Updated 11 months ago
- ☆16Updated last year
- Build HTML artefacts with Ollama☆11Updated 3 months ago
- entropix style sampling + GUI☆25Updated 4 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆20Updated 3 weeks ago
- ☆39Updated last year
- LLM backed Fantasy Tribe Game☆18Updated 4 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆31Updated 3 weeks ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 3 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- a simple create-llama template using llama-index v0.10 and integrated with Ollama☆10Updated 10 months ago
- Embed anything.☆29Updated 9 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 5 months ago