Nero10578 / LLM-Inference-Benchmark
☆14 Updated 10 months ago
Alternatives and similar repositories for LLM-Inference-Benchmark
Users who are interested in LLM-Inference-Benchmark are comparing it to the repositories listed below
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆71 Updated 10 months ago
- run ollama & gguf easily with a single command ☆52 Updated last year
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to … ☆39 Updated 10 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts ☆24 Updated 2 months ago
- Locally running LLM with internet access ☆96 Updated 2 weeks ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆49 Updated 9 months ago
- Easily view and modify JSON datasets for large language models ☆78 Updated 2 months ago
- Complex RAG backend ☆28 Updated last year
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. … ☆37 Updated last year
- 5X faster 60% less memory QLoRA finetuning ☆21 Updated last year
- ☆17 Updated last week
- Use smol agents to do research and then update csv columns with its findings. ☆41 Updated 5 months ago
- ☆66 Updated last year
- Automated LLM novelist ☆47 Updated last year
- ☆17 Updated 7 months ago
- Conduct in-depth research with AI-driven insights: DeepDive is a command-line tool that leverages web searches and AI models to generate… ☆42 Updated 10 months ago
- LLM backed Fantasy Tribe Game ☆18 Updated 7 months ago
- LIVA - Local Intelligent Voice Assistant ☆61 Updated 10 months ago
- ☆30 Updated last year
- Simple LLM inference server ☆20 Updated last year
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine ☆23 Updated 6 months ago
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline. ☆45 Updated last year
- ☆22 Updated 5 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆63 Updated 10 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆67 Updated 2 weeks ago
- Gradio based tool to run open-source LLM models directly from Huggingface ☆93 Updated last year
- Branch Out Your Conversations ☆44 Updated 6 months ago
- Embed anything. ☆28 Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead) ☆21 Updated 9 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications ☆28 Updated last year