This repo showcases how to run a model locally and offline, free of OpenAI dependencies.
(☆297, updated Jul 12, 2024)
Alternatives and similar repositories for local_llama
Users who are interested in local_llama are comparing it to the libraries listed below:
- Gradio based tool to run opensource LLM models directly from Huggingface (☆97, updated Jun 27, 2024)
- Simple agent framework using Ollama tool calling (☆10, updated Aug 27, 2024)
- A Kotlin based terminal command to interact with the OpenAI Assistants API in a slightly geeky way... (☆14, updated Jun 23, 2024)
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to … (☆38, updated Aug 28, 2024)
- Prompt Development Environment for GPT (☆14, updated Jul 23, 2023)
- OllaDeck is a purple technology stack for Generative AI (text modality) cybersecurity. It provides a comprehensive set of tools for both … (☆18, updated Sep 21, 2024)
- Host LLM via text-generation-inference (☆16, updated Dec 5, 2023)
- Simple and fast server for GPTQ-quantized LLaMA inference (☆24, updated May 18, 2023)
- Finetune Your Local LLM (☆18, updated Sep 23, 2023)
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline. (☆45, updated Jan 28, 2024)
- Like system requirements lab but for LLMs (☆31, updated Jun 10, 2023)
- PaLM-Kosmos-Vision is a foundational project showcasing basic ChatGPT with vision capabilities, inviting further development for advanced… (☆16, updated Nov 15, 2023)
- A TTS model capable of generating ultra-realistic dialogue in one pass. (☆31, updated May 1, 2025)
- A micro LLM multi-agent system for data analysis (☆17, updated Apr 27, 2025)
- Tool for chatting with your codebase and docs using OpenAI, LlamaCpp, and GPT-4-All (☆512, updated Nov 18, 2024)
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI (☆59, updated Dec 1, 2024)
- Harnessing the Memory Power of the Camelids (☆147, updated Oct 19, 2023)
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice. (☆22, updated Oct 24, 2024)
- Writing Extension for Text Generation WebUI (☆66, updated Aug 7, 2025)
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library. (☆23, updated Dec 15, 2025)
- AI_Powered_Dev_Search_Engine (☆12, updated Mar 10, 2024)
- Identify and automatically fix issues in shell scripts (☆15, updated Nov 24, 2023)
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… (☆14, updated Jan 19, 2024)
- A simple user interface for interacting with AI (☆46, updated Oct 14, 2025)
- An agentic runtime that enables secure, extensible and configurable AI automation from any model (☆17, updated this week)
- Simple Streamlit UI for Ollama (☆21, updated May 13, 2024)
- Generic rag framework to apply the power of LLMs on any given dataset (☆671, updated Feb 24, 2026)
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote … (☆15, updated Apr 28, 2024)
- Dumb experiment exploring Clean Architecture (☆11, updated Dec 22, 2018)
- Controllable Language Model Interactions in TypeScript (☆10, updated May 17, 2024)
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,… (☆28, updated Dec 29, 2025)
- I made ChatPDF using Langchain which leverages the power of LLMs like LLama, ChatGPT, OpenAssistant and allows the user to upload document… (☆12, updated May 24, 2023)
- A tiny server to run local inference on MLX model in the style of OpenAI (☆13, updated Jan 31, 2024)
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language (☆313, updated Jun 17, 2024)
- run ollama & gguf easily with a single command (☆52, updated May 15, 2024)