nistvan86 / continuedev-llamacpp-gpu-llm-server
☆10 Updated 11 months ago
Related projects
Alternatives and complementary repositories for continuedev-llamacpp-gpu-llm-server
- Drop in replacement for OpenAI's embedding API. Self Hosted. ☆50 Updated last year
- Building LLM apps with Text Tensors using PyTorch concepts and text gradients ☆35 Updated 2 months ago
- Embed anything. ☆29 Updated 5 months ago
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search ☆39 Updated last year
- Logging and caching superpowers for the openai sdk ☆99 Updated 7 months ago
- Experimental LLM agent/toolkit with direct Vim access using neovim/pynvim ☆69 Updated last month
- A simple Python sandbox for helpful LLM data agents ☆162 Updated 4 months ago
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts. ☆22 Updated last year
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp ☆45 Updated last year
- A list of software that allows searching the web with the assistance of AI. ☆98 Updated this week
- Simple, Fast, Parallel Huggingface GGML model downloader written in python ☆24 Updated last year
- 🐣🕐📅 A simple utility to draft scheduling emails. ☆12 Updated last year
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ ☆63 Updated 11 months ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp ☆96 Updated last month
- BabyAGI-🦙: Enhanced for Llama models (running 100% local) and persistent memory, with smart internet search based on BabyCatAGI and docu… ☆89 Updated last year
- A guidance compatibility layer for llama-cpp-python ☆34 Updated last year
- ☆40 Updated 6 months ago
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU ☆30 Updated last year
- A large German Legal Corpus of laws, administrative regulations and court decisions issued in Germany at federal level. Query the corpus:… ☆10 Updated 11 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆36 Updated 9 months ago
- Wingman is the fastest and easiest way to run Llama models on your PC or Mac. ☆42 Updated 5 months ago
- ☆48 Updated last year
- ☆24 Updated 10 months ago
- Simple examples using Argilla tools to build AI ☆38 Updated last week
- LLM-DB: A database powered by language models ☆50 Updated last year
- ☆31 Updated 10 months ago
- Text to Python Objects via a LLM Function Call ☆56 Updated 7 months ago
- Multimodal Chat with Gemini API ☆46 Updated 10 months ago
- Let's create synthetic textbooks together :) ☆70 Updated 9 months ago