mzbac / GPTQ-for-LLaMa-API
Provide a way to use the GPT-QLLama model as an API
☆43Updated last year
Related projects: ⓘ
- Simple and fast server for GPTQ-quantized LLaMA inference☆24Updated last year
- Harnessing the Memory Power of the Camelids☆145Updated 11 months ago
- Local LLM ReAct Agent with Guidance☆154Updated last year
- ☆133Updated 9 months ago
- Example of calling OpenRouter from a Streamit app☆88Updated last year
- 🔓 The open-source autonomous agent LLM initiative 🔓☆90Updated 7 months ago
- ☆44Updated 7 months ago
- oobaboga -text-generation-webui implementation of wafflecomposite - langchain-ask-pdf-local☆65Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆44Updated last year
- This code implements a Local LLM Selector from the list of Local Installed Ollama LLMs for your specific user Query☆103Updated 9 months ago
- A Personalised AI Assistant Inspired by 'Diamond Age, Powered by SMS☆91Updated last year
- ☆63Updated last year
- Easy-to-use agent memory, powered by chromadb and postgres☆56Updated 11 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆139Updated 11 months ago
- GPT-2 small trained on phi-like data☆65Updated 7 months ago
- An OpenAI-like LLaMA inference API☆111Updated last year
- Collection of Tree of Thoughts prompting techniques I've found useful to start with, then stylize, then iterate☆69Updated 11 months ago
- auto fine tune of models with synthetic data☆71Updated 7 months ago
- A guidance compatibility layer for llama-cpp-python☆35Updated last year
- Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents. Contains Oobagooga and Kobol…☆210Updated last year
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.☆91Updated last year
- A backend API to perform search over Wikipedia using LangChain, Cohere and Weaviate☆106Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- ☆32Updated 7 months ago
- An AI agent for interacting with a computer using the graphical user interface☆61Updated 11 months ago
- Porting BabyAGI to Oobabooba.☆33Updated last year
- BabyAGI-🦙: Enhanced for Llama models (running 100% local) and persistent memory, with smart internet search based on BabyCatAGI and docu…☆88Updated last year
- Roy: A lightweight, model-agnostic framework for crafting advanced multi-agent systems using large language models.☆76Updated 10 months ago
- AgentX is an experiment to develop autonomous agents so you can automate everything! This project is inspired by Auto-GPT, babyagi, Super…☆97Updated last year
- The code we currently use to fine-tune models.☆107Updated 4 months ago