RobinQu / instinct.cpp
instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG, Chatbot, Code interpreter) powered by language models. Call it langchain.cpp if you like.
☆36Updated 2 months ago
Related projects: ⓘ
- GGML implementation of BERT model with Python bindings and quantization.☆51Updated 7 months ago
- ggml implementation of embedding models including SentenceTransformer and BGE☆50Updated 8 months ago
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆85Updated this week
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆36Updated 7 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆53Updated 3 weeks ago
- Something similar to Apple Intelligence?☆54Updated 2 months ago
- Self-hosted LLM chatbot arena, with yourself as the only judge☆36Updated 7 months ago
- Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retriev…☆37Updated 8 months ago
- Local LLM inference & management server with built-in OpenAI API☆30Updated 5 months ago
- ☆31Updated 8 months ago
- ChatData 🔍 📖 brings RAG to real applications with FREE✨ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 milli…☆152Updated 2 months ago
- Open-source observability for your LLM application.☆40Updated 2 months ago
- Local ML voice chat using high-end models.☆138Updated 2 weeks ago
- ☆34Updated this week
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆32Updated 2 months ago
- ggml implementation of BERT Embedding☆24Updated 9 months ago
- ☆50Updated 3 months ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆35Updated 2 weeks ago
- A memory framework for Large Language Models and Agents.☆157Updated last month
- Rust implementation of Surya☆46Updated last month
- Complex RAG backend☆28Updated 5 months ago
- One Line To Build Zero-Data Classifiers in Minutes☆29Updated 3 weeks ago
- Implementation of nougat that focuses on processing pdf locally.☆68Updated 4 months ago
- Port of Suno AI's Bark in C/C++ for fast inference☆50Updated 5 months ago
- AirLLM 70B inference with single 4GB GPU☆11Updated last month
- Inference of Mamba models in pure C☆176Updated 6 months ago
- A QT GUI for large language models☆23Updated 8 months ago
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.☆91Updated last year
- Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.☆44Updated 6 months ago
- Eternal is an experimental platform for machine learning models and workflows.☆36Updated last month