s-kostyaev / reranker
Local reranker service. Can be useful as part of a RAG pipeline.
☆20 · Updated last year
Alternatives and similar repositories for reranker
Users who are interested in reranker are comparing it to the libraries listed below.
- A comprehensive list of document parsers, covering PDF-to-text conversion and layout extraction. Each tested for support of tables, equat… ☆170 · Updated 6 months ago
- Open WebUI tool which executes code in a docker environment ☆18 · Updated last year
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data. ☆438 · Updated 2 months ago
- git-like rag pipeline ☆256 · Updated last month
- parakeet asr demo ☆64 · Updated 8 months ago
- Since OpenAI and friends refuse to give us a max_ctx param in /models, here's the current context window, input token and output token li… ☆65 · Updated last month
- A chess arena for large language models ☆38 · Updated 8 months ago
- ☆209 · Updated last month
- Docker compose to run vLLM on Windows ☆114 · Updated 2 years ago
- Collection of LLM system prompts ☆254 · Updated 5 months ago
- Sentence Transformers API: An OpenAI compatible embedding API server ☆70 · Updated last year
- A bridge to use Langchain output as an OpenAI-compatible API ☆89 · Updated 7 months ago
- Open Deep Researcher with openai compatible endpoint, now completely local with ollama, local playwright via searxng with citations and p… ☆154 · Updated 10 months ago
- ☆64 · Updated last year
- An implementation of iterative deep research using the OpenAI Agents SDK ☆721 · Updated last month
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨ ☆107 · Updated 7 months ago
- RAG on codebases using treesitter and LanceDB ☆279 · Updated last year
- Integrates AI tools into Microsoft Word ☆157 · Updated last year
- Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm… ☆149 · Updated 3 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆88 · Updated this week
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a… ☆118 · Updated last year
- An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications. ☆681 · Updated last year
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers. ☆243 · Updated 6 months ago
- QA-Pilot is an interactive chat project that leverages online/local LLM for rapid understanding and navigation of GitHub code repository. ☆318 · Updated 5 months ago
- Chat with your current directory's files using a local or API LLM. ☆426 · Updated 4 months ago
- Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive… ☆730 · Updated last year
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di… ☆137 · Updated last week
- A proxy server for multiple ollama instances with Key security ☆582 · Updated this week
- Interactive launcher and benchmarking harness for llama.cpp server throughput, with tests, sweeps, and round‑robin load tools. ☆211 · Updated last week
- ☆30 · Updated last year