cohere-ai / cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
☆3,058Updated last week
Alternatives and similar repositories for cohere-toolkit
Users that are interested in cohere-toolkit are comparing it to the libraries listed below
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,511Updated last month
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,404Updated 6 months ago
- Deploy your agentic workflows to production☆2,028Updated this week
- The easiest way to use Agentic RAG in any enterprise☆4,268Updated 5 months ago
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,898Updated 8 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,052Updated 10 months ago
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆1,839Updated last week
- Harness LLMs with Multi-Agent Programming☆3,432Updated this week
- ☆2,973Updated 9 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach…☆5,191Updated 3 months ago
- Training LLMs with QLoRA + FSDP☆1,487Updated 7 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,112Updated 4 months ago
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,761Updated 5 months ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,704Updated 11 months ago
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,235Updated 11 months ago
- proof of concept prototype for generating and querying against an ever-expanding knowledge graph with ai☆902Updated last year
- A RAG LLM co-pilot for browsing the web, powered by local LLMs☆1,511Updated 5 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆683Updated 10 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,262Updated this week
- A framework for prompt tuning using Intent-based Prompt Calibration☆2,626Updated 2 months ago
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,080Updated this week
- 🦜⛏️ Did you say you like data?☆1,138Updated last week
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,720Updated this week
- The easiest way to get started with LlamaIndex☆1,428Updated last week
- Yes, it's another chat over documents implementation... but this one is entirely local!☆1,773Updated 3 months ago
- Knowledge Agents and Management in the Cloud☆4,022Updated this week
- Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone☆1,017Updated 7 months ago
- LangServe 🦜️🏓☆2,112Updated 2 weeks ago
- Ship RAG based LLM web apps in seconds.☆995Updated last year
- ☆1,829Updated 3 weeks ago