MeetKai / functionary
Chat language model that can use tools and interpret the results
☆1,506Updated this week
Alternatives and similar repositories for functionary:
Users that are interested in functionary are comparing it to the libraries listed below
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,200Updated 4 months ago
- ☆783Updated 4 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,321Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,064Updated this week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆1,748Updated last week
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,334Updated this week
- Harness LLMs with Multi-Agent Programming☆2,965Updated this week
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,679Updated 3 months ago
- A blazing fast inference solution for text embeddings models☆3,089Updated last week
- Open-source tool to visualise your RAG 🔮☆1,101Updated 3 weeks ago
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings☆1,906Updated 2 weeks ago
- Customizable implementation of the self-instruct paper.☆1,034Updated 10 months ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,588Updated 6 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆4,832Updated this week
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,830Updated last year
- High-performance retrieval engine for unstructured data☆1,128Updated 2 weeks ago
- A language for constraint-guided and efficient LLM programming.☆3,787Updated 7 months ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,781Updated last year
- Efficient Retrieval Augmentation and Generation Framework☆1,428Updated 2 weeks ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,501Updated 5 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆3,893Updated this week
- Developer APIs to Accelerate LLM Projects☆1,530Updated 3 months ago
- Zep | The Memory Foundation For Your AI Stack☆2,923Updated 2 months ago
- Tools for merging pretrained large language models.☆5,157Updated this week
- YaRN: Efficient Context Window Extension of Large Language Models☆1,405Updated 9 months ago
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval☆1,066Updated 4 months ago
- Large-scale LLM inference engine☆1,254Updated this week
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆1,946Updated 8 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,806Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆522Updated last month