Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, and DSPy programs for better quality, lower execution latency, and lower execution cost. It also includes a simple agent/workflow framework.
☆272 · May 16, 2025 · Updated 10 months ago
Alternatives and similar repositories for cognify
Users who are interested in cognify are comparing it to the libraries listed below.
- A fast text search engine built for SSDs, written in C++. ☆11 · Aug 29, 2022 · Updated 3 years ago
- Efficient Long-context Language Model Training by Core Attention Disaggregation ☆96 · Mar 5, 2026 · Updated 2 weeks ago
- ☆35 · Jun 22, 2024 · Updated last year
- FlexRAG: A RAG Framework for Information Retrieval and Generation. ☆234 · Updated this week
- Stateful LLM Serving ☆97 · Mar 11, 2025 · Updated last year
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank ☆75 · Nov 4, 2024 · Updated last year
- A High-Efficiency System of Large Language Model Based Search Agents ☆77 · Jul 2, 2025 · Updated 8 months ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera… ☆34 · Mar 26, 2024 · Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task… ☆33 · Feb 10, 2025 · Updated last year
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training. ☆224 · May 31, 2025 · Updated 9 months ago
- ☆63 · Dec 6, 2024 · Updated last year
- This is the official repo for Towards Uncertainty-Aware Language Agent. ☆31 · Aug 15, 2024 · Updated last year
- Library to stream operating system events to AI ☆41 · Apr 8, 2025 · Updated 11 months ago
- Very minimal (and stateless) agent framework ☆44 · Jan 12, 2025 · Updated last year
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq ☆12 · Jun 27, 2024 · Updated last year
- Structured AI Prompt Engineering Agent based on OpenAI’s best practices ☆38 · May 4, 2025 · Updated 10 months ago
- FlexFlow Serve: Low-Latency, High-Performance LLM Serving ☆75 · Sep 15, 2025 · Updated 6 months ago
- [ACL'25 Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline. ☆59 · Jul 21, 2025 · Updated 8 months ago
- Clone of OpenAI assistants API ☆12 · Jul 14, 2024 · Updated last year
- (ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation ☆12 · May 21, 2025 · Updated 10 months ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction ☆15 · Nov 4, 2024 · Updated last year
- Implementing ReaRAG, a knowledge-guided reasoning model that enhances factual accuracy using iterative retrieval-augmented generation. Ad… ☆15 · Feb 2, 2026 · Updated last month
- ☆26 · Updated this week
- An interactive AI-powered learning experience generator that creates comprehensive, multimedia educational content on any topic. ☆25 · Jan 13, 2025 · Updated last year
- ☆28 · Nov 10, 2025 · Updated 4 months ago
- LangGraph-GUI backend with fastapi ☆62 · Oct 16, 2025 · Updated 5 months ago
- ☆39 · Sep 13, 2025 · Updated 6 months ago
- This codebase demonstrates various DSPy functionalities through practical examples. ☆57 · Feb 16, 2025 · Updated last year
- DSPy: The framework for programming—not prompting—language models ☆32,853 · Updated this week
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API. ☆17 · Jul 21, 2025 · Updated 8 months ago
- An Awesome list of curated DSPy resources. ☆529 · Dec 10, 2025 · Updated 3 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆57 · Mar 6, 2025 · Updated last year
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, … ☆50 · Nov 4, 2025 · Updated 4 months ago
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25] ☆67 · Oct 2, 2025 · Updated 5 months ago
- TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients. Published in Nature. ☆3,424 · Jul 25, 2025 · Updated 7 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems ☆2,019 · Dec 22, 2024 · Updated last year
- FlashInfer: Kernel Library for LLM Serving ☆5,194 · Updated this week
- [NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux, ReasonFlux-PRM, and ReasonFlux-Coder. ☆524 · Sep 27, 2025 · Updated 5 months ago
- A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and cach… ☆59 · Oct 27, 2025 · Updated 4 months ago