Codium-ai / AlphaCodium
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""
☆3,616Updated last week
Related projects ⓘ
Alternatives and complementary repositories for AlphaCodium
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 30.67% tasks (pass@1) in SWE-b…☆2,711Updated last week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆4,601Updated 2 months ago
- All things prompt engineering☆5,410Updated 5 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!☆3,207Updated 2 months ago
- Build resilient language agents as graphs.☆6,531Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆13,615Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆2,819Updated this week
- A code-first agent framework for seamlessly planning and executing data analytics tasks.☆5,320Updated 2 weeks ago
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Llam…☆6,302Updated this week
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI…☆2,010Updated this week
- The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.com/servers/agora-999382051…☆1,720Updated this week
- Letta (formerly MemGPT) is a framework for creating LLM services with memory.☆12,082Updated this week
- AIOS: LLM Agent Operating System☆3,390Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,031Updated 2 months ago
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.☆6,752Updated last week
- Adding guardrails to large language models.☆4,053Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆3,270Updated this week
- [NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employ…☆13,651Updated this week
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,505Updated 2 months ago
- DSPy: The framework for programming—not prompting—foundation models☆18,587Updated this week
- structured outputs for llms☆8,068Updated this week
- A list of AI autonomous agents☆11,186Updated last month
- Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Ge…☆4,650Updated this week
- ✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.☆2,141Updated 6 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆11,442Updated this week
- Deploy your agentic worfklows to production☆1,817Updated this week
- A language for constraint-guided and efficient LLM programming.☆3,683Updated 5 months ago
- [ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?☆1,943Updated last week
- 🥷 Run AI-agents with an API☆5,280Updated 2 weeks ago
- Build Conversational AI in minutes ⚡️☆7,130Updated this week