FareedKhan-dev / llm-scale-deploy-guideLinks
An end-to-end pipeline to optimize and host LLM for 100K parallel queries
☆21Updated 2 weeks ago
Alternatives and similar repositories for llm-scale-deploy-guide
Users that are interested in llm-scale-deploy-guide are comparing it to the libraries listed below
Sorting:
- Handling Big Data with Knowledge Graph: A Detailed Guide☆24Updated 2 months ago
- A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch☆45Updated last month
- LLM reads a paper and produce a working prototype☆58Updated 3 months ago
- ☆18Updated 5 months ago
- tickr-agent is an enterprise-ready, scalable Python library for building swarms of financial agents that conduct comprehensive stock anal…☆46Updated 3 weeks ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆55Updated 5 months ago
- 9 Different Ways to Optimize AI Agent Memories☆25Updated 2 weeks ago
- Simple Graph Memory for AI applications☆89Updated 2 months ago
- ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs.☆27Updated last month
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 2 months ago
- ☆40Updated 7 months ago
- Very minimal (and stateless) agent framework☆44Updated 6 months ago
- ☆47Updated 10 months ago
- ☆21Updated 8 months ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!☆98Updated this week
- This template demonstrates how to create a collaborative team of AI agents that work together to process, analyze, and generate insights …☆34Updated 6 months ago
- A collection of example AI programs built using DSPy and maitained by the Langtrace AI team.☆34Updated 8 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 8 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 3 months ago
- Explore the latest AI Agent Framework!☆65Updated 11 months ago
- ☆119Updated last week
- AI-powered fashion recommendation system leveraging LLMs, embeddings, and retrieval techniques to deliver personalized shopping experienc…☆17Updated 5 months ago
- Design Patterns for Multi Agents Frameworks Like Autogen, Langraph, Taskweaver, Crewai,etc☆59Updated last year
- A platform for building configurable, database-backed generative AI agentic assistants.☆24Updated 5 months ago
- OpenPipe Reinforcement Learning Experiments☆28Updated 4 months ago
- AI at your fingertips: powerful CLI tools for speech, text, and language processing☆18Updated 10 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆14Updated this week
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆19Updated 7 months ago
- ☆82Updated 3 weeks ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆74Updated 8 months ago