CVxTz / llm-serve-tutorialLinks

☆20

Alternatives and similar repositories for llm-serve-tutorial

Users that are interested in llm-serve-tutorial are comparing it to the libraries listed below

Sorting:

mrmaheshrajput / productionizing-llms
Code Repository for Blog - How to Productionize Large Language Models (LLMs)
☆11Updated last year
katanaml / llm-ollama-llamaindex-invoice-cpu
Data extraction with LLM on CPU
☆113Updated last year
Logisx / AI-Senior
🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.
☆17Updated last year
jayita13 / GenerativeAI
GenAI Experimentation
☆57Updated 2 weeks ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
clab2024 / clab
LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…
☆16Updated last year
jjovalle99 / agentic-design-patterns
☆14Updated last year
wjbmattingly / youtube-florence-table
Table detection with Florence.
☆14Updated last year
marib00 / llamaindex-embedding-lora
☆29Updated last year
tahreemrasul / semantic_research_engine
A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with L…
☆83Updated last year
AIAnytime / agent-watch
Agent Watch is an AgentOps monitoring library designed for Crew AI applications.
☆19Updated 8 months ago
leehanchung / llm-pdf-qa-workshop
Introduction to LLM App Development Workshop: PDF Q&A App using OpenAI, Langchain, and Chainlit
☆47Updated last year
AIAnytime / Zephyr-7B-beta-RAG-Demo
Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.
☆35Updated last year
AntonioGr7 / pratical-llms
A collection of hand on notebook for LLMs practitioner
☆49Updated 6 months ago
langchain-ai / prompt-eval-recommendation
Streamlit app for recommending eval functions using prompt diffs
☆29Updated last year
AI-ANK / RAGArch
RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented …
☆85Updated last year
kylejtobin / rag_bot
A platform designed to facilitate the development of advanced conversational agents using retrieval augmented generation (RAG).
☆34Updated last year
sachink1729 / DSPy-Chain-of-Thought-RAG
Building a Chain of Thought RAG Model with DSPy, Qdrant and Ollama
☆32Updated last year
evidentlyai / community-examples
Examples of using Evidently to evaluate, test and monitor ML models.
☆34Updated last month
ravi03071991 / KT_Generator
Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.
☆74Updated last year
zrizvi93 / trevorhack
☆45Updated last year
anakin87 / mistral-haystack
Mistral + Haystack: build RAG pipelines that rock 🤘
☆105Updated last year
AhmedSSoliman / Llama2-CodeGen-Fine-Tuning-LLama-2
☆15Updated last year
Renumics / renumics-rag
Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚
☆191Updated 7 months ago
Paulescu / testing-llms-in-the-real-world
Test LLMs automatically with Giskard and CI/CD
☆30Updated 11 months ago
davanstrien / data-for-fine-tuning-llms
☆77Updated last year
sachink1729 / Healthcare-AI-Assistant-Medical-Data-Qdrant-Dspy-Groq
Building Private Healthcare AI Assistant for Clinics Using Qdrant Hybrid Cloud, DSPy and Groq - Llama3
☆22Updated last year
darshil3011 / AutoMetaRAG
Dynamic Metadata based RAG Framework
☆75Updated last year
aniketmaurya / fastserve-ai
Machine Learning Serving focused on GenAI with simplicity as the top priority.
☆59Updated 3 weeks ago
githubpradeep / notebooks
☆54Updated 5 months ago