CVxTz / llm-serve-tutorialLinks
☆20Updated last year
Alternatives and similar repositories for llm-serve-tutorial
Users that are interested in llm-serve-tutorial are comparing it to the libraries listed below
Sorting:
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Updated last year
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆20Updated last year
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.☆17Updated 2 years ago
- Table detection with Florence.☆15Updated last year
- GenAI Experimentation☆59Updated 5 months ago
- Data extraction with LLM on CPU☆112Updated 2 years ago
- A collection of hand on notebook for LLMs practitioner☆51Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- ☆14Updated last year
- ☆55Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated 2 months ago
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with L…☆81Updated last year
- ☆15Updated 2 years ago
- Test LLMs automatically with Giskard and CI/CD☆31Updated last year
- ☆31Updated 2 years ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated last month
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆84Updated last year
- This repository contains the implementation of evaluation metrics for recommendation systems. We have compared similarity, candidate gene…☆27Updated 11 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 10 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated 2 years ago
- Dynamic Metadata based RAG Framework☆78Updated 2 months ago
- ☆80Updated last year
- Awesome LLM application repo☆87Updated 10 months ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆34Updated last year
- ☆54Updated 3 weeks ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆119Updated 10 months ago
- ☆44Updated last year
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆17Updated last year
- Building Private Healthcare AI Assistant for Clinics Using Qdrant Hybrid Cloud, DSPy and Groq - Llama3☆24Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆120Updated 8 months ago