CVxTz / llm-serve-tutorialLinks
☆20Updated last year
Alternatives and similar repositories for llm-serve-tutorial
Users that are interested in llm-serve-tutorial are comparing it to the libraries listed below
Sorting:
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Updated last year
- Table detection with Florence.☆15Updated last year
- Data extraction with LLM on CPU☆112Updated 2 years ago
- A collection of hand on notebook for LLMs practitioner☆51Updated last year
- GenAI Experimentation☆59Updated 4 months ago
- ☆14Updated last year
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with L…☆81Updated last year
- This repository contains the implementation of evaluation metrics for recommendation systems. We have compared similarity, candidate gene…☆27Updated 10 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated last year
- Building Private Healthcare AI Assistant for Clinics Using Qdrant Hybrid Cloud, DSPy and Groq - Llama3☆23Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆117Updated 7 months ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆21Updated last year
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆117Updated last year
- ☆80Updated last year
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.☆17Updated 2 years ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated last week
- Test LLMs automatically with Giskard and CI/CD☆31Updated last year
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆166Updated last year
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆17Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated last month
- MLFlow End to End Workshop at Chandigarh University☆11Updated 2 years ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆119Updated 9 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated last year
- Large Language Model (LLM) Inference API and Chatbot☆127Updated last year
- Multi-Agent LLM System for Digital Scam Protection☆12Updated last year
- Retrieval Augmented Generation (RAG) on audio data with LangChain☆15Updated 2 years ago
- ☆15Updated 2 years ago
- GGUF Quantization of any LLM.☆41Updated last year
- Reference code base for ML Engineering in Action, Manning Publications Author: Ben Wilson☆20Updated 2 years ago