CVxTz / llm-serve-tutorial
☆20 · Updated last year
Alternatives and similar repositories for llm-serve-tutorial
Users who are interested in llm-serve-tutorial are comparing it to the repositories listed below.
- Code Repository for Blog - How to Productionize Large Language Models (LLMs) ☆12 · Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 · Updated last year
- Data extraction with LLM on CPU ☆112 · Updated last year
- GenAI Experimentation ☆58 · Updated last month
- A collection of hands-on notebooks for LLM practitioners ☆50 · Updated 8 months ago
- Table detection with Florence. ☆15 · Updated last year
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry. ☆17 · Updated last year
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll… ☆17 · Updated last year
- ☆80 · Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆105 · Updated last year
- Fine-tune an LLM to perform batch inference and online serving. ☆112 · Updated 4 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆67 · Updated 11 months ago
- Data extraction with LLM on CPU ☆85 · Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆45 · Updated last year
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon. ☆76 · Updated 2 years ago
- ☆14 · Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority. ☆59 · Updated 2 months ago
- Dynamic Metadata based RAG Framework ☆75 · Updated last year
- RAG example using DSPy, Gradio, FastAPI ☆85 · Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆78 · Updated 11 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆111 · Updated 5 months ago
- ☆30 · Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆116 · Updated 6 months ago
- ☆54 · Updated last month
- ☆15 · Updated 2 years ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications. ☆19 · Updated 10 months ago
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with L… ☆83 · Updated last year
- Notebooks using the Neural Magic libraries 📓 ☆39 · Updated last year
- Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚 ☆192 · Updated last week
- ☆29 · Updated last year