CVxTz / llm-serve-tutorial
β20Updated last year
Alternatives and similar repositories for llm-serve-tutorial:
Users that are interested in llm-serve-tutorial are comparing it to the libraries listed below
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)β11Updated last year
- π€ AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.β16Updated last year
- Running load tests on a FastAPI application using Locustβ15Updated 2 weeks ago
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Llβ¦β16Updated 11 months ago
- Experimenting text-embeddings-inference server on both CPU andΒ GPUβ18Updated last year
- Notebooks using the Neural Magic libraries πβ42Updated 8 months ago
- β14Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 9 months ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.β17Updated 4 months ago
- Table detection with Florence.β13Updated 9 months ago
- GenAI Experimentationβ58Updated 2 months ago
- β16Updated 9 months ago
- Data extraction with LLM on CPUβ68Updated last year
- Medical Help App using GPT-4Vβ25Updated last year
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA onβ¦β44Updated last year
- β41Updated 4 months ago
- A tutorial on DSPy and whether automated prompt engineering lives up to the hypeβ22Updated 11 months ago
- β52Updated 2 months ago
- Medical Mixture of Experts LLM using Mergekit.β20Updated last year
- Build Agentic workflows with function calling using open LLMsβ26Updated this week
- Streamlit application that helps users analyze RFP's using the latest Gemini 2.0 Flash Experimental LLM.β13Updated 3 months ago
- Writing Blog Posts with Generative Feedback Loops!β47Updated last year
- β29Updated last year
- A basic streamlit application that uses Mito for data importing and cleaning.β23Updated last year
- β1Updated 9 months ago
- β45Updated 6 months ago
- β31Updated last year
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Reβ¦β21Updated last month
- π A deep-dive into HyDE for Advanced LLM RAG + π‘ Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, coveraβ¦β32Updated last year
- β77Updated 10 months ago