CVxTz / llm-serve-tutorial
☆20Updated last year
Alternatives and similar repositories for llm-serve-tutorial
Users that are interested in llm-serve-tutorial are comparing it to the libraries listed below
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Updated last year
- Table detection with Florence.☆15Updated last year
- Data extraction with LLM on CPU☆112Updated last year
- A collection of hands-on notebooks for LLM practitioners☆51Updated 11 months ago
- GenAI Experimentation☆59Updated 4 months ago
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.☆17Updated last year
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆21Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- ☆14Updated last year
- ☆80Updated last year
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆17Updated last year
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with L…☆82Updated last year
- ☆15Updated 2 years ago
- Repository of the code base for the KT Generation process that we worked on at the Google Cloud and Searce GenAI Hackathon.☆76Updated 2 years ago
- ☆55Updated 4 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆115Updated 6 months ago
- ☆44Updated last year
- Multi-Agent LLM System for Digital Scam Protection☆12Updated last year
- Retrieval Augmented Generation (RAG) on audio data with LangChain☆15Updated 2 years ago
- ☆45Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆68Updated last month
- Dynamic Metadata based RAG Framework☆78Updated 2 weeks ago
- ☆90Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated last year
- Pandas-LLM☆46Updated 2 years ago
- Data extraction with LLM on CPU☆85Updated 2 years ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
- A tutorial on DSPy and whether automated prompt engineering lives up to the hype☆24Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆119Updated 8 months ago
- Large Language Model (LLM) Inference API and Chatbot☆127Updated last year