aniketmaurya / llm-inferenceLinks
Large Language Model (LLM) Inference API and Chatbot
β126Updated last year
Alternatives and similar repositories for llm-inference
Users that are interested in llm-inference are comparing it to the libraries listed below
Sorting:
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.β167Updated last year
- Mistral + Haystack: build RAG pipelines that rock π€β106Updated last year
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.β76Updated 2 years ago
- Data extraction with LLM on CPUβ112Updated last year
- β223Updated 2 years ago
- β52Updated 2 years ago
- Anthropic Claude2 Hackathon:Building MCTS with Claude for optimal action prediction during patient/doctor interactions.β106Updated 2 years ago
- Data extraction with LLM on CPUβ269Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDBβ123Updated last year
- β80Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β51Updated last year
- Data extraction with LLM on CPUβ85Updated last year
- β89Updated last year
- Chat Bot with LLM and Fact Reference. RAG(Retrieval Augmented Generation) and LangChain backedβ129Updated last year
- β186Updated 2 years ago
- β93Updated 2 years ago
- β103Updated 2 years ago
- π¬ minimalistic ChatBot Interface in pure pythonβ227Updated last year
- A curated collection of interesting applications, repos, and tutorials using large language models (LLM) like GPT-3β150Updated 2 years ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β59Updated last month
- Chat with PDF using Zephyr 7B Alpha, Langchain, ChromaDB, and Gradio with Free Google Colabβ137Updated last year
- β43Updated last year
- β137Updated 2 years ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!β83Updated last year
- π Datasets and models for instruction-tuningβ238Updated 2 years ago
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPUβ32Updated 2 years ago
- Domain Adapted Language Modeling Toolkit - E2E RAGβ332Updated last year
- Text to Python Objects via a LLM Function Callβ58Updated last year
- β200Updated 2 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ196Updated last year