aniketmaurya / llm-inference
Large Language Model (LLM) Inference API and Chatbot
β125Updated last year
Alternatives and similar repositories for llm-inference
Users that are interested in llm-inference are comparing it to the libraries listed below
Sorting:
- Mistral + Haystack: build RAG pipelines that rock π€β103Updated last year
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.β167Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDBβ119Updated last year
- Data extraction with LLM on CPUβ113Updated last year
- β89Updated last year
- π¬ minimalistic ChatBot Interface in pure pythonβ222Updated 9 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 10 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ103Updated last month
- β185Updated last year
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.β74Updated last year
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelinesβ31Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ198Updated last year
- β220Updated last year
- β52Updated last year
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendeskβ299Updated 3 weeks ago
- Domain Adapted Language Modeling Toolkit - E2E RAGβ320Updated 6 months ago
- β42Updated last year
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7Bβ126Updated last year
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented β¦β84Updated last year
- β77Updated 11 months ago
- β45Updated last year
- Data extraction with LLM on CPUβ85Updated last year
- A collection of hand on notebook for LLMs practitionerβ47Updated 4 months ago
- β40Updated last year
- A microframework for creating simple AI agents.β91Updated 9 months ago
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with Lβ¦β82Updated last year
- β57Updated last year
- β92Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraphβ144Updated last year
- Generalist and Lightweight Model for Text Classificationβ128Updated 2 weeks ago