aniketmaurya / llm-inference
Large Language Model (LLM) Inference API and Chatbot
★126 · Updated last year
Alternatives and similar repositories for llm-inference
Users who are interested in llm-inference are comparing it to the libraries listed below.
- Building a chatbot powered by a RAG pipeline to read, summarize, and quote the most relevant papers related to the user query. ★168 · Updated last year
- Mistral + Haystack: build RAG pipelines that rock ★105 · Updated last year
- Data extraction with LLM on CPU ★112 · Updated last year
- Data extraction with LLM on CPU ★269 · Updated last year
- Repository of the code base for the KT Generation process that we worked on at the Google Cloud and Searce GenAI Hackathon. ★76 · Updated 2 years ago
- ★52 · Updated 2 years ago
- ★223 · Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB ★122 · Updated last year
- Datasets and models for instruction-tuning ★238 · Updated last year
- Data extraction with LLM on CPU ★85 · Updated last year
- Anthropic Claude2 Hackathon: Building MCTS with Claude for optimal action prediction during patient/doctor interactions. ★105 · Updated 2 years ago
- ★80 · Updated last year
- ★185 · Updated last year
- ★199 · Updated 2 years ago
- Chat with PDF using Zephyr 7B Alpha, Langchain, ChromaDB, and Gradio with Free Google Colab ★136 · Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state-of-the-art machine learning models access… ★114 · Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ★49 · Updated last year
- Examples of RAG using LlamaIndex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B ★129 · Updated last year
- ★93 · Updated last year
- Minimalistic ChatBot Interface in pure Python ★225 · Updated last year
- PanML is a high level generative AI/ML development and analysis library designed for ease of use and fast experimentation. ★117 · Updated 2 years ago
- Chat Bot with LLM and Fact Reference. Backed by RAG (Retrieval-Augmented Generation) and LangChain ★129 · Updated last year
- ★89 · Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG ★327 · Updated 10 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more! ★84 · Updated last year
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented … ★85 · Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ★117 · Updated 5 months ago
- Data extraction with LLM on CPU ★68 · Updated last year
- Visualization for a Retrieval-Augmented Generation (RAG) Assistant ★192 · Updated 8 months ago
- ★30 · Updated last year