aniketmaurya / llm-inference
Large Language Model (LLM) Inference API and Chatbot
β124Updated 9 months ago
Alternatives and similar repositories for llm-inference:
Users that are interested in llm-inference are comparing it to the libraries listed below
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.β165Updated 9 months ago
- β88Updated last year
- Mistral + Haystack: build RAG pipelines that rock π€β100Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β48Updated 6 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDBβ118Updated last year
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7Bβ120Updated 11 months ago
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.β74Updated last year
- β184Updated last year
- Data extraction with LLM on CPUβ112Updated last year
- β51Updated last year
- β41Updated 10 months ago
- β198Updated 11 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!β79Updated last year
- End-to-End LLM Guideβ99Updated 6 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ91Updated last month
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ196Updated 8 months ago
- β45Updated 9 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.β48Updated last year
- This repository implements the chain of verification paper by Meta AIβ160Updated last year
- A microframework for creating simple AI agents.β91Updated 5 months ago
- Anthropic Claude2 Hackathon:Building MCTS with Claude for optimal action prediction during patient/doctor interactions.β107Updated last year
- β16Updated last year
- β77Updated 8 months ago
- PanML is a high level generative AI/ML development and analysis library designed for ease of use and fast experimentation.β115Updated last year
- β57Updated last year
- β91Updated last year
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelinesβ31Updated last year
- β99Updated 9 months ago
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPUβ32Updated last year
- β76Updated 7 months ago