aniketmaurya / llm-inferenceLinks
Large Language Model (LLM) Inference API and Chatbot
β125Updated last year
Alternatives and similar repositories for llm-inference
Users that are interested in llm-inference are comparing it to the libraries listed below
Sorting:
- Mistral + Haystack: build RAG pipelines that rock π€β105Updated last year
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.β167Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDBβ119Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 10 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ196Updated last year
- β89Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ105Updated 2 months ago
- Data extraction with LLM on CPUβ113Updated last year
- PanML is a high level generative AI/ML development and analysis library designed for ease of use and fast experimentation.β117Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β57Updated last month
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.β74Updated last year
- π¬ minimalistic ChatBot Interface in pure pythonβ224Updated 10 months ago
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.β69Updated last year
- Data extraction with LLM on CPUβ265Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!β81Updated last year
- β52Updated last year
- A template to kick-start your Python project β¨πβ51Updated 5 months ago
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented β¦β85Updated last year
- A collection of hand on notebook for LLMs practitionerβ47Updated 4 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAGβ321Updated 6 months ago
- β42Updated last year
- β77Updated 11 months ago
- Notebooks using the Neural Magic libraries πβ41Updated 10 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessβ¦β114Updated last year
- A python package that provides a custom streamlit connection to query data from weaviate, the AI native vector databaseβ55Updated 10 months ago
- docker setup to run the LangChain research-assistant template using langserveβ45Updated last week
- Complete implementation of Llama2 with/without KV cache & inference πβ46Updated last year
- A curated collection of interesting applications, repos, and tutorials using large language models (LLM) like GPT-3β140Updated last year
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.β19Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creationβ111Updated 8 months ago