aniketmaurya / llm-inferenceLinks

Large Language Model (LLM) Inference API and Chatbot

☆126

Alternatives and similar repositories for llm-inference

Users that are interested in llm-inference are comparing it to the libraries listed below

Sorting:

AymenKallala / RAG_Maestro
Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.
☆168Updated last year
anakin87 / mistral-haystack
Mistral + Haystack: build RAG pipelines that rock 🤘
☆105Updated last year
ravi03071991 / KT_Generator
Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.
☆74Updated last year
fadynakhla / dr-claude
Anthropic Claude2 Hackathon:Building MCTS with Claude for optimal action prediction during patient/doctor interactions.
☆105Updated last year
run-llama / llamaindex_aws_ingestion
☆89Updated last year
katanaml / llm-ollama-llamaindex-invoice-cpu
Data extraction with LLM on CPU
☆113Updated last year
titanml / takeoff-community
TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…
☆114Updated last year
timho102003 / NewsGPT
☆222Updated last year
hwchase17 / conversation-qa-gradio
☆52Updated 2 years ago
aniketmaurya / fastserve-ai
Machine Learning Serving focused on GenAI with simplicity as the top priority.
☆59Updated 3 weeks ago
katanaml / llm-mistral-invoice-cpu
Data extraction with LLM on CPU
☆268Updated last year
georgesung / LLM-WikipediaQA
Document Q&A on Wikipedia articles using LLMs
☆78Updated last year
ccurme / yolopandas
☆199Updated 2 years ago
aigeek0x0 / rag-with-langchain-colbert-and-ragatouille
Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB
☆122Updated last year
BeastByteAI / agent_dingo
A microframework for creating simple AI agents.
☆91Updated last year
run-llama / ai-engineer-workshop
☆185Updated last year
grski / bRAG
☆48Updated last year
andrewnguonly / ChatAbstractions
LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!
☆82Updated last year
Renumics / renumics-rag
Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚
☆191Updated 7 months ago
katanaml / llm-ollama-invoice-cpu
Data extraction with LLM on CPU
☆85Updated last year
davanstrien / data-for-fine-tuning-llms
☆77Updated last year
aigeek0x0 / zephyr-7b-alpha-langchain-chatbot
Chat with PDF using Zephyr 7B Alpha, Langchain, ChromaDB, and Gradio with Free Google Colab
☆136Updated last year
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
PrithivirajDamodaran / Route0x
Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da
☆111Updated 4 months ago
llmco / llamaapi-python
☆92Updated last year
Pan-ML / panml
PanML is a high level generative AI/ML development and analysis library designed for ease of use and fast experimentation.
☆117Updated 2 years ago
shroominic / fastui-chat
💬 minimalistic ChatBot Interface in pure python
☆225Updated last year
AI-ANK / RAGArch
RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented …
☆85Updated last year
neuml / txtinstruct
📚 Datasets and models for instruction-tuning
☆238Updated last year
arcee-ai / DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
☆325Updated 8 months ago