aniketmaurya / llm-inference
Large Language Model (LLM) Inference API and Chatbot
★125 · Updated last year
Alternatives and similar repositories for llm-inference:
Users that are interested in llm-inference are comparing it to the libraries listed below
- Mistral + Haystack: build RAG pipelines that rock ★103 · Updated last year
- Building a chatbot powered by a RAG pipeline to read, summarize, and quote the most relevant papers related to the user query. ★166 · Updated 11 months ago
- ★88 · Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB ★119 · Updated last year
- Repository of the code base for the KT Generation process that we worked on at the Google Cloud and Searce GenAI Hackathon. ★74 · Updated last year
- A curated collection of interesting applications, repos, and tutorials using large language models (LLM) like GPT-3 ★141 · Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ★101 · Updated this week
- Data extraction with LLM on CPU ★112 · Updated last year
- Chat Bot with LLM and Fact Reference. RAG (Retrieval Augmented Generation) and LangChain-backed ★128 · Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ★49 · Updated 8 months ago
- ★40 · Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more! ★80 · Updated last year
- Data extraction with LLM on CPU ★85 · Updated last year
- GenAI Experimentation ★58 · Updated 2 months ago
- Data extraction with LLM on CPU ★266 · Updated last year
- ★76 · Updated 9 months ago
- Document Q&A on Wikipedia articles using LLMs ★75 · Updated last year
- ★51 · Updated last year
- A microframework for creating simple AI agents. ★91 · Updated 8 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ★67 · Updated 5 months ago
- Notebooks using the Neural Magic libraries ★42 · Updated 8 months ago
- Using LlamaIndex with Ray for productionizing LLM applications ★71 · Updated last year
- ★41 · Updated last year
- A collection of hands-on notebooks for LLM practitioners ★47 · Updated 2 months ago
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented … ★84 · Updated last year
- ★57 · Updated last year
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk ★291 · Updated last month
- Anthropic Claude2 Hackathon: Building MCTS with Claude for optimal action prediction during patient/doctor interactions. ★104 · Updated last year
- My Digital Palace - A Personal Journal for Reflection - A place to store all my thoughts ★48 · Updated 3 weeks ago
- ★45 · Updated 11 months ago