aniketmaurya / llm-inferenceLinks
Large Language Model (LLM) Inference API and Chatbot
☆126Updated last year
Alternatives and similar repositories for llm-inference
Users that are interested in llm-inference are comparing it to the libraries listed below
Sorting:
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆167Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆105Updated last year
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.☆74Updated last year
- ☆52Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆121Updated last year
- Data extraction with LLM on CPU☆267Updated last year
- Data extraction with LLM on CPU☆114Updated last year
- ☆221Updated last year
- ☆185Updated last year
- Chat with PDF using Zephyr 7B Alpha, Langchain, ChromaDB, and Gradio with Free Google Colab☆136Updated last year
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk☆302Updated last week
- Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚☆191Updated 6 months ago
- 💬 minimalistic ChatBot Interface in pure python☆225Updated 11 months ago
- Anthropic Claude2 Hackathon:Building MCTS with Claude for optimal action prediction during patient/doctor interactions.☆105Updated last year
- ☆89Updated last year
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆128Updated last year
- ☆198Updated 2 years ago
- ☆204Updated last year
- Document Q&A on Wikipedia articles using LLMs☆78Updated last year
- A curated collection of interesting applications, repos, and tutorials using large language models (LLM) like GPT-3☆141Updated 2 years ago
- ☆92Updated last year
- Data extraction with LLM on CPU☆85Updated last year
- ☆75Updated last year
- Data extraction with LLM on CPU☆68Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆81Updated last year
- Using LlamaIndex with Ray for productionizing LLM applications☆71Updated last year
- ☆99Updated last year
- ☆77Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆105Updated 3 months ago
- Web App for generating synthetic data☆48Updated 10 months ago