aniketmaurya / llm-inference
Large Language Model (LLM) Inference API and Chatbot
β122Updated 7 months ago
Related projects β
Alternatives and complementary repositories for llm-inference
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.β162Updated 6 months ago
- Mistral + Haystack: build RAG pipelines that rock π€β100Updated 9 months ago
- Data extraction with LLM on CPUβ109Updated 10 months ago
- β87Updated 10 months ago
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility andβ¦β74Updated last month
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDBβ116Updated 9 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β48Updated 3 months ago
- β41Updated 7 months ago
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with Lβ¦β76Updated 6 months ago
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.β74Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β164Updated 3 weeks ago
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented β¦β80Updated 9 months ago
- β62Updated 4 months ago
- Chat Bot with LLM and Fact Reference. RAG(Retrieval Augmented Generation) and LangChain backedβ128Updated 6 months ago
- Data extraction with LLM on CPUβ260Updated 7 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraphβ144Updated 7 months ago
- Agentic RAG with Langchain, Qdrant and CrewAIβ36Updated 5 months ago
- Dynamic Metadata based RAG Frameworkβ71Updated 3 months ago
- GenAI Experimentationβ58Updated last month
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ194Updated 6 months ago
- Experimental Code for StructuredRAG: Structured Outputs in Retrieval-Augmented Generationβ90Updated this week
- PanML is a high level generative AI/ML development and analysis library designed for ease of use and fast experimentation.β113Updated last year
- β179Updated last year
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)β74Updated last month
- FastAPI wrapper around DSPyβ212Updated 7 months ago
- Data extraction with LLM on CPUβ64Updated 11 months ago
- β57Updated last year
- β75Updated 5 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β55Updated 3 months ago
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7Bβ119Updated 8 months ago