aniketmaurya / fastserve-ai
Machine Learning Serving focused on GenAI with simplicity as the top priority.
☆57Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for fastserve-ai
- ☆75Updated 5 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆43Updated 2 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆61Updated 2 weeks ago
- Self-host LLMs with vLLM and BentoML☆74Updated this week
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆54Updated this week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆53Updated 3 weeks ago
- Build Agentic workflows with function calling☆20Updated this week
- Experimental Code for StructuredRAG: Structured Outputs in Retrieval-Augmented Generation☆94Updated this week
- ☆78Updated this week
- LLM reads a paper and produce a working prototype☆36Updated last week
- ☆94Updated 2 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆74Updated 2 months ago
- ☆18Updated this week
- Routing on Random Forest (RoRF)☆84Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆77Updated 5 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆28Updated 11 months ago
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆30Updated last year
- Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm…☆119Updated this week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 2 months ago
- RAG example using DSPy, Gradio, FastAPI☆66Updated 7 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆36Updated 9 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆37Updated 2 months ago
- Simple examples using Argilla tools to build AI☆40Updated this week
- ☆39Updated 9 months ago
- End-to-End LLM Guide☆97Updated 4 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆134Updated 3 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆116Updated 9 months ago
- Embed anything.☆29Updated 5 months ago