aniketmaurya / fastserve-aiView external linksLinks
Machine Learning Serving focused on GenAI with simplicity as the top priority.
☆59Jan 5, 2026Updated last month
Alternatives and similar repositories for fastserve-ai
Users that are interested in fastserve-ai are comparing it to the libraries listed below
Sorting:
- This is an example of creating an AI agent with flowchart☆12Jul 22, 2024Updated last year
- This repository contains the implementation of evaluation metrics for recommendation systems. We have compared similarity, candidate gene…☆27Feb 21, 2025Updated 11 months ago
- yolosegment2labelme - a Python package that allows you to convert YOLO segmentation prediction results to LabelMe and anylabeling JSON fo…☆10May 8, 2024Updated last year
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆15Jan 9, 2025Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Jan 2, 2025Updated last year
- Streamlit Cookbook, published by Packt☆14Jun 6, 2025Updated 8 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated last year
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆12Sep 19, 2024Updated last year
- ChatBot App built using LangChain and Lightning AI☆18Mar 4, 2023Updated 2 years ago
- AI assistant that Intuitively Adapts to You☆79Feb 2, 2024Updated 2 years ago
- Triton Server Component for lightning.ai☆14Feb 15, 2023Updated 3 years ago
- Rust implementation of Surya☆65Mar 1, 2025Updated 11 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆138Jul 25, 2024Updated last year
- Material for the series of seminars on Large Language Models☆34Apr 21, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆267Dec 4, 2025Updated 2 months ago
- A SaaS Startup using Generative AI. This is a code bug fixer SaaS built using Azure OpenAI, Stripe, SQLite, and web technologies.☆16Sep 22, 2023Updated 2 years ago
- Securing LLM's Against Top 10 OWASP Large Language Model Vulnerabilities 2024☆20May 10, 2024Updated last year
- ☆119Dec 18, 2024Updated last year
- Incredibly descriptive audiovisual summaries for videos☆41Aug 2, 2024Updated last year
- A Holistic Embodied Cognition Benchmark☆18Apr 3, 2025Updated 10 months ago
- Use GitHub Actions to send a tweet when you make a new release☆18Dec 14, 2020Updated 5 years ago
- ☆20Jan 3, 2024Updated 2 years ago
- Build Agentic workflows with function calling using open LLMs☆28Feb 2, 2026Updated 2 weeks ago
- Reference code base for ML Engineering in Action, Manning Publications Author: Ben Wilson☆20Oct 22, 2023Updated 2 years ago
- This project allows you to plug in a GitHub repository URL, generate vectors for a LLM and use ChatGPT models to interact. The main frame…☆19Jun 4, 2023Updated 2 years ago
- Code repository for the paper - "Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass"☆21Aug 22, 2024Updated last year
- serving a torch model using Celery, Redis and RabbitMQ to serve users asynchronously☆25Jan 21, 2024Updated 2 years ago
- AGI SDK☆382Updated this week
- RayGen: Multi-Modal Dataset Reinforcement for MobileCLIP and MobileCLIP2☆38Aug 29, 2025Updated 5 months ago
- ☆96Mar 26, 2024Updated last year
- This code implements a Radial Basis Function (RBF) based Kolmogorov-Arnold Network (KAN) for function approximation.☆29Jun 15, 2024Updated last year
- ☆26Feb 15, 2023Updated 3 years ago
- LLM Engineering CrashCourse☆101Feb 17, 2024Updated 2 years ago
- Data Structures with Python(AIX20001) 강의 자료실☆18Jun 14, 2024Updated last year
- Python Script for Structuring data from SEC Form D filings using DuckDB and Python with a display layer using Evidence☆28Aug 17, 2024Updated last year
- ☆27Jul 9, 2024Updated last year
- ☆34Aug 16, 2024Updated last year
- low-code multi-agent automation framework☆264Oct 22, 2025Updated 3 months ago
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…☆1,442Updated this week