Machine Learning Serving focused on GenAI with simplicity as the top priority.
☆60Apr 6, 2026Updated last month
Alternatives and similar repositories for fastserve-ai
Users that are interested in fastserve-ai are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is an example of creating an AI agent with flowchart☆12Jul 22, 2024Updated last year
- Streamlit Cookbook, published by Packt☆14Jun 6, 2025Updated 11 months ago
- A template to kick-start your Python project ✨🚀☆54Jul 19, 2025Updated 10 months ago
- Triton Server Component for lightning.ai☆14Feb 15, 2023Updated 3 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Jan 2, 2025Updated last year
- Multi-Agent AI App from Scratch in python without any depedency of framework☆15Jan 7, 2025Updated last year
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆13Sep 19, 2024Updated last year
- Explore 160+ notebook visual analytics tools in your browser!☆66Mar 29, 2024Updated 2 years ago
- A collection of interesting links, articles, research papers and projects related to knowledge graphs, GenAI and LLMs (large language mod…☆28Jul 5, 2024Updated last year
- Track and Collaborate on ML & AI Experiments.☆44Mar 10, 2025Updated last year
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆20Jan 9, 2025Updated last year
- Personal README for Github profile.☆10Jun 12, 2023Updated 2 years ago
- AI assistant that Intuitively Adapts to You☆79Feb 2, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Find the optimal model serving solution for 🤗 Hugging Face models 🚀☆45Jul 20, 2025Updated 10 months ago
- Rust implementation of Surya☆67Mar 1, 2025Updated last year
- Reference code base for ML Engineering in Action, Manning Publications Author: Ben Wilson☆21Oct 22, 2023Updated 2 years ago
- Material for the series of seminars on Large Language Models☆34Apr 21, 2024Updated 2 years ago
- This Streamlit application creates an interactive Data Visualization Assistant that can understand Natural Language Queries and generate …☆18Jan 13, 2025Updated last year
- Build Agentic workflows with function calling using open LLMs☆28May 4, 2026Updated 3 weeks ago
- Incredibly descriptive audiovisual summaries for videos☆41Aug 2, 2024Updated last year
- Securing LLM's Against Top 10 OWASP Large Language Model Vulnerabilities 2024☆23May 10, 2024Updated 2 years ago
- serving a torch model using Celery, Redis and RabbitMQ to serve users asynchronously☆25Jan 21, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A high-throughput and memory-efficient inference and serving engine for LLMs☆266Dec 4, 2025Updated 5 months ago
- Efficient vector database for hundred millions of embeddings.☆215May 17, 2024Updated 2 years ago
- DiffusionWithAutoscaler☆29Apr 2, 2024Updated 2 years ago
- ☆26Feb 15, 2023Updated 3 years ago
- Median is an open-source flashcard application that leverages the power of spaced repetition and artificial intelligence to transform the…☆22Nov 4, 2024Updated last year
- An intelligence operating system☆394May 22, 2026Updated last week
- Cookbook for Crafting Good Code☆57Mar 19, 2024Updated 2 years ago
- ☆97Mar 26, 2024Updated 2 years ago
- A prompting library☆192Jul 1, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Jan 3, 2024Updated 2 years ago
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆32Nov 12, 2023Updated 2 years ago
- ☆16Oct 17, 2024Updated last year
- [deprecated] AI Gateway - core infrastructure stack for building production-ready AI Applications☆160Apr 8, 2024Updated 2 years ago
- a flying dog eating bones☆19Jun 22, 2024Updated last year
- Testing FPC1020 fingerprint sensors with Arduino☆10Mar 25, 2020Updated 6 years ago
- Retrieval-Augmented Generation with pgvector as vector database☆13Jan 23, 2024Updated 2 years ago