Smart AI model cascading for cost, latency, KPI optimization
☆284Mar 2, 2026Updated this week
Alternatives and similar repositories for cascadeflow
Users that are interested in cascadeflow are comparing it to the libraries listed below
Sorting:
- 🚀 AI LaunchKit - Complete self-hosted AI development toolkit with 50+ pre-configured tools including n8n, bolt.diy, ComfyUI, and more. O…☆95Jan 26, 2026Updated last month
- OpenLR library for Python☆15Jul 14, 2025Updated 7 months ago
- Public website for CodeX Academy.☆12Jan 27, 2023Updated 3 years ago
- ☆34Updated this week
- ☆12Jun 10, 2023Updated 2 years ago
- ☆13Dec 24, 2024Updated last year
- High-level overview on exam topics.☆13Jan 24, 2026Updated last month
- This repository contains the code of the Rasa workshop at PyData NYC 2018☆12Oct 19, 2018Updated 7 years ago
- ☆32Jan 25, 2026Updated last month
- O'Reilly Course, In-Memory Computing Essentials☆10Oct 16, 2020Updated 5 years ago
- LLM inference in C/C++☆25Updated this week
- ☆17Jul 20, 2025Updated 7 months ago
- ☆12Dec 6, 2021Updated 4 years ago
- OW-OVD: Unified Open World and Open Vocabulary Object Detection (CVPR 2025)☆24Dec 2, 2024Updated last year
- Run Claude Code (and codex) to generate a project plan, then run them in a loop for days until they're done☆14Jan 18, 2026Updated last month
- Docker image for Dataiku Science Studio☆10Apr 20, 2017Updated 8 years ago
- ☆18Jun 18, 2025Updated 8 months ago
- Workflow based on github issues.☆11Apr 30, 2019Updated 6 years ago
- Spring Boot: Tips, Tricks and Techniques [Video], published by Packt☆11Aug 1, 2024Updated last year
- Lightweight, model-agnostic chat history compression (trim + summarize) for AI assistants.☆22Sep 14, 2025Updated 5 months ago
- ☆10Mar 18, 2019Updated 6 years ago
- Flexible GraphRAG: Python, LlamaIndex, Docker Compose: 8 Graph dbs, 10 Vector dbs, OpenSearch, Elasticsearch, Alfresco. 13 data sources (…☆106Updated this week
- Knowledge work sdk☆48Feb 23, 2026Updated last week
- ☆12Oct 28, 2015Updated 10 years ago
- Deployment files for Fortio.☆15May 11, 2019Updated 6 years ago
- A Tensorflow-lite segmention example modify form object detect example.☆13Oct 20, 2020Updated 5 years ago
- ☆11Oct 5, 2022Updated 3 years ago
- Python code for Coursera Neural Networks class taught by Professor Geoffrey Hinton☆23Dec 17, 2012Updated 13 years ago
- A helm chart for deploying Neoload Web on your Kubernetes cluster☆13Feb 25, 2026Updated last week
- Istio test drive☆11Jul 18, 2018Updated 7 years ago
- "Speed Gonzales" blurring API for the Panoramax project☆10May 13, 2025Updated 9 months ago
- A REST API uploader and downloader written in Go language☆10Jun 1, 2023Updated 2 years ago
- GraphQL extension for StarUML3☆16Apr 29, 2022Updated 3 years ago
- Generate AGENTS.md from your codebase in one command. Free, instant, no API key☆55Feb 10, 2026Updated 3 weeks ago
- Decred: On-chain atomic swaps for Viacoin, Litecoin and other cryptocurrencies.☆12Jan 30, 2023Updated 3 years ago
- ☆13Jan 11, 2023Updated 3 years ago
- An object oriented bitemporal database layer☆13Nov 16, 2017Updated 8 years ago
- ☆14Aug 16, 2021Updated 4 years ago
- ☆17Jan 9, 2023Updated 3 years ago