Derecho-Project / cascade
A C++ distributed framework for responsive Cloud applications.
☆79 Updated 2 weeks ago
Alternatives and similar repositories for cascade
Users who are interested in cascade are comparing it to the libraries listed below.
- ScalarLM - a unified training and inference stack ☆44 Updated 2 weeks ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆87 Updated last week
- Estimating hardware and cloud costs of LLMs and transformer projects ☆17 Updated 3 weeks ago
- Exploration of Vector database Index for fast approximate nearest neighbour search. ☆28 Updated 11 months ago
- BlindBox is a tool to isolate and deploy applications inside Trusted Execution Environments for privacy-by-design apps ☆58 Updated last year
- This is a landscape of the infrastructure that powers the generative AI ecosystem ☆147 Updated 9 months ago
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho… ☆107 Updated 2 months ago
- ☆67 Updated last month
- Intent Driven Orchestration enables management of applications through their Service Level Objectives, while minimizing developer and adm… ☆38 Updated 3 months ago
- An LLM inference engine, written in C++ ☆15 Updated last month
- Transformer GPU VRAM estimator ☆66 Updated last year
- Pretrain, finetune and serve LLMs on Intel platforms with Ray ☆129 Updated last week
- Horizon chart for CPU/GPU/Neural Engine utilization monitoring. Supports Apple M1-M4, Nvidia GPUs, AMD GPUs ☆25 Updated 3 weeks ago
- DCPerf benchmark suite for hyperscale cloud applications ☆191 Updated last week
- Runner in charge of collecting metrics from LLM inference endpoints for the Unify Hub ☆17 Updated last year
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ☆49 Updated 2 weeks ago
- ☆62 Updated 5 months ago
- Transactional functions-as-a-service for database-oriented applications. ☆152 Updated last year
- A tool to detect infrastructure issues on cloud native AI systems ☆42 Updated last month
- ☆25 Updated this week
- Tenstorrent console based hardware information program ☆45 Updated last week
- All-in-Storage Solution based on DiskANN for DRAM-free Approximate Nearest Neighbor Search ☆65 Updated 2 weeks ago
- ML model training for edge devices ☆165 Updated last year
- GPU documentation for humans ☆81 Updated last week
- Compression for Foundation Models ☆33 Updated 3 months ago
- How much energy do GenAI models consume? ☆45 Updated 2 months ago
- Tutorial to get started with SkyPilot! ☆58 Updated last year
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks. ☆62 Updated 5 months ago
- A curated list of awesome Compound AI Systems ☆30 Updated 2 weeks ago
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia ☆32 Updated last year