Derecho-Project / cascade

A C++ distributed framework for responsive Cloud applications.

☆79

Alternatives and similar repositories for cascade

Users who are interested in cascade compare it to the libraries listed below.

tensorwavecloud / ScalarLM
ScalarLM - a unified training and inference stack
☆44 Updated 2 weeks ago
EmbeddedLLM / vllm
vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
☆87 Updated last week
isEmmanuelOlowe / llm-cost-estimator
Estimating hardware and cloud costs of LLMs and transformer projects
☆17 Updated 3 weeks ago
sushrut141 / vamana
Exploration of Vector database Index for fast approximate nearest neighbour search.
☆28 Updated 11 months ago
mithril-security / blindbox
BlindBox is a tool to isolate and deploy applications inside Trusted Execution Environments for privacy-by-design apps
☆58 Updated last year
tensorchord / ai-infra-landscape
This is a landscape of the infrastructure that powers the generative AI ecosystem
☆147 Updated 9 months ago
groq / groqflow
GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…
☆107 Updated 2 months ago
illinoisdata / kishu
☆67 Updated last month
intel / intent-driven-orchestration
Intent Driven Orchestration enables management of applications through their Service Level Objectives, while minimizing developer and adm…
☆38 Updated 3 months ago
microsoft / glinthawk
An LLM inference engine, written in C++
☆15 Updated last month
furiousteabag / vram-calculator
Transformer GPU VRAM estimator
☆66 Updated last year
intel / llm-on-ray
Pretrain, finetune and serve LLMs on Intel platforms with Ray
☆129 Updated last week
okuvshynov / cubestat
Horizon chart for CPU/GPU/Neural Engine utilization monitoring. Supports Apple M1-M4, Nvidia GPUs, AMD GPUs
☆25 Updated 3 weeks ago
facebookresearch / DCPerf
DCPerf benchmark suite for hyperscale cloud applications
☆191 Updated last week
unifyai / aibench-llm-endpoints
Runner in charge of collecting metrics from LLM inference endpoints for the Unify Hub
☆17 Updated last year
mobiusml / aana_sdk
Aana SDK is a powerful framework for building AI enabled multimodal applications.
☆49 Updated 2 weeks ago
ppl-ai / libfabric-efa-demo
☆62 Updated 5 months ago
DBOS-project / apiary
Transactional functions-as-a-service for database-oriented applications.
☆152 Updated last year
IBM / autopilot
A tool to detect infrastructure issues on cloud native AI systems
☆42 Updated last month
skypilot-org / skypilot-catalog
☆25 Updated this week
tenstorrent / tt-smi
Tenstorrent console based hardware information program
☆45 Updated last week
kioxia-jp / aisaq-diskann
All-in-Storage Solution based on DiskANN for DRAM-free Approximate Nearest Neighbor Search
☆65 Updated 2 weeks ago
ShishirPatil / poet
ML model training for edge devices
☆165 Updated last year
modal-labs / gpu-glossary
GPU documentation for humans
☆81 Updated last week
eth-easl / deltazip
Compression for Foundation Models
☆33 Updated 3 months ago
ml-energy / leaderboard
How much energy do GenAI models consume?
☆45 Updated 2 months ago
skypilot-org / skypilot-tutorial
Tutorial to get started with SkyPilot!
☆58 Updated last year
CentML / DeepView.Profile
🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
☆62 Updated 5 months ago
compound-ai-systems / awesome-compound-ai-systems
A curated list of awesome Compound AI Systems
☆30 Updated 2 weeks ago
prem-research / prem-operator
📡 Deploy AI models and apps to Kubernetes without developing a hernia
☆32 Updated last year