Derecho-Project / cascadeLinks
A C++ distributed framework for responsive Cloud applications.
☆82Updated 3 weeks ago
Alternatives and similar repositories for cascade
Users that are interested in cascade are comparing it to the libraries listed below
Sorting:
- ScalarLM - a unified training and inference stack☆97Updated 2 months ago
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning☆417Updated last month
- Perplexity open source garden for inference technology☆359Updated last month
- ☆71Updated 11 months ago
- DCPerf benchmark suite for hyperscale cloud applications☆231Updated this week
- Lightweight daemon for monitoring CUDA runtime API calls with eBPF uprobes☆146Updated 10 months ago
- ☆466Updated 2 months ago
- Transactional functions-as-a-service for database-oriented applications.☆155Updated 2 years ago
- build your own vector database -- the littlest hnsw☆67Updated last year
- CUDA checkpoint and restore utility☆410Updated 4 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆203Updated 4 months ago
- An early research stage expert-parallel load balancer for MoE models based on linear programming.☆495Updated 2 months ago
- ☆44Updated this week
- The main code repository for the Derecho project.☆205Updated 3 weeks ago
- Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the…☆362Updated last week
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆64Updated last year
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆55Updated 5 months ago
- Intent Driven Orchestration enables management of applications through their Service Level Objectives, while minimizing developer and adm…☆48Updated 2 months ago
- Open Source Continuous Inference Benchmarking - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100 & soon™ TPUv6e/v7/Trainium2/3- DeepS…☆445Updated this week
- AI-Driven Research Systems (ADRS)☆119Updated last month
- Messaging and state layer for distributed serverless applications☆69Updated 3 months ago
- AI/GPU flame graph☆242Updated 4 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆141Updated 4 months ago
- Rust crates for XetHub☆78Updated last year
- Editor with LLM generation tree exploration☆83Updated 11 months ago
- ☆280Updated this week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆131Updated 4 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆259Updated last month
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Updated this week
- A GPU-driven system framework for scalable AI applications☆124Updated last year