Derecho-Project / cascadeLinks
A C++ distributed framework for responsive Cloud applications.
☆82Updated last month
Alternatives and similar repositories for cascade
Users that are interested in cascade are comparing it to the libraries listed below
Sorting:
- Transactional functions-as-a-service for database-oriented applications.☆156Updated 2 years ago
- ScalarLM - a unified training and inference stack☆93Updated last month
- Open Source Continuous Inference Benchmarking - GB200 NVL72 vs MI355X vs B200 vs H200 vs MI325X & soon™ TPUv6e/v7/Trainium2/3/GB300 NVL72…☆400Updated this week
- Perplexity open source garden for inference technology☆306Updated 2 weeks ago
- Rust crates for XetHub☆75Updated last year
- Lightweight daemon for monitoring CUDA runtime API calls with eBPF uprobes☆143Updated 8 months ago
- ☆43Updated this week
- ☆461Updated last month
- DCPerf benchmark suite for hyperscale cloud applications☆225Updated this week
- The main code repository for the Derecho project.☆204Updated last week
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning☆225Updated last week
- ☆72Updated 10 months ago
- Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the…☆356Updated this week
- Measure and optimize the energy consumption of your AI applications!☆325Updated last month
- torchcomms: a modern PyTorch communications API☆309Updated last week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆131Updated 3 months ago
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆39Updated 3 weeks ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆202Updated 3 months ago
- ☆234Updated 6 months ago
- CUDA checkpoint and restore utility☆397Updated 3 months ago
- Awesome List of Vector DB resources☆174Updated 2 years ago
- A Lossless Compression Library for AI pipelines☆289Updated 5 months ago
- AI/GPU flame graph☆232Updated 2 months ago
- ☆273Updated last week
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.☆95Updated this week
- Horizon chart for CPU/GPU/Neural Engine utilization monitoring. Supports Apple M1-M4, Nvidia GPUs, AMD GPUs☆26Updated 4 months ago
- Route LLM requests to the best model for the task at hand.☆145Updated this week
- Hand-Rolled GPU communications library☆76Updated last month
- PuppyGraph standalone web server for visualize graph queries.☆45Updated 10 months ago
- build your own vector database -- the littlest hnsw☆67Updated 11 months ago