Derecho-Project / cascadeLinks
A C++ distributed framework for responsive Cloud applications.
☆82Updated last week
Alternatives and similar repositories for cascade
Users that are interested in cascade are comparing it to the libraries listed below
Sorting:
- ScalarLM - a unified training and inference stack☆93Updated last week
- build your own vector database -- the littlest hnsw☆66Updated 10 months ago
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆39Updated last week
- Intent Driven Orchestration enables management of applications through their Service Level Objectives, while minimizing developer and adm…☆45Updated last week
- Transactional functions-as-a-service for database-oriented applications.☆156Updated 2 years ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆130Updated 2 months ago
- Lightweight daemon for monitoring CUDA runtime API calls with eBPF uprobes☆140Updated 8 months ago
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"☆147Updated last year
- Route LLM requests to the best model for the task at hand.☆133Updated last week
- Perplexity open source garden for inference technology☆274Updated last week
- CUDA checkpoint and restore utility☆393Updated 2 months ago
- Open Source Continuous Inference Benchmarking - GB200 NVL72 vs MI355X vs B200 vs H200 vs MI325X & soon™ TPUv6e/v7/Trainium2/3/GB300 NVL72…☆383Updated this week
- ☆72Updated 9 months ago
- Tutorial to get started with SkyPilot!☆58Updated last year
- Messaging and state layer for distributed serverless applications☆68Updated last month
- Measure and optimize the energy consumption of your AI applications!☆311Updated last week
- A Lossless Compression Library for AI pipelines☆286Updated 4 months ago
- Finetune LLMs on K8s by using Runbooks☆170Updated last year
- ☆233Updated 5 months ago
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆79Updated 10 months ago
- Rust crates for XetHub☆71Updated last year
- A minimalistic C++ Jinja templating engine for LLM chat templates☆198Updated 2 months ago
- ReLM is a Regular Expression engine for Language Models☆107Updated 2 years ago
- Benchmark and optimize LLM inference across frameworks with ease☆138Updated 2 months ago
- DCPerf benchmark suite for hyperscale cloud applications☆218Updated last week
- All-in-Storage Solution based on DiskANN for DRAM-free Approximate Nearest Neighbor Search☆90Updated 5 months ago
- ☆52Updated last year
- Exploration of Vector database Index for fast approximate nearest neighbour search.☆32Updated last year
- ☆455Updated this week
- Source code for Intel's Polite Guard NLP project☆37Updated 3 months ago