Xtra-Computing / raintreebookLinks
A book about Ph.D. student and research career planning
☆28Updated 2 months ago
Alternatives and similar repositories for raintreebook
Users that are interested in raintreebook are comparing it to the libraries listed below
Sorting:
- Python library for data stream learning☆26Updated last year
- OEBench: Investigating Open Environment Challenges in Real-World Relational Data Streams (VLDB 2024)☆13Updated last year
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆203Updated last year
- ☆156Updated 5 months ago
- ☆19Updated 6 months ago
- [ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo☆56Updated 4 months ago
- A framework for generating realistic LLM serving workloads☆93Updated 2 months ago
- [SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference☆81Updated 2 weeks ago
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)☆49Updated last year
- Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.☆106Updated 3 months ago
- ☆26Updated 3 months ago
- ☆144Updated last year
- ☆54Updated 3 months ago
- Fast Parallel Probabilistic Graphical Model Learning and Inference [IPDPS'22, PPoPP'23, USENIX ATC'24]☆78Updated last month
- GBDT-based model with efficient unlearning (SIGMOD 2023)☆10Updated 3 months ago
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆29Updated last year
- ☆31Updated last year
- ☆47Updated 3 years ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"☆74Updated 2 months ago
- FGNN's artifact evaluation (EuroSys 2022)☆17Updated 3 years ago
- ☆42Updated last year
- Dorylus: Affordable, Scalable, and Accurate GNN Training☆76Updated 4 years ago
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"☆11Updated last year
- GPU-accelerated vector query processing system that supports large vector datasets beyond GPU memory.☆37Updated last year
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank☆66Updated last year
- ☆23Updated last year
- ☆102Updated last year
- Surrogate-based Hyperparameter Tuning System☆27Updated 2 years ago
- A resilient distributed training framework☆96Updated last year
- Modular and structured prompt caching for low-latency LLM inference☆105Updated last year