PKUFlyingPig / Hadoop_vs_Spark
研究生课《网络大数据管理理论和应用》大作业项目代码
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Hadoop_vs_Spark
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆109Updated last month
- PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References☆9Updated 4 months ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆60Updated 2 weeks ago
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆22Updated this week
- 分层解耦的深度学习推理引擎☆60Updated 2 months ago
- Summary of system papers/frameworks/codes/tools on training or serving large model☆56Updated 10 months ago
- 开源软件供应链点亮计划 - 暑期 202X☆6Updated 4 months ago
- Oh-My-Papers: a Hybrid Context-aware Paper Recommendation System☆25Updated last year
- A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters☆32Updated 3 months ago
- Deploy ChatGLM on Modelz☆15Updated last year
- Stanford CS149 -- Assignment 1☆16Updated 3 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆14Updated this week
- Materials for learning SGLang☆75Updated this week
- Quantized Attention on GPU☆29Updated this week
- Yet another toy processor implementation☆14Updated 3 years ago
- A minimal implementation of vllm.☆30Updated 3 months ago
- TensorRT LLM Benchmark Configuration☆11Updated 3 months ago
- Wiki fo HPC☆84Updated 10 months ago
- ☆19Updated last year
- DIY Compiler☆45Updated 4 months ago
- Tutorial for assignment of Introduction to Database System☆13Updated last week
- 操作系统 2019 ucore labs☆46Updated 5 years ago
- ☆11Updated 3 years ago
- A TVM-like CUDA/C code generator.☆9Updated 2 years ago
- My solution to the labs of the book "Modern Operating System: Principle and Implementation" by Haibo Chen☆17Updated 3 years ago
- Proposal for the next generation of course-oriented IR.☆10Updated 2 years ago
- ☆31Updated 5 months ago
- ☆41Updated 11 months ago
- A sparse attention kernel supporting mix sparse patterns☆52Updated 3 weeks ago
- Decoding Attention is specially optimized for multi head attention (MHA) using CUDA core for the decoding stage of LLM inference.☆24Updated this week