gta0804 / MASS
Offical implementation of MASS: Multi-Agent Simulation Scaling via Investor Behavior Modeling
☆82Updated this week
Alternatives and similar repositories for MASS
Users that are interested in MASS are comparing it to the libraries listed below
Sorting:
- ShortcutsBench: A Large-Scale Real-World Benchmark for API-Based Agents☆96Updated 3 weeks ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"☆40Updated 5 months ago
- 新燕园人的私人班车助手(非官方)。☆55Updated 2 months ago
- ☆21Updated 11 months ago
- [DAC'25] Official implement of "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference"☆18Updated last week
- ☆19Updated last year
- Course website for Operating System course in Peking University.☆13Updated 3 years ago
- ☆73Updated 3 years ago
- ☆37Updated 6 months ago
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆130Updated 10 months ago
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems☆167Updated 7 months ago
- ☆13Updated 10 months ago
- [ICML 2024] Serving LLMs on heterogeneous decentralized clusters.☆25Updated last year
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)☆39Updated 5 months ago
- Summary of some awesome work for optimizing LLM inference☆73Updated last month
- ☆12Updated 3 months ago
- Open test cases of PKU compiler course.☆25Updated 3 years ago
- This repository is established to store personal notes and annotated papers during daily research.☆122Updated 3 weeks ago
- ☆99Updated last year
- ☆19Updated 9 months ago
- ☆16Updated last year
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…☆125Updated 3 months ago
- Compiler for Dynamic Neural Networks☆46Updated last year
- A Compiler from "Mx* language" (A C++ & Java like language) to RV32I Assembly, with optimizations on LLVM IR. SJTU CS2966 Project.☆11Updated 2 years ago
- Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny BERT model can tell you the verbosity of an …☆33Updated 11 months ago
- Artifacts for our ASPLOS'23 paper ElasticFlow☆51Updated last year
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆22Updated this week
- Code release for AdapMoE accepted by ICCAD 2024☆23Updated 2 weeks ago
- HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.☆24Updated last year
- ☆12Updated this week