☆102 · Jul 2, 2023 · Updated 2 years ago
Alternatives and similar repositories for aisys2023
Users that are interested in aisys2023 are comparing it to the libraries listed below.
- ☆75 · Nov 21, 2024 · Updated last year
- Summary of system papers/frameworks/codes/tools on training or serving large model ☆57 · Dec 17, 2023 · Updated 2 years ago
- Know Your Enemy To Save Cloud Energy: Energy-Performance Characterization of Machine Learning Serving (HPCA '23) ☆14 · Jun 20, 2025 · Updated 10 months ago
- Network Contention-Aware Cluster Scheduling with Reinforcement Learning (IEEE ICPADS'23) ☆20 · Jul 8, 2025 · Updated 10 months ago
- [⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI ☆50 · Jun 25, 2025 · Updated 10 months ago
- ☆33 · Apr 28, 2026 · Updated last week
- Region-level profiling for CUDA kernels with trace, NVBit, CUPTI, NSys, and an interactive Explorer. ☆115 · Apr 17, 2026 · Updated 3 weeks ago
- ☆60 · Sep 18, 2025 · Updated 7 months ago
- FriendliAI Model Hub ☆90 · Jun 9, 2022 · Updated 3 years ago
- MIST: High-performance IoT Stream Processing ☆18 · Mar 19, 2019 · Updated 7 years ago
- Autoware reference system integrated with the PAAM framework ☆11 · Apr 8, 2024 · Updated 2 years ago
- Load & manage evolving datasets efficiently ☆23 · Aug 22, 2025 · Updated 8 months ago
- 🧙🏻 Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing… ☆21 · Dec 20, 2024 · Updated last year
- 📸 Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation" ☆22 · Sep 5, 2023 · Updated 2 years ago
- Cruise: A Distributed Machine Learning Framework with Automatic System Configuration ☆26 · Mar 19, 2019 · Updated 7 years ago
- Official Pytorch Implementation of Unsupervised Representation Learning for Binary Networks by Joint Classifier Training (CVPR 2022) ☆11 · Apr 10, 2022 · Updated 4 years ago
- ☆10 · Mar 3, 2024 · Updated 2 years ago
- ☆18 · Dec 4, 2017 · Updated 8 years ago
- ☆48 · Sep 7, 2024 · Updated last year
- ☆27 · Aug 31, 2023 · Updated 2 years ago
- A curated list of awesome Korean newsletters ☆48 · Jul 4, 2025 · Updated 10 months ago
- [ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access ☆56 · Aug 6, 2025 · Updated 9 months ago
- A GPU Cluster Simulator for Distributed Deep Learning Training. ☆11 · Jan 15, 2022 · Updated 4 years ago
- Cache design for CNN on mobile ☆33 · Jul 31, 2018 · Updated 7 years ago
- Four Kims, one Jin, and one Lim have gathered to study artificial intelligence. ☆13 · Jun 30, 2021 · Updated 4 years ago
- Learn Simply ☆13 · Dec 2, 2025 · Updated 5 months ago
- ☆90 · Mar 28, 2024 · Updated 2 years ago
- ☆13 · Apr 13, 2026 · Updated 3 weeks ago
- ☆158 · Oct 9, 2024 · Updated last year
- Large Language Model (LLM) Systems Paper List ☆1,957 · Apr 17, 2026 · Updated 3 weeks ago
- Thunder Research Group's Collective Communication Library ☆53 · Jul 8, 2025 · Updated 10 months ago
- ☆328 · Jan 22, 2024 · Updated 2 years ago
- [ICML 2025] Efficiently Serving Large Multimodal Models Using EPD Disaggregation ☆24 · May 29, 2025 · Updated 11 months ago
- Demonstration of flow control over RDMA fabric ☆13 · Jun 28, 2018 · Updated 7 years ago
- Measure and optimize the energy consumption of your AI applications! ☆356 · Apr 28, 2026 · Updated last week
- Lightweight and Parallel Deep Learning Framework ☆262 · Nov 26, 2022 · Updated 3 years ago
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems ☆256 · Mar 19, 2026 · Updated last month
- Dynamic Memory Management for Serving LLMs without PagedAttention ☆483 · May 30, 2025 · Updated 11 months ago
- An experimental deoplete source for LaTeX completion ☆12 · Oct 14, 2017 · Updated 8 years ago