CHAI is a library for dynamic pruning of attention heads for efficient LLM inference.
☆24Dec 11, 2024Updated last year
Alternatives and similar repositories for chai
Users who are interested in chai are comparing it to the libraries listed below.
- This repository contains the implementation of DPMLBench: Holistic Evaluation of Differentially Private Machine Learning☆11Nov 24, 2023Updated 2 years ago
- ☆19Mar 22, 2024Updated 2 years ago
- Trace Replay and Network Simulation Framework☆21Apr 14, 2021Updated 5 years ago
- ☆10Mar 31, 2022Updated 4 years ago
- Customized Inference Engine for Multiverse Models☆25Jun 27, 2025Updated 11 months ago
- Data structures course project: Huffman encoding/decoding☆12Dec 19, 2021Updated 4 years ago
- ☆14Apr 21, 2023Updated 3 years ago
- Attachment resources for notes on preparing for the new TOEFL☆14Oct 7, 2023Updated 2 years ago
- APEX+ is an LLM Serving Simulator☆48Jun 16, 2025Updated 11 months ago
- Minimum docker/fastapi/celery/flower setup☆11Aug 13, 2021Updated 4 years ago
- A BLE/GATT based data and control plane for the Internet of Things.☆16Feb 4, 2017Updated 9 years ago
- CheepSync is an open source time synchronization service for BLE advertisers in ADV_NONCONN_IND mode☆12Oct 19, 2015Updated 10 years ago
- SeeSo(Eye-Tracking SDK) Sample vanillaJS script.☆14Oct 17, 2023Updated 2 years ago
- ☆14Dec 26, 2022Updated 3 years ago
- The iOS application evaluating Multipath TCP and Multipath QUIC☆14Jan 28, 2020Updated 6 years ago
- Declare your datasets and download them using a simple tool☆14Aug 2, 2024Updated last year
- Metis: Understanding and Enhancing Regular Expressions in Network☆14Aug 19, 2022Updated 3 years ago
- Planter is a modular framework for realising in-network machine learning algorithms in one click.☆27Jun 13, 2024Updated 2 years ago
- A lightweight wrapper around https://github.com/facebookresearch/encodec that enables dynamic streamed reading, seeking, metadata and GPU…☆15May 5, 2024Updated 2 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- AudioSR-Upsampling (any -> 48kHz)☆42Feb 13, 2024Updated 2 years ago
- ☆28Jun 12, 2023Updated 3 years ago
- Word macros I frequently use at work (original)☆14Oct 21, 2022Updated 3 years ago
- ☆14Jun 24, 2022Updated 3 years ago
- Experimental GitHub read-only mirror of ns-3 development repository, will be kept in sync with original Mercurial repository; pull reques…☆13Sep 27, 2017Updated 8 years ago
- [ICCV 2023] DataDAM: Efficient Dataset Distillation with Attention Matching☆34Jun 20, 2024Updated last year
- FedD3: Federated Learning via Decentralized Dataset Distillation☆30Apr 8, 2023Updated 3 years ago
- Code to reproduce experiments in "Antipodes of Label Differential Privacy PATE and ALIBI"☆32Apr 25, 2022Updated 4 years ago
- Framework to reduce autotune overhead to zero for well known deployments.☆101Sep 19, 2025Updated 8 months ago
- A network-level collaboration framework for personal mobile devices☆15Jun 24, 2020Updated 5 years ago
- This library is a location of the LegacyLogger for PyTorch Lightning.☆26Sep 4, 2025Updated 9 months ago
- The Shape of Data: Intrinsic Distance for Comparing Data Distributions☆12Sep 25, 2019Updated 6 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Dec 12, 2024Updated last year
- RND1: Scaling Diffusion Language Models☆184Feb 22, 2026Updated 3 months ago
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- ☆17Jun 7, 2020Updated 6 years ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆28May 16, 2024Updated 2 years ago
- Hide some secret 😎 data in a Neural Network - text, malicious software or watermark your NN☆41Jun 29, 2022Updated 3 years ago
- logit lens for VGGT☆28Dec 2, 2025Updated 6 months ago