cyd3r / notify-free-gpu
A telegram bot that sends you a message when the GPU is in use
☆9Updated 10 months ago
Alternatives and similar repositories for notify-free-gpu:
Users that are interested in notify-free-gpu are comparing it to the libraries listed below
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆80Updated this week
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆28Updated 4 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆42Updated 4 months ago
- Official implementation of ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis☆34Updated 2 months ago
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024)☆32Updated last year
- (CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction☆27Updated 10 months ago
- Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry"☆13Updated last year
- Triton implement of bi-directional (non-causal) linear attention☆44Updated last month
- The official repo of continuous speculative decoding☆25Updated this week
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆44Updated 2 weeks ago
- ☆30Updated 10 months ago
- Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)☆46Updated 6 months ago
- This is the official code release for [LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors](https://arxiv…☆37Updated 5 months ago
- ☆17Updated last year
- FlexAttention w/ FlashAttention3 Support☆26Updated 5 months ago
- Paper survey of efficient computation for large scale models.☆32Updated 3 months ago
- ☆21Updated 3 weeks ago
- GIFT: Generative Interpretable Fine-Tuning☆20Updated 5 months ago
- [IPDPS 2024] Adaptive neighbor sampling for temporal GNN☆12Updated last month
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Updated 5 months ago
- Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024).☆22Updated 9 months ago
- ☆14Updated 4 months ago
- An operation trying to do the opposite of F.grid_sample☆20Updated last year
- MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248☆51Updated 9 months ago
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"☆15Updated 2 weeks ago
- [NeurIPS 2023] CircuitFormer: Circuit as Set of Points☆38Updated last year
- Hierarchical State Space Models☆44Updated 11 months ago
- ☆31Updated 9 months ago