A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders
☆25Feb 21, 2025Updated last year
Alternatives and similar repositories for VisCompare
Users that are interested in VisCompare are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A record of reading list on some MLsys popular topic☆23Mar 20, 2025Updated last year
- A selective knowledge distillation algorithm for efficient speculative decoders☆36Nov 27, 2025Updated 3 months ago
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity☆71Mar 10, 2026Updated 2 weeks ago
- A sparse attention kernel supporting mix sparse patterns☆480Jan 18, 2026Updated 2 months ago
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring☆273Jul 6, 2025Updated 8 months ago
- [ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference☆377Jul 10, 2025Updated 8 months ago
- [ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads☆531Feb 10, 2025Updated last year
- ☆30Jan 24, 2026Updated 2 months ago
- MS108 Course Project, SJTU ACM Class.☆33Dec 20, 2022Updated 3 years ago
- ☆14Jul 17, 2024Updated last year
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation☆421Apr 25, 2025Updated 10 months ago
- [ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training☆262Aug 9, 2025Updated 7 months ago
- ☆121Feb 17, 2026Updated last month
- Real-Time VLAs via Future-state-aware Asynchronous Inference.☆352Mar 6, 2026Updated 2 weeks ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆650Oct 16, 2024Updated last year
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max)☆42Feb 10, 2026Updated last month
- Code for Sergeev et al. (2020)☆14Apr 15, 2023Updated 2 years ago
- [NOTE] I do not have enough ressources to maintain VMS, please use Ostris's AI-Tookit instead☆43Oct 3, 2025Updated 5 months ago
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"☆23Sep 1, 2025Updated 6 months ago
- Notes and exercises from Mathematical Logic, 2nd ed. by Ebbinghaus☆13Sep 19, 2016Updated 9 years ago
- テスト姬 (评测姬)☆34Mar 13, 2026Updated last week
- Model Compression Toolbox for Large Language Models and Diffusion Models☆764Aug 14, 2025Updated 7 months ago
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆37Jun 20, 2025Updated 9 months ago
- Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling”☆140Mar 7, 2026Updated 2 weeks ago
- [EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization☆38Sep 24, 2024Updated last year
- This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Act…☆17Oct 25, 2024Updated last year
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆92Mar 12, 2026Updated last week
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated last year
- [NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation☆588Nov 11, 2025Updated 4 months ago
- [CVPR'24] Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression☆15Jul 1, 2024Updated last year
- 16-fold memory access reduction with nearly no loss☆108Mar 26, 2025Updated 11 months ago
- ☆11May 24, 2024Updated last year
- Memory-optimized training scripts for video models based on Diffusers☆14Jan 3, 2025Updated last year
- ☆21Dec 15, 2025Updated 3 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- a simple variational auto encoder with some exploration☆12Nov 22, 2024Updated last year
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆145Dec 4, 2024Updated last year
- ☆19Dec 4, 2025Updated 3 months ago
- Example implementation of "Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles" by Buu Phan, …☆18Jan 22, 2026Updated 2 months ago