A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders
☆26 · Feb 21, 2025 · Updated last year
Alternatives and similar repositories for VisCompare
Users that are interested in VisCompare are comparing it to the libraries listed below.
- [NeurIPS'25 Spotlight] Adaptive Attention Sparsity with Hierarchical Top-p Pruning ☆102 · Apr 20, 2026 · Updated 2 months ago
- A record of reading list on some MLsys popular topic ☆25 · Mar 20, 2025 · Updated last year
- A selective knowledge distillation algorithm for efficient speculative decoders ☆40 · Nov 27, 2025 · Updated 7 months ago
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity ☆77 · Mar 10, 2026 · Updated 3 months ago
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring ☆280 · Jul 6, 2025 · Updated 11 months ago
- [ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads ☆538 · Feb 10, 2025 · Updated last year
- MS108 Course Project, SJTU ACM Class. ☆34 · Dec 20, 2022 · Updated 3 years ago
- ☆14 · Jul 17, 2024 · Updated last year
- Aligntune: A Modular Toolkit for Post Training Alignment of LLMs ☆37 · Jun 26, 2026 · Updated last week
- ☆21 · Sep 28, 2024 · Updated last year
- A Compiler from "Mx* language" (A C++ & Java like language) to RV32I Assembly, with optimizations on LLVM IR. SJTU CS2966 Project. ☆13 · Feb 12, 2023 · Updated 3 years ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems ☆17 · Nov 1, 2025 · Updated 8 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer ☆647 · Oct 16, 2024 · Updated last year
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max) ☆49 · Feb 10, 2026 · Updated 4 months ago
- ☆138 · Feb 17, 2026 · Updated 4 months ago
- SJTU CS2951 Computer Architecture Course Project, A Verilog HDL implemented RISC-V CPU. ☆11 · Jan 15, 2022 · Updated 4 years ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder ☆23 · Oct 19, 2025 · Updated 8 months ago
- Code from PLDI '21 paper "Provable Repair of Deep Neural Networks." ☆10 · Nov 26, 2022 · Updated 3 years ago
- [NOTE] I do not have enough resources to maintain VMS, please use Ostris's AI-Toolkit instead ☆42 · Oct 3, 2025 · Updated 9 months ago
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving" ☆23 · Sep 1, 2025 · Updated 10 months ago
- LaTeX template for a CMU Robotics Institute Thesis ☆33 · Oct 4, 2017 · Updated 8 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jul 22, 2023 · Updated 2 years ago
- Model Compression Toolbox for Large Language Models and Diffusion Models ☆790 · Aug 14, 2025 · Updated 10 months ago
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training ☆40 · May 4, 2026 · Updated 2 months ago
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24) ☆53 · Dec 17, 2024 · Updated last year
- [EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization ☆40 · Sep 24, 2024 · Updated last year
- [NeurIPS 2025🔥:] EVODiff is an inference-time refinement method for diffusion models that improves sampling efficiency and generative f… ☆32 · Feb 2, 2026 · Updated 5 months ago
- Code for the papers: “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling” and “Adaptive Block-Scaled Data Types” ☆195 · Apr 21, 2026 · Updated 2 months ago
- 16-fold memory access reduction with nearly no loss ☆107 · Mar 26, 2025 · Updated last year
- This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Act… ☆18 · Oct 25, 2024 · Updated last year
- 3D AI Product Designer Inspired by Biological Morphology, built on ComfyUI and Gradio. ☆19 · Dec 6, 2024 · Updated last year
- An LLM inference engine, written in C++ ☆20 · Mar 30, 2026 · Updated 3 months ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx… ☆31 · Feb 17, 2025 · Updated last year
- Memory-optimized training scripts for video models based on Diffusers ☆17 · Jan 3, 2025 · Updated last year
- [NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking ☆12 · May 19, 2026 · Updated last month
- [CVPR 2024] On the Content Bias in Fréchet Video Distance ☆147 · Sep 28, 2024 · Updated last year
- a simple variational auto encoder with some exploration ☆13 · Nov 22, 2024 · Updated last year
- ☆20 · Dec 4, 2025 · Updated 6 months ago
- Example implementation of "Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles" by Buu Phan, … ☆18 · Jan 22, 2026 · Updated 5 months ago