philippe-eecs/small-vision

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/philippe-eecs/small-vision)

philippe-eecs / small-vision

A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.

☆34

Alternatives and similar repositories for small-vision

Users that are interested in small-vision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zsxkib / ST-MFNet
View on GitHub
[IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull
☆13Oct 9, 2023Updated 2 years ago
showlab / DiffSim
View on GitHub
[ICCV 2025] Official repository of DiffSim: Taming Diffusion Models for Evaluating Visual Similarity
☆31Jul 14, 2025Updated last year
philippe-eecs / vitok
View on GitHub
☆34May 14, 2025Updated last year
viddle-app / animatediff
View on GitHub
Animatediff implementation. Includes a ControlNet pipeline.
☆19Dec 24, 2023Updated 2 years ago
Anima-Lab / MaskDiT
View on GitHub
Code for Fast Training of Diffusion Models with Masked Transformers
☆428May 15, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
SherifAbdulatif / CMGAN
View on GitHub
Conformer-based Metric GAN for speech enhancement
☆27May 3, 2024Updated 2 years ago
huggingface / docmatix
View on GitHub
A huge dataset for Document Visual Question Answering
☆24Jul 29, 2024Updated last year
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago
dclelland / Plinth
View on GitHub
Hardware-accelerated matrix/numeric programming library for Swift
☆12Sep 2, 2025Updated 10 months ago
GindaChen / FlexFlashAttention3
View on GitHub
FlexAttention w/ FlashAttention3 Support
☆27Oct 5, 2024Updated last year
gcorso / disco-diffdock
View on GitHub
Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024
☆93Jun 12, 2024Updated 2 years ago
fudan-generative-vision / MixFlow
View on GitHub
[CVPR 2026] MixFlow Training: Alleviating Exposure Bias with Slowed Interpolation Mixture
☆21Dec 23, 2025Updated 6 months ago
GraphPKU / CoI
View on GitHub
Chain of Images for Intuitively Reasoning
☆10Nov 29, 2023Updated 2 years ago
borisdayma / sora-mini
View on GitHub
☆18Feb 16, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
doc-doc / NExT-OE
View on GitHub
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
☆30Jul 18, 2023Updated 3 years ago
yuguochencuc / DBT-Net
View on GitHub
The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…
☆30Jul 25, 2022Updated 3 years ago
sail-sg / ScaleLong
View on GitHub
The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…
☆50Oct 23, 2023Updated 2 years ago
jfsantos / AuditoryFilters.jl
View on GitHub
Auditory filterbanks in Julia
☆21Dec 11, 2018Updated 7 years ago
snapcrafters / spelunky
View on GitHub
Spelunky HD Classic
☆12Dec 2, 2023Updated 2 years ago
huaidanquede / Dense-TSNet
View on GitHub
offical code for Dense-TSNet
☆12Sep 17, 2024Updated last year
RGenDiff / open-litevae
View on GitHub
Implementation of LiteVAE
☆18Feb 11, 2025Updated last year
snap-research / SF-V
View on GitHub
This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.
☆99Nov 27, 2024Updated last year
dsi-icl / biobank-read
View on GitHub
Python for UK Biobank data analysis
☆10Dec 3, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
bradhowes / AUv3Support
View on GitHub
Swift package containing useful code for AUv3 components.
☆25Dec 31, 2025Updated 6 months ago
VILA-Lab / i-mae
View on GitHub
i-mae Pytorch Repo
☆20Apr 6, 2024Updated 2 years ago
itsnamgyu / block-transformer
View on GitHub
Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)
☆166Apr 13, 2025Updated last year
hywang66 / LARP
View on GitHub
Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).
☆107Feb 11, 2025Updated last year
gemlab-vt / CONFORM
View on GitHub
Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models [CVPR 2024]
☆27Oct 7, 2024Updated last year
orrzohar / Video-STaR
View on GitHub
[ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
☆72Jul 10, 2024Updated 2 years ago
Qichuzyy / POA
View on GitHub
Official implementation of ECCV24 paper: POA
☆24Aug 8, 2024Updated last year
yuguochencuc / SF-Net
View on GitHub
The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"
☆53Feb 16, 2023Updated 3 years ago
kennethwdk / PINet
View on GitHub
Code for "Robust Pose Estimation in Crowded Scenes with Direct Pose-Level Inference", NeurIPS 2021
☆15Dec 2, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ruili33 / TPO
View on GitHub
☆41Sep 9, 2025Updated 10 months ago
SJTU-Intelligent-Optics-Lab / Annotation-efficient-learning-for-OCT-segmentation
View on GitHub
☆12Jul 1, 2023Updated 3 years ago
sangyun884 / rfpp
View on GitHub
The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024
☆133Oct 18, 2024Updated last year
UCSB-AI / Discffusion
View on GitHub
Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
☆29Apr 27, 2024Updated 2 years ago
xiaomabufei / SKDF
View on GitHub
☆14Feb 21, 2024Updated 2 years ago
diggerdu / AudioMamba
View on GitHub
☆12Jun 1, 2024Updated 2 years ago
AMLAB-Wakayama / gammachirp-filterbank
View on GitHub
An original package of the dynamic compressive gammachirp filterbank (dcGC-FB)
☆14Jul 7, 2026Updated 2 weeks ago