☆17Jan 26, 2024Updated 2 years ago
Alternatives and similar repositories for ComCAT
Users that are interested in ComCAT are comparing it to the libraries listed below.
- [EMNLP'22] Weakly-Supervised Temporal Article Grounding☆14Nov 25, 2023Updated 2 years ago
- Codebase for SIGNET: Efficient Neural Representations for Light Fields☆15Jul 27, 2023Updated 2 years ago
- [ICLR 2025] FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise☆14Mar 5, 2025Updated last year
- Official implementation of "Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation"☆16Apr 1, 2025Updated 11 months ago
- Take Your Model Further: A General Post-refinement Network for Light Field Disparity Estimation via BadPix Correction☆10Feb 28, 2023Updated 3 years ago
- A pretrained model which can convert an anime image to a sketch.☆13Apr 16, 2020Updated 5 years ago
- All code and data necessary to replicate experiments in the paper BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Model…☆13Sep 16, 2024Updated last year
- [CVPR 2025] DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Refocusing, Defocus Rendering and Blur Removal☆24Jun 11, 2025Updated 9 months ago
- Automatic Metric for Evaluating Generated Videos☆36Dec 8, 2025Updated 3 months ago
- [IROS24] TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization☆22May 21, 2024Updated last year
- ☆22Mar 18, 2023Updated 3 years ago
- ☆22Oct 27, 2024Updated last year
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026)☆47Mar 3, 2026Updated 3 weeks ago
- This project is the official implementation of our accepted IEEE TPAMI paper Diverse Sample Generation: Pushing the Limit of Data-free Qu…☆15Feb 26, 2023Updated 3 years ago
- Triton implementation of bi-directional (non-causal) linear attention☆71Mar 1, 2026Updated 3 weeks ago
- ☆35Feb 5, 2024Updated 2 years ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆20Oct 19, 2025Updated 5 months ago
- A demo repository for AITuber☆10Mar 11, 2023Updated 3 years ago
- ☆18Aug 7, 2025Updated 7 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆24Sep 21, 2025Updated 6 months ago
- [CVPR 2026] PAI-Bench: A Comprehensive Benchmark for Physical AI☆57Feb 21, 2026Updated last month
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆24Oct 6, 2024Updated last year
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- ☆12Feb 22, 2021Updated 5 years ago
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…☆13Jan 16, 2024Updated 2 years ago
- ☆12Mar 12, 2023Updated 3 years ago
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Jun 11, 2023Updated 2 years ago
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 5 years ago
- ☆10Dec 3, 2023Updated 2 years ago
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆26Dec 12, 2024Updated last year
- ☆18Jun 20, 2025Updated 9 months ago
- [ICLR 2026] PyTorch implementation of "The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images with…☆53Jan 26, 2026Updated 2 months ago
- Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.☆11Jul 28, 2022Updated 3 years ago
- Source code of Universal Weighting Metric Learning for Cross-Modal Matching. The paper is accepted by CVPR2020.☆22Nov 2, 2022Updated 3 years ago
- Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering (Kim et al., ACL 2021)☆32Jan 2, 2023Updated 3 years ago
- ☆12Oct 10, 2017Updated 8 years ago
- The official code and data for paper "VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI"☆16Mar 25, 2025Updated last year
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆12Aug 23, 2025Updated 7 months ago