☆53Jan 5, 2026Updated 2 months ago
Alternatives and similar repositories for Taming-Hallucinations
Users who are interested in Taming-Hallucinations are comparing it to the repositories listed below
- [CVPR 2026 Findings] Eevee: Towards Close-up High-resolution Video-based Virtual Try-on☆70Feb 27, 2026Updated 3 weeks ago
- Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization☆163Mar 9, 2026Updated last week
- [ICLR2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models☆121Jan 30, 2026Updated last month
- Official implementation of the ICLR 2026 paper "Urban Socio-Semantic Segmentation with Vision-Language Reasoning"☆166Mar 12, 2026Updated last week
- [ICLR 2026] Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation☆122Feb 15, 2026Updated last month
- [ICCV25] USP: Unified Self-Supervised Pretraining for Image Generation and Understanding☆92Oct 11, 2025Updated 5 months ago
- MobilityBench: A Scalable Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios☆127Mar 4, 2026Updated 2 weeks ago
- IntRR: A Framework for Integrating SID Redistribution and Length Reduction☆39Feb 27, 2026Updated 3 weeks ago
- ☆54Feb 9, 2026Updated last month
- [ICLR2026] Advancing End-To-End Pixel-Space Generative Modeling Via Self-Supervised Pre-Training☆138Dec 8, 2025Updated 3 months ago
- Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model.☆80Jun 30, 2025Updated 8 months ago
- [WWW 2025] DSFNet: Learning Disentangled Scenario Factorization for Multi-Scenario Route Ranking. This paper’s open dataset and implementat…☆30Sep 9, 2025Updated 6 months ago
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models☆24Jan 1, 2026Updated 2 months ago
- [ICCV25] LD-RPS☆28Jul 17, 2025Updated 8 months ago
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆25Jan 3, 2026Updated 2 months ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- Instance-level Facial Attributes Editing (CVIU 2021)☆15Jul 19, 2022Updated 3 years ago
- Welcome to the official repository of Emotion-Qwen.☆26Jun 10, 2025Updated 9 months ago
- Minute-long video generation at 24FPS.☆59Feb 2, 2026Updated last month
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆37Mar 8, 2026Updated 2 weeks ago
- ☆44Jan 19, 2026Updated 2 months ago
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆55Feb 2, 2026Updated last month
- ☆13Feb 26, 2024Updated 2 years ago
- Mixture of Experts from scratch☆13Apr 12, 2024Updated last year
- ☆43Jan 27, 2026Updated last month
- [CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models☆37Feb 21, 2026Updated last month
- ☆27Feb 12, 2026Updated last month
- ☆124Jan 21, 2026Updated 2 months ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆34Dec 13, 2025Updated 3 months ago
- ☆15Aug 12, 2022Updated 3 years ago
- Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models☆25Sep 30, 2025Updated 5 months ago
- A Unimodal Valence-Arousal Driven Contrastive Learning Framework for Multimodal Multi-Label Emotion Recognition (ACM MM 2024 oral)☆27Nov 4, 2024Updated last year
- Training LLaMA or MOSS with RLHF (optional LoRA)☆21May 16, 2023Updated 2 years ago
- [ICCV2023] NoiseDet: Learning from Noisy Data for Semi-Supervised 3D Object Detection☆20Feb 5, 2023Updated 3 years ago
- A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning☆36Mar 12, 2026Updated last week
- This paper presents our winning submission to Subtask 2 of SemEval 2024 Task 3 on multimodal emotion cause analysis in conversations.☆23Aug 2, 2024Updated last year
- ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model☆16Jan 31, 2024Updated 2 years ago
- Official repository for ZeroFlow: Scalable Scene Flow via Distillation☆22Feb 22, 2024Updated 2 years ago
- Official implementation of "ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation"☆91Dec 24, 2025Updated 2 months ago