Fr0zenCrane / CockatielView external linksLinks
The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption"
☆38May 21, 2025Updated 8 months ago
Alternatives and similar repositories for Cockatiel
Users that are interested in Cockatiel are comparing it to the libraries listed below
Sorting:
- ☆53May 6, 2025Updated 9 months ago
- ☆13May 17, 2025Updated 8 months ago
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆37Jun 4, 2025Updated 8 months ago
- RLHF for Stable Diffusion☆14Jul 9, 2023Updated 2 years ago
- ☆28Mar 4, 2025Updated 11 months ago
- ☆18Oct 23, 2024Updated last year
- ☆25Nov 17, 2025Updated 2 months ago
- Generated Faces in the Wild Dataset and Code☆18Mar 2, 2025Updated 11 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆46Jul 5, 2025Updated 7 months ago
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆25Feb 21, 2025Updated 11 months ago
- TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning☆114Dec 24, 2025Updated last month
- ☆38Feb 4, 2026Updated last week
- [SIGGRAPH2025] Generative Video Matting☆57Aug 12, 2025Updated 6 months ago
- ☆36Feb 2, 2026Updated last week
- ☆110Updated this week
- MMD Motion Auto-Trace Installer on Conda☆28Oct 31, 2022Updated 3 years ago
- [ICML2025] LoRA fine-tune directly on the quantized models.☆39Nov 25, 2024Updated last year
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark☆138Jun 4, 2025Updated 8 months ago
- ☆68Aug 16, 2024Updated last year
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆37Oct 28, 2024Updated last year
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- ☆18Jun 10, 2025Updated 8 months ago
- 专门用于处理视觉丰富文档转换后md文件的rag系统☆10Mar 16, 2025Updated 10 months ago
- Writing FLUX in Triton☆41Sep 22, 2024Updated last year
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆44Jul 1, 2025Updated 7 months ago
- [NOTE] I do not have enough ressources to maintain VMS, please use Ostris's AI-Tookit instead☆43Oct 3, 2025Updated 4 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆183Jul 21, 2025Updated 6 months ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆85May 4, 2025Updated 9 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 3 months ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation☆12Feb 16, 2025Updated 11 months ago
- 🧪 A minimal visual tool to verify YOLO-based object detection algorithms in custom scenarios.☆14Jan 8, 2026Updated last month
- ☆97Jun 23, 2025Updated 7 months ago
- ☆63Jul 10, 2025Updated 7 months ago
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆194May 11, 2025Updated 9 months ago
- The raw UserRL repo under construction☆94Sep 25, 2025Updated 4 months ago
- I-CHING package(Python周易占卜)☆10Feb 22, 2021Updated 4 years ago
- Make your Turtlebot2 run on ROS Melodic (Ubuntu 18.04).☆10Jul 2, 2021Updated 4 years ago
- ☆24Jun 19, 2025Updated 7 months ago