☆13Dec 9, 2024Updated last year
Alternatives and similar repositories for MM-self-improve-qwen2vl
Users that are interested in MM-self-improve-qwen2vl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- A Self-Training Framework for Vision-Language Reasoning☆90Jan 23, 2025Updated last year
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆59Jun 1, 2025Updated 10 months ago
- An implementation of Scalable Evaluation and Improvement of Document Set Expansion via Neural Positive-Unlabeled Learning without AllenNL…☆19Feb 20, 2024Updated 2 years ago
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- text classification compitioin top 10 strategy☆18Aug 14, 2021Updated 4 years ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆42Nov 11, 2025Updated 5 months ago
- Official Repo for "Why Settle for One? Text-to-ImageSet Generation and Evaluation"☆21Oct 1, 2025Updated 6 months ago
- Plot package similar to gnuplot☆23Mar 26, 2024Updated 2 years ago
- Code for Research Project TLDR☆25Jul 28, 2025Updated 8 months ago
- ☆12Aug 8, 2024Updated last year
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"☆28Jul 7, 2025Updated 9 months ago
- the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo…☆21Apr 9, 2025Updated last year
- ☆21Feb 18, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models☆30Nov 25, 2024Updated last year
- ☆19Jun 4, 2020Updated 5 years ago
- Partially Non-Autoregressive Image Captioning☆10Sep 30, 2021Updated 4 years ago
- TRISTAN: TRI's Situation and Trajectory Anticipation Networks☆14Jul 18, 2023Updated 2 years ago
- The implement of LLMTreeRec☆14Dec 9, 2024Updated last year
- Official repository for "LFR-GAN: Local Feature Refinement based Generative Adversarial Network for Text-to-Image Generation" (TOMM 2023)…☆11Mar 21, 2023Updated 3 years ago
- [ACL 2026] Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic…☆45Nov 10, 2025Updated 5 months ago
- Transformer OCR is a Optical Character Recognition tookit built for researchers working on both OCR for both Vietnamese and English. This…☆10Dec 27, 2021Updated 4 years ago
- The official implementation of the ECCV'24 paper MC-CoT: Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models w…☆26May 19, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 收集LUG@NJU群的精华消息,好玩就行。☆12Jun 22, 2022Updated 3 years ago
- Joint Perception and Motion Prediction for Autonomous Driving Based on Bird's Eye View Maps☆22Feb 5, 2024Updated 2 years ago
- ☆18Nov 3, 2025Updated 5 months ago
- 2021MXAP-DGL rank2☆35Mar 23, 2022Updated 4 years ago
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023☆11Sep 21, 2023Updated 2 years ago
- ☆20Apr 24, 2024Updated last year
- ☆19Dec 6, 2023Updated 2 years ago
- Code for AutoGeo.☆16Aug 18, 2024Updated last year
- ☆13Sep 5, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10May 8, 2024Updated last year
- [CVPR24] OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising☆17Apr 4, 2024Updated 2 years ago
- [ACL 2025] A Neural-Symbolic Self-Training Framework☆117Jun 1, 2025Updated 10 months ago
- CamRest676 is an English data set, I translate it into Chinese for training nlu.☆12Dec 20, 2017Updated 8 years ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago
- ☆12Nov 2, 2024Updated last year
- ☆11Oct 2, 2023Updated 2 years ago