Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024)
☆18Oct 18, 2024Updated last year
Alternatives and similar repositories for mocha_code
Users that are interested in mocha_code are comparing it to the libraries listed below
Sorting:
- Lightweight PDF Q&A tool powered by RAG (Retrieval-Augmented Generation) with MCP (Model Context Protocol) Support.☆22Oct 27, 2025Updated 4 months ago
- Official repo for [CVPR 2026] "SARMAE: Masked Autoencoder for SAR Representation Learning"☆32Dec 19, 2025Updated 3 months ago
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆23Oct 15, 2024Updated last year
- [arXiv, 2024] Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection☆35Jul 2, 2025Updated 8 months ago
- FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models☆33Nov 27, 2025Updated 3 months ago
- ☆11Mar 31, 2025Updated 11 months ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft☆46Jul 17, 2024Updated last year
- ☆18Oct 5, 2023Updated 2 years ago
- Using CNN for classifying 101 different food categories - using VGG16, Alex Net and SVM☆10Jan 6, 2020Updated 6 years ago
- ☆16Feb 12, 2026Updated last month
- ☆19Jul 23, 2024Updated last year
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- The Official Code Repo for EgoOrientBench [CVPR25]☆14Nov 24, 2025Updated 3 months ago
- Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning☆44Jul 2, 2025Updated 8 months ago
- This project aims to design, develop and implement the training model by using different inputs data. The machine will able to learn the …☆13Sep 22, 2020Updated 5 years ago
- Improved Daily SMAP Satellite Soil Moisture Prediction over China using deep learning model with transfer learning☆17Jul 22, 2021Updated 4 years ago
- Source code for EMNLP2022 long paper: Parameter-Efficient Tuning Makes a Good Classification Head☆14Nov 7, 2022Updated 3 years ago
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Dec 13, 2024Updated last year
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆14Jan 25, 2024Updated 2 years ago
- A multimodal dataset of 5M insect specimens for biodiversity research.☆19Feb 25, 2026Updated 3 weeks ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆70Dec 9, 2024Updated last year
- Synthesize bio-plausible neural networks for cognitive tasks, mimicking brain architecture☆11Apr 14, 2021Updated 4 years ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 2 months ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆13Apr 27, 2022Updated 3 years ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 8 months ago
- [NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning.☆14Dec 12, 2024Updated last year
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- ☆23Dec 1, 2022Updated 3 years ago
- Tight Mutual Information Estimation With Contrastive Fenchel-Legendre Optimization☆11Nov 29, 2022Updated 3 years ago
- ☆13Aug 7, 2025Updated 7 months ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆11Oct 9, 2024Updated last year
- Starter-pack for the AI4EO Food Security Challenge☆22May 29, 2023Updated 2 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- 基于PaddlePaddle的土壤某物质含量高光谱反演☆20Sep 7, 2021Updated 4 years ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- ☆21Oct 10, 2023Updated 2 years ago
- ☆18Aug 1, 2024Updated last year
- Reproducible research code for the experiments presented in our article "Kara1k: a karaoke dataset for cover song identification and sing…☆10Jan 9, 2018Updated 8 years ago