Gary-code / Machine-Learning-Park
机器学习乐园:主要包括机器学习基础,深度学习实践,工业应用。
☆14Updated 2 years ago
Alternatives and similar repositories for Machine-Learning-Park:
Users that are interested in Machine-Learning-Park are comparing it to the libraries listed below
- 😎 基于知识的文本生成相关文章总结与个人笔记☆21Updated 3 months ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆33Updated 7 months ago
- ☆51Updated last month
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆30Updated 7 months ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆32Updated last year
- An automatic MLLM hallucination detection framework☆18Updated last year
- ☆58Updated 7 months ago
- Data for evaluating GPT-4V☆11Updated last year
- ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration☆21Updated 2 weeks ago
- [ACL2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference☆19Updated last year
- [ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.☆25Updated 3 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆40Updated 2 months ago
- [ICLR 2023] This is the code repo for our ICLR‘23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…☆50Updated 6 months ago
- Multi-Figurative Language Generation (COLING 2022)☆12Updated last year
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆17Updated 7 months ago
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…☆13Updated 7 months ago
- This is the repository for COLING 2022 paper "Context-Tuning: Learning Contextualized Prompts for Natural Language Generation".☆11Updated 2 years ago
- ☆21Updated 2 months ago
- ☆13Updated 2 years ago
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆70Updated last year
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆25Updated 11 months ago
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆38Updated 3 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆55Updated 2 months ago
- ☆11Updated last month
- ☆11Updated last week
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆21Updated 2 months ago
- Source code of our MM'22 paper Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning☆21Updated 6 months ago
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆126Updated 6 months ago
- A Self-Training Framework for Vision-Language Reasoning☆60Updated 2 months ago
- ☆28Updated 11 months ago