SupstarZh / WhitenedCSE
[ACL2023] WhitenedCSE: Whitening-based Contrastive Learning of Sentence Embeddings
☆18Updated last year
Related projects ⓘ
Alternatives and complementary repositories for WhitenedCSE
- A paper list about diffusion models for natural language processing.☆175Updated last year
- ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models☆292Updated 8 months ago
- ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆93Updated 3 months ago
- Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos☆34Updated 6 months ago
- Source code of LatentOps☆77Updated last year
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆178Updated 7 months ago
- Large Visual Language Model(LVLM), Large Language Model(LLM), Multimodal Large Language Model(MLLM), Alignment, Agent, AI System, Survey☆17Updated 6 months ago
- 本项目用于Multimodal领域新手的学习路线,包括该领域的经典论文,项目及课程。旨在希望学习者在一定的时间内达到对这个领域有较为深刻的认知,能够自己进行的独立研究。☆15Updated 7 months ago
- A RLHF Infrastructure for Vision-Language Models☆98Updated 5 months ago
- Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.☆28Updated 2 weeks ago
- 😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources.☆144Updated 7 months ago
- ☆13Updated 2 weeks ago
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆225Updated 9 months ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆81Updated 3 weeks ago
- ☆79Updated 2 years ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆45Updated 2 months ago
- ☆24Updated 4 months ago
- A curated list of awesome Multimodal studies.☆93Updated last week
- Official repository for the A-OKVQA dataset☆63Updated 6 months ago
- Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models☆136Updated last week
- ☆16Updated 2 years ago
- Official github repo of G-LLaVA☆121Updated 5 months ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆91Updated last year
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆153Updated 9 months ago
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆16Updated 4 months ago
- Grab GPU whenever available☆278Updated 2 years ago
- ☆11Updated 3 months ago
- Awesome papers & datasets specifically focused on long-term videos.☆195Updated 3 weeks ago
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆133Updated last year
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆50Updated 5 months ago