lqzxt / NGTRLinks
☆13Updated 7 months ago
Alternatives and similar repositories for NGTR
Users that are interested in NGTR are comparing it to the libraries listed below
Sorting:
- [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents☆54Updated 7 months ago
- Control LLM☆22Updated 9 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Updated last year
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆15Updated 7 months ago
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.☆28Updated 6 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Updated last year
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆62Updated last year
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated 8 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆105Updated 9 months ago
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆65Updated last year
- ☆67Updated 4 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Updated 7 months ago
- ☆13Updated 11 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"☆42Updated 8 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated 2 months ago
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆57Updated last month
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆29Updated last year
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆42Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆58Updated last month
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Updated 11 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆90Updated last year
- [SIGIR 2025] This is the code repo for our SIGIR'25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G…☆18Updated 8 months ago
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆36Updated 2 months ago
- ☆35Updated 3 months ago
- survery of small language models☆17Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024)☆17Updated 6 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆36Updated last year
- [ICCV 2025] Dynamic-VLM☆28Updated last year
- ☆46Updated 4 months ago