lqzxt / NGTRLinks
☆12Updated 4 months ago
Alternatives and similar repositories for NGTR
Users that are interested in NGTR are comparing it to the libraries listed below
Sorting:
- [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents☆45Updated 4 months ago
- SPRINT: Script-agnostic Structure Recognition in Tables☆13Updated 6 months ago
- ☆13Updated 8 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆15Updated 10 months ago
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆59Updated 11 months ago
- survery of small language models☆16Updated last year
- Control LLM☆20Updated 6 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- Geometric-Mean Policy Optimization☆84Updated last week
- Parameter-Efficient Fine-Tuning for Foundation Models☆93Updated 6 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆63Updated 11 months ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆21Updated 7 months ago
- ☆50Updated 2 months ago
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.☆24Updated 3 months ago
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆63Updated 3 weeks ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆56Updated last month
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆15Updated 5 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Updated 5 months ago
- RewardAnything: Generalizable Principle-Following Reward Models☆44Updated 4 months ago
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆29Updated last week
- ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2☆67Updated 11 months ago
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Updated this week
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Updated last year
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆25Updated 4 months ago
- ☆18Updated last month
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆15Updated 11 months ago
- ☆60Updated 2 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆43Updated last year
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆20Updated 8 months ago
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".☆44Updated last year