AI-Application-and-Integration-Lab / Scene-Text-Detection-And-Recognition-Model_M504Links
Scene-Text-Detection-And-Recognition-Model_M504
☆25Updated 9 months ago
Alternatives and similar repositories for Scene-Text-Detection-And-Recognition-Model_M504
Users that are interested in Scene-Text-Detection-And-Recognition-Model_M504 are comparing it to the libraries listed below
Sorting:
- [ICIP 2024] Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model☆16Updated 2 weeks ago
- [ICASSP 2025] PDSeg: Patch-Wise Distillation and Controllable Image Generation for Weakly-Supervised Histopathology Tissue Segmentation☆16Updated 2 weeks ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆16Updated 6 months ago
- Official implementation of AAAI24 paper "A Dual-way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking"☆8Updated 8 months ago
- A distributed training framework for large language models powered by Lightning.☆22Updated 2 months ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Updated 8 months ago
- ☆13Updated last year
- ☆17Updated last month
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆12Updated 2 months ago
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆11Updated 7 months ago
- ☆13Updated 2 years ago
- ☆12Updated 4 months ago
- ☆16Updated last year
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆25Updated 6 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 7 months ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆21Updated 6 months ago
- ☆11Updated 4 months ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆14Updated 5 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆78Updated 6 months ago
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue (ACL 2024)☆23Updated 10 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆21Updated last week
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆50Updated 7 months ago
- Data and Code for the paper "FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains"☆19Updated 9 months ago
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆17Updated 6 months ago
- ☆33Updated 7 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆36Updated 2 weeks ago
- ☆9Updated last week
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆14Updated 3 months ago
- Codebase for ACL 2023 paper "Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memori…☆49Updated last year
- Public code repo for COLING 2025 paper "Aligning LLMs with Individual Preferences via Interaction"☆27Updated 2 months ago