AI-Application-and-Integration-Lab / Scene-Text-Detection-And-Recognition-Model_M504Links
Scene-Text-Detection-And-Recognition-Model_M504
☆25Updated 10 months ago
Alternatives and similar repositories for Scene-Text-Detection-And-Recognition-Model_M504
Users that are interested in Scene-Text-Detection-And-Recognition-Model_M504 are comparing it to the libraries listed below
Sorting:
- Official implementation of AAAI24 paper "A Dual-way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking"☆8Updated 9 months ago
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Updated 9 months ago
- ☆13Updated 7 months ago
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆13Updated 3 months ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆17Updated last month
- Revisiting Hierarchical Text Classification : Inference and Metrics☆13Updated 7 months ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆12Updated last year
- ☆16Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024)☆16Updated 3 weeks ago
- ☆17Updated 2 months ago
- Pythonic wrappers for Cider/CiderD evaluation metrics. Provides CIDEr as well as CIDEr-D (CIDEr Defended) which is more robust to gaming …☆13Updated last year
- ☆12Updated last year
- Applies ROME and MEMIT on Mamba-S4 models☆14Updated last year
- [AAAI 2024] Rethinking Mesh Watermark: Towards Highly Robust and Adaptable Deep 3D Mesh Watermarking☆13Updated 7 months ago
- ☆22Updated last year
- Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024)☆16Updated 9 months ago
- Data and Code for the paper "FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains"☆20Updated 11 months ago
- ☆19Updated 3 months ago
- Let Models Speak Ciphers: Multiagent Debate through Embeddings☆13Updated last year
- ☆11Updated 9 months ago
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆9Updated 2 years ago
- Text-2-SQL☆19Updated 4 months ago
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models☆19Updated 5 months ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆10Updated 5 months ago
- This repo is reproduction resources for linear alignment paper, still working☆17Updated last year
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆18Updated 8 months ago
- An evaluation suite for Retrieval-Augmented Generation (RAG).☆20Updated 2 months ago
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]]☆35Updated last week
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning☆16Updated last year
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆14Updated 5 months ago