PLippmann / multimodal-manga-translationLinks
Context-Informed Machine Translation of Manga using Multimodal Large Language Models
☆10Updated 6 months ago
Alternatives and similar repositories for multimodal-manga-translation
Users that are interested in multimodal-manga-translation are comparing it to the libraries listed below
Sorting:
- [WACV2025] source code of StrDA: https://arxiv.org/abs/2410.09913☆11Updated 2 months ago
- KV cache compression via sparse coding☆10Updated last month
- Comics Dataset Framework for Comics Understanding☆23Updated 3 months ago
- Renderer for the Crello dataset☆9Updated 5 months ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 4 months ago
- ☆84Updated 2 weeks ago
- stochastic bfloat16 based optimizer library☆16Updated 6 months ago
- Adapt MLLMs to Domains via Post-Training☆9Updated 5 months ago
- Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding☆105Updated last week
- ☆21Updated 3 months ago
- Lightweight Hybrid Search and Reranking☆10Updated 3 months ago
- 2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models (WWW 2025)☆10Updated 2 months ago
- ☆17Updated last year
- [NAACL 2025] Official Code Repository for the paper "Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval"☆11Updated last month
- Enhancing Translation with RAG-Powered Large Language Models☆80Updated 3 months ago
- A bot that provides Youtube vid chapters on Twitter (a.k.a. X )☆12Updated 4 months ago
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"☆26Updated last year
- ☆17Updated 5 months ago
- A simple tool to estimate the reading order of comic panels☆16Updated 2 years ago
- ☆62Updated 11 months ago
- This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acceleration☆14Updated 3 months ago
- ☆9Updated 8 months ago
- Scripts for use with LongCLIP, including fine-tuning Long-CLIP☆61Updated 3 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Updated 4 months ago
- ☆158Updated last month
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 7 months ago
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati…☆97Updated last year
- ☆20Updated 7 months ago
- ☆11Updated 7 months ago
- ☆12Updated 5 months ago