[IJCV 2025] Smaller But Better: Unifying Layout Generation with Smaller Large Language Models
☆152Aug 3, 2025Updated 7 months ago
Alternatives and similar repositories for LGGPT
Users that are interested in LGGPT are comparing it to the libraries listed below
Sorting:
- [arXiv 25] Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR☆248Aug 28, 2025Updated 6 months ago
- [IEEE TPAMI 2025] Privacy-Preserving Biometric Verification With Handwritten Random Digit String☆67Aug 3, 2025Updated 7 months ago
- [NeurIPS 2022 Spotlight] The official GitHub page of "MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwritin…☆91Aug 3, 2025Updated 7 months ago
- Official code for DeepSound-V1☆13May 14, 2025Updated 10 months ago
- ☆23Jan 9, 2026Updated 2 months ago
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆26May 29, 2025Updated 9 months ago
- [ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit…☆63Feb 12, 2025Updated last year
- [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…☆75Dec 22, 2025Updated 3 months ago
- [ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection☆25Jun 27, 2025Updated 8 months ago
- [ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning☆73Dec 17, 2025Updated 3 months ago
- The official implement of CTRNet++.☆14Dec 30, 2024Updated last year
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆45Jul 10, 2025Updated 8 months ago
- [CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation☆87Nov 29, 2025Updated 3 months ago
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆35Aug 12, 2025Updated 7 months ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆67Jun 6, 2024Updated last year
- [ICRA 2025]Robust Self-Reconfiguration for Fault-Tolerant Control of Modular Aerial Robot Systems☆26Jun 9, 2025Updated 9 months ago
- Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…☆38Feb 4, 2026Updated last month
- [SIGGRAPH Asia 2025] The official implementation of the paper "DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinat…☆34Mar 10, 2026Updated last week
- Official implementation.☆27Jul 1, 2025Updated 8 months ago
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…☆42May 28, 2025Updated 9 months ago
- Official repository for "PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation" (CVP…☆19Nov 12, 2025Updated 4 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆33Jul 25, 2025Updated 7 months ago
- [CVPR 2025] SAM2Object: Consolidating View Consistency via SAM2 for Zero-Shot 3D Instance Segmentation☆30Jul 17, 2025Updated 8 months ago
- 【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…☆55Apr 2, 2025Updated 11 months ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆20Dec 4, 2024Updated last year
- [PVLDB 2025] TAB: Unified Benchmarking of Time Series Anomaly Detection Methods☆119Nov 26, 2025Updated 3 months ago
- [ICLR 2025] Pad: Personalized alignment of llms at decoding-time☆18Mar 19, 2025Updated last year
- [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"☆75Oct 22, 2025Updated 5 months ago
- Tracking the latest and greatest research papers on diffusion large language models.☆23Mar 13, 2026Updated last week
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated 11 months ago
- ☆98Feb 13, 2025Updated last year
- [MICCAI 2025] GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images☆15Mar 12, 2026Updated last week
- ☆21Nov 27, 2025Updated 3 months ago
- ☆31Dec 18, 2025Updated 3 months ago
- [🏆AAAI2025] Official Repo for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area.☆70Feb 11, 2026Updated last month
- DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning…☆29Sep 7, 2025Updated 6 months ago
- ☆157May 8, 2025Updated 10 months ago
- [TIM 2025] Towards Accurate Readings of Water Meters by Eliminating Transition Error: New Dataset and Effective Solution☆14Mar 5, 2025Updated last year
- [ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag…☆49Feb 16, 2026Updated last month