Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"
☆574Nov 4, 2025Updated 5 months ago
Alternatives and similar repositories for Glyph
Users that are interested in Glyph are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆29Mar 18, 2026Updated 3 weeks ago
- OmniGAIA: Towards Native Omni-Modal AI Agents☆89Apr 2, 2026Updated last week
- Official Implementation of SAGE-GRPO:Manifold-Aware Exploration for Reinforcement Learning in Video Generation☆101Apr 2, 2026Updated last week
- ☆183Dec 5, 2025Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆44Jan 9, 2026Updated 3 months ago
- ☆19Jan 17, 2025Updated last year
- ☆62Oct 29, 2024Updated last year
- Codes for paper SoAy: A Service-oriented APIs Applying Framework of Large Language Models☆27Jul 14, 2025Updated 8 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 5 months ago
- ☆28Aug 19, 2025Updated 7 months ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated last month
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning☆38Feb 6, 2026Updated 2 months ago
- A high-performance tokenizer (BPE + SentencePiece) built with Rust with Python bindings, focused on speed, safety, and resource optimizat…☆59Mar 15, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆27Nov 11, 2025Updated 5 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 9 months ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆26Apr 4, 2026Updated last week
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆25Nov 17, 2024Updated last year
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆62Dec 16, 2025Updated 3 months ago
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆52Updated this week
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 7 months ago
- UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation☆39Nov 24, 2025Updated 4 months ago
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆24Jul 3, 2025Updated 9 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is a dataset that aligns piano music MIDI with their corresponding textual descriptions and comments. It can be used for multi-modal…☆12Nov 21, 2023Updated 2 years ago
- Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]☆171Mar 30, 2026Updated last week
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 6 months ago
- ☆37Dec 16, 2025Updated 3 months ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆24Oct 22, 2025Updated 5 months ago
- ☆17Jul 12, 2025Updated 9 months ago
- [CVPR 2026] LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling☆217Updated this week
- [ICLR 2025🔥] D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models☆27Jul 7, 2025Updated 9 months ago
- ☆12Oct 7, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Feb 11, 2025Updated last year
- ☆73Mar 3, 2026Updated last month
- ☆59Nov 12, 2025Updated 5 months ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆41Jan 29, 2026Updated 2 months ago
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping☆63May 22, 2025Updated 10 months ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 6 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆77Mar 3, 2026Updated last month