Extending context length of visual language models
☆12Dec 18, 2024Updated last year
Alternatives and similar repositories for GIRAFFE
Users that are interested in GIRAFFE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official codebase for “In-Context Learning with Many Demonstration Examples”☆16Feb 13, 2023Updated 3 years ago
- ☆28Jul 23, 2025Updated 10 months ago
- ☆15Jul 9, 2025Updated 10 months ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆82Nov 25, 2024Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆66Jul 8, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆33Jun 24, 2024Updated last year
- ☆26Aug 23, 2024Updated last year
- Reparameterized Discrete Diffusion Models for Text Generation☆106Feb 14, 2023Updated 3 years ago
- [EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models☆18Oct 21, 2023Updated 2 years ago
- Code for paper ”Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability“☆15Jun 13, 2023Updated 2 years ago
- ☆21May 24, 2024Updated 2 years ago
- Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.☆113Jan 14, 2026Updated 4 months ago
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"☆19May 25, 2023Updated 3 years ago
- [ICLR 2026] Computer Agent Arena: Toward Human-Centric Evaluation and Analysis of Computer-Use Agents☆60Feb 26, 2026Updated 3 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆28Feb 26, 2023Updated 3 years ago
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- EvaByte: Efficient Byte-level Language Models at Scale☆118Apr 22, 2025Updated last year
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆152Aug 26, 2024Updated last year
- ☆10Mar 13, 2023Updated 3 years ago
- Streaming Video Instruction Tuning☆75Feb 25, 2026Updated 3 months ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆27Feb 4, 2023Updated 3 years ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated last year
- Official github repo of G-LLaVA☆149Feb 20, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the ICLR'22 paper "Improving Non-Autoregressive Translation Models Without Distillation"☆18Mar 11, 2022Updated 4 years ago
- ☆102Dec 22, 2023Updated 2 years ago
- Dynamic config system based on python classes☆12Jan 27, 2023Updated 3 years ago
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆22Mar 29, 2026Updated last month
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆24Mar 5, 2024Updated 2 years ago
- Mutual attention model for matching QA pairs in dialogues☆11Sep 20, 2020Updated 5 years ago
- ☆13Oct 14, 2024Updated last year
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆820Jul 9, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A website to help people built study group.☆14Oct 1, 2020Updated 5 years ago
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆166Nov 6, 2025Updated 6 months ago
- ☆29Sep 4, 2025Updated 8 months ago
- [ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models☆47Jan 8, 2025Updated last year
- ☆10Dec 3, 2024Updated last year
- ☆16Sep 11, 2025Updated 8 months ago
- ☆10Apr 8, 2018Updated 8 years ago