Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
☆28 · Oct 30, 2024 · Updated last year
Alternatives and similar repositories for VisInContext
Users who are interested in VisInContext are comparing it to the repositories listed below.
- FQGAN: Factorized Visual Tokenization and Generation ☆59 · Mar 29, 2025 · Updated 11 months ago
- A large-scale dataset for training and evaluating a model's ability on Dense Text Image Generation ☆86 · Sep 27, 2025 · Updated 5 months ago
- ☆54 · Nov 12, 2025 · Updated 3 months ago
- An MCP Server that works with Roo Code/Cline.Bot/Claude Desktop to optimize costs by intelligently routing coding tasks between local LLM… ☆41 · Jul 25, 2025 · Updated 7 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text ☆66 · Sep 6, 2024 · Updated last year
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training" ☆32 · Jul 8, 2025 · Updated 7 months ago
- Repository containing starter templates to be used within Kodu ☆15 · Sep 26, 2024 · Updated last year
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073 ☆31 · Jul 9, 2024 · Updated last year
- A Model Context Protocol (MCP) server that provides JSON-RPC functionality through OpenRPC. ☆43 · Apr 23, 2025 · Updated 10 months ago
- NegCLIP. ☆39 · Feb 6, 2023 · Updated 3 years ago
- This project is a Token Sale dApp that allows one to buy tokens and also displays recently minted tokens on the Solana blockchain using t… ☆11 · Jul 30, 2024 · Updated last year
- Your command-line, context-aware chatbot for instant codebase insights & more ✨ ☆16 · May 30, 2024 · Updated last year
- ☆13 · Nov 21, 2025 · Updated 3 months ago
- Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space" ☆63 · Dec 17, 2025 · Updated 2 months ago
- This repository contains an extension of fairseq for pixel / visual representations of text for machine translation. ☆37 · Feb 2, 2024 · Updated 2 years ago
- Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework. ☆113 · Jul 27, 2025 · Updated 7 months ago
- Kernel Playground - A playground to run large scale experiments on the Linux Kernel ☆17 · Nov 8, 2025 · Updated 3 months ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder ☆10 · Mar 16, 2023 · Updated 2 years ago
- This is a web app that forecasts the High and Low of the stocks, built using streamlit, and based on statsmodels, sklearn, and sktime. ☆12 · Nov 19, 2021 · Updated 4 years ago
- Kernel CLI ☆13 · Updated this week
- Digital advertising is becoming increasingly important. At the same time, however, the problems of this type of marketing are becoming mo… ☆11 · Oct 4, 2022 · Updated 3 years ago
- defaultMODE is a Python framework for creating Discord AI agents with persistent memory and evolving behavior through brain-inspired sele… ☆13 · Dec 18, 2025 · Updated 2 months ago
- ☆22 · Dec 23, 2025 · Updated 2 months ago
- ☆12 · Oct 17, 2025 · Updated 4 months ago
- This service integrates Python node invocation with TypeScript and litegraph.js, offering easy setup and ComfyUI compatibility. It simpli… ☆12 · Jan 20, 2024 · Updated 2 years ago
- Middleware and macros/ui extensions to control smart buildings with Webex devices ☆20 · Jul 31, 2025 · Updated 7 months ago
- An autonomous service implementing a decentralized Impact Evaluator ☆13 · Dec 1, 2025 · Updated 3 months ago
- ☆11 · May 24, 2024 · Updated last year
- Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning ☆12 · Sep 2, 2024 · Updated last year
- The Layer3 ERC20 Token ☆10 · Oct 1, 2025 · Updated 5 months ago
- [ICCV 2021] Multimodal Knowledge Expansion ☆10 · Aug 28, 2021 · Updated 4 years ago
- ☆11 · Jan 28, 2025 · Updated last year
- Orchestrating, scaling and load balancing for containerized applications ☆13 · Nov 6, 2025 · Updated 3 months ago
- [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models ☆11 · Sep 19, 2025 · Updated 5 months ago
- A small reading list that collects PDFs of original English-language books on computer science and technology. ☆10 · Jan 13, 2022 · Updated 4 years ago
- ☆33 · Jan 9, 2026 · Updated last month
- The Robinhood MCP Server provides a comprehensive interface to the Robinhood Crypto API. This server handles authentication, account mana… ☆29 · Jun 18, 2025 · Updated 8 months ago
- 🪐 🎛️ User interface to manage your Jupyter platform. ☆17 · Jun 6, 2025 · Updated 8 months ago
- ☆13 · Jan 14, 2025 · Updated last year