VisuRiddles: Fine-grained Perception is an important thing for Multimodal Large Models in Riddles Solving
☆18Oct 22, 2025Updated 5 months ago
Alternatives and similar repositories for VisuRiddles
Users who are interested in VisuRiddles are comparing it to the repositories listed below.
- ☆16Apr 21, 2025Updated 11 months ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆26Feb 22, 2024Updated 2 years ago
- 【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆128Jun 4, 2025Updated 10 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- DatasetImgLabeler is an image annotation tool for researchers to prepare datasets in ICDAR2015 format☆12Dec 7, 2019Updated 6 years ago
- ☆14Jun 10, 2025Updated 10 months ago
- [ICCV 2025] LIRA☆21Nov 25, 2025Updated 4 months ago
- ☆23Nov 21, 2025Updated 4 months ago
- ☆12Sep 8, 2022Updated 3 years ago
- Increasing the scale and diversity of chart de-rendering data.☆12Mar 13, 2024Updated 2 years ago
- ☆18Mar 19, 2021Updated 5 years ago
- [ICCV2025] Training-Free Diffusion Models for Geometric Image Editing☆33Jan 13, 2026Updated 3 months ago
- ☆15May 15, 2025Updated 11 months ago
- [ICLR26] ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding☆93Mar 20, 2026Updated 3 weeks ago
- FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models.☆19Nov 20, 2024Updated last year
- ☆15Oct 23, 2018Updated 7 years ago
- ☆56Jun 17, 2025Updated 9 months ago
- ☆22May 30, 2023Updated 2 years ago
- ☆27Feb 20, 2024Updated 2 years ago
- Implementation code for UniMix and Bayias Compensated Loss☆19Mar 7, 2023Updated 3 years ago
- A cross-platform terminal-based IDE, file manager, and virtual terminal written in Rust.☆82Updated this week
- Export Donut model to onnx and run it with onnxruntime☆23Nov 21, 2023Updated 2 years ago
- This repository is the official PyTorch implementation of Balanced Product of Calibrated Experts for Long-Tailed Recognition (CVPR 2023).☆18Mar 13, 2025Updated last year
- Caffe implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).☆27Jul 18, 2017Updated 8 years ago
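- Implements a Blip2RWKV+QFormer multimodal image-text dialogue model using the Two-Step Cognitive Psychology Prompt method; a model with only 3B parameters can exhibit human-like causal chains of thought. Benchmarked against image-text dialogue large language models such as MiniGPT-4 and ImageBind, aiming to use less compute and fewer resources to…☆42Jul 17, 2023Updated 2 years ago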
- (ECCV'22) FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling☆19Nov 22, 2024Updated last year
- ☆12Oct 20, 2023Updated 2 years ago
- Verify CPU circuits in Logisim or Verilog against MARS simulation☆10Dec 31, 2020Updated 5 years ago
- ☆33Sep 27, 2024Updated last year
- ☆134Dec 22, 2023Updated 2 years ago
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning☆36Jul 15, 2025Updated 9 months ago
- ☆12Aug 23, 2019Updated 6 years ago
- [NeurIPS 2022] Source code for our paper "Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data"☆24Oct 16, 2023Updated 2 years ago
- GAIIC2024 dual-light (visible and infrared) object detection from a UAV perspective - Rank 6 solution☆12Jun 17, 2024Updated last year
- Decode Neural signal as Speech☆37Oct 6, 2024Updated last year
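- This project was created to help students with backgrounds in artificial intelligence, natural language processing, and large language models find jobs; contributions to building and maintaining the project are welcome☆18Mar 30, 2025Updated last year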
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"☆19Feb 1, 2026Updated 2 months ago
- Deciphering Oracle Bone Language with Diffusion Models (ACL 2024 Best Paper)☆228Sep 17, 2025Updated 6 months ago
- Z1h is a programming language that lets you work quickly and provide simple, reliable, and flexible service.☆13Aug 17, 2025Updated 7 months ago