Generating figures from research papers, using textual captions from the paper.
☆43Jul 17, 2023Updated 2 years ago
Alternatives and similar repositories for figure-diffusion
Users who are interested in figure-diffusion are comparing it to the libraries listed below.
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆83Jan 30, 2023Updated 3 years ago
- ☆21Apr 8, 2024Updated 2 years ago
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆24Dec 10, 2025Updated 5 months ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆45May 19, 2026Updated last week
- [AAAI2026] TextShield-R1: Reinforced Reasoning for Tampered Text Detection☆28Feb 14, 2026Updated 3 months ago
- DALLE-tools provides useful dataset utilities to improve your workflow with WebDatasets.☆14Mar 9, 2022Updated 4 years ago
- ☆13Nov 15, 2022Updated 3 years ago
- ☆18Mar 31, 2024Updated 2 years ago
- A summary of must-read papers for Neural Question Generation (NQG)☆14Nov 14, 2020Updated 5 years ago
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆57Jul 25, 2023Updated 2 years ago
- An attempt at implementing the deep learning model proposed in the paper "Teaching Robots to Draw"☆11Aug 13, 2021Updated 4 years ago
- Initial commit☆13Aug 14, 2023Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation☆12Nov 29, 2024Updated last year
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Jul 10, 2023Updated 2 years ago
- [ICLR 2025] "GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation", Tao Feng, Yihang Sun, Jiaxuan You☆18Mar 18, 2025Updated last year
- ☆17Apr 6, 2023Updated 3 years ago
- ☆12Jul 21, 2025Updated 10 months ago
- ☆19Jun 10, 2025Updated 11 months ago
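- Batch-process data with large models; currently supports OCR with various large models, including Tongyi Qianwen, Moonshot AI, Baidu PaddlePaddle OCR, OpenAI, and LLAVA. Use LLM to generate or clean data for academic use. Support OCR with qwen, m…☆17Sep 15, 2024Updated last year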
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆22May 15, 2025Updated last year
- [NeurIPS 2025 D&B Track] MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆30May 8, 2026Updated 2 weeks ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆20May 2, 2025Updated last year
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆19Apr 1, 2025Updated last year
- A fun way to visualize influence in the game of Go.☆23Dec 13, 2018Updated 7 years ago
- Movie Screenplay Parser☆13Apr 29, 2024Updated 2 years ago
- ☆13Mar 11, 2025Updated last year
- [ECCV 2024] Code for "EraseDraw: Learning to Insert Objects by Erasing Them from Images"☆26Dec 1, 2024Updated last year
- Self-supervised method for completing partial LiDAR point clouds. Trained and tested on ShapeNet and SemanticKITTI in TensorFlow. (BMVC 2…☆14Oct 15, 2022Updated 3 years ago
- [ECCV 2024] Official PyTorch implementation of "HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts"☆20Nov 22, 2024Updated last year
- ☆18Aug 7, 2025Updated 9 months ago
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).☆145Apr 11, 2025Updated last year
- Samwalker☆10Aug 29, 2021Updated 4 years ago
- ORES: Open-vocabulary Responsible Visual Synthesis☆14Dec 12, 2023Updated 2 years ago
- LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks☆19Oct 3, 2024Updated last year
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆26Jul 1, 2025Updated 10 months ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆28Oct 14, 2025Updated 7 months ago
- A Streamlit-based Chatbot Arena for Ollama LLMs☆14May 19, 2024Updated 2 years ago
- PoseIt: a multi-modal dataset that contains visual and tactile data for holding poses☆14Feb 9, 2023Updated 3 years ago
- Official repository for CLRCMD (appeared in ACL 2022)☆42Feb 21, 2023Updated 3 years ago