google-deepmind / proactive_t2i_agents
Code release for the paper, "Proactive Agents for Text-to-Image Generation under Uncertainty"
☆35 · Updated 2 weeks ago
Alternatives and similar repositories for proactive_t2i_agents
Users who are interested in proactive_t2i_agents are comparing it to the libraries listed below.
- Code for the paper "Harnessing Webpage UIs for Text-Rich Visual Understanding" ☆51 · Updated 5 months ago
- ☆9 · Updated last month
- A framework that makes it effortless to convert any LLM into a reasoning agent like o1 or DeepSeek's R1 ☆21 · Updated last week
- Enhancement in Multimodal Representation Learning. ☆40 · Updated last year
- ☆36 · Updated 2 years ago
- ☆13 · Updated 5 months ago
- ☆21 · Updated 7 months ago
- GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents ☆69 · Updated this week
- ☆61 · Updated 10 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆22 · Updated 6 months ago
- An LLM reads a paper and produces a working prototype ☆57 · Updated last month
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis ☆49 · Updated last week
- BH hackathon ☆14 · Updated last year
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost ☆43 · Updated this week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆53 · Updated 4 months ago
- ☆19 · Updated last week
- ☆77 · Updated 7 months ago
- 🧠 Societies of Mind & Economy of Minds ☆58 · Updated 2 months ago
- Finetune any model on HF in less than 30 seconds ☆58 · Updated 2 months ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models" ☆36 · Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 · Updated 4 months ago
- ☆79 · Updated last month
- ☆82 · Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 · Updated 3 months ago
- ☆59 · Updated 2 weeks ago
- Official repo for the paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems" ☆52 · Updated 3 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆110 · Updated 2 weeks ago
- Code for the ScribeAgent paper ☆57 · Updated 3 months ago
- Challenges for general-purpose web-browsing AI agents ☆58 · Updated this week
- ☆65 · Updated 2 months ago