This is the repo for the paper "Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining".
☆48Aug 22, 2025Updated 9 months ago
Alternatives and similar repositories for multi-actor-data-selection
Users who are interested in multi-actor-data-selection are comparing it to the libraries listed below.
- [AAAI 2025] This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆37Apr 10, 2025Updated last year
- [ICLR 2025 Spotlight] The official implementation of the paper “LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multi…☆179Feb 7, 2026Updated 4 months ago
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆81Oct 17, 2025Updated 7 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated last year
- Survey on Data-centric Large Language Models☆93Jul 8, 2024Updated last year
- Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024☆22Feb 15, 2024Updated 2 years ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆90Mar 23, 2025Updated last year
- [NeurIPS 2025 🔥] FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis☆149Sep 24, 2025Updated 8 months ago
- PaperPub is an academic arena where diverse AI Agents read papers daily, pick apart each other's arguments, and fiercely debate.☆43Apr 17, 2026Updated last month
- ☆37Feb 17, 2026Updated 3 months ago
- ☆13Feb 2, 2025Updated last year
- A 3D Kokoro Mate similar to Grok Ani.☆42Jul 20, 2025Updated 10 months ago
- Data and Code for EMNLP 2022 paper "ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples"☆15Jun 4, 2023Updated 3 years ago
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆19Apr 23, 2026Updated last month
- Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)☆19Jul 1, 2025Updated 11 months ago
- ☆16May 30, 2025Updated last year
- AAAI 2024: Visual Instruction Generation and Correction☆97Feb 4, 2024Updated 2 years ago
- The official implementation of the paper "CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis"☆16Sep 2, 2024Updated last year
- Provide customized diffusers training and inference code for different needs☆12Jan 16, 2024Updated 2 years ago
- Cross-View Geolocalization and Disaster Mapping with Street-View and VHR Satellite Imagery: A Case Study of Hurricane IAN☆19Oct 3, 2024Updated last year
- [CVPR 2024, Highlight] The official implementation of the paper "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation…☆50Sep 30, 2025Updated 8 months ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Oct 19, 2025Updated 7 months ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17May 15, 2025Updated last year
- A longitudinal dataset for academic literature, including papers, metadata, and citation graphs. Also available on 🤗 HuggingFace and Kag…☆17Sep 6, 2025Updated 9 months ago
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆22Mar 29, 2026Updated 2 months ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆23Feb 13, 2025Updated last year
- GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities☆306May 3, 2025Updated last year
- Improving neural network representations using human similarity judgments☆13Nov 22, 2024Updated last year
- ☆27Jun 11, 2025Updated 11 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆26Jul 1, 2025Updated 11 months ago
- [CVPR 2023] The models, datasets(satellite&street view) and correlative config files of OmniCity-v1.0 project.☆34Mar 27, 2025Updated last year
- A comprehensive paper list of Reasoning over Tables.☆30Nov 6, 2022Updated 3 years ago
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆61Apr 7, 2026Updated 2 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆49Sep 8, 2025Updated 9 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- Python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis☆10Nov 21, 2017Updated 8 years ago
- ☆20Apr 23, 2024Updated 2 years ago
- WebApp1k benchmark☆16Nov 21, 2025Updated 6 months ago