Survey on Data-centric Large Language Models
☆92Jul 8, 2024Updated last year
Alternatives and similar repositories for Data-centric_multimodal_LLM
Users that are interested in Data-centric_multimodal_LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A list of papers about data quality in Large Language Models (LLMs)☆27Dec 14, 2023Updated 2 years ago
- This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.☆48Aug 22, 2025Updated 9 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 7 months ago
- Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)☆19Jul 1, 2025Updated 10 months ago
- ☆16Sep 4, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official pytorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting [TOMM 2025]☆25Nov 24, 2025Updated 6 months ago
- ☆110Sep 11, 2025Updated 8 months ago
- WisdoMentor - Series: A LLM for undergraduates | 博导智言(辅助大学生 学习)☆13May 9, 2024Updated 2 years ago
- ☆15Mar 15, 2024Updated 2 years ago
- Official Implementation of paper "Distilling Long-tailed Datasets" [CVPR 2025]☆21Aug 13, 2025Updated 9 months ago
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)☆89Sep 23, 2025Updated 8 months ago
- ☆14Apr 21, 2023Updated 3 years ago
- Official PyTorch implementation for Distilling Dataset into Neural Field [ICLR 2025]☆16Mar 20, 2025Updated last year
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)☆37Jul 16, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ContextBLIP : Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions [ACL 2024]☆11May 17, 2024Updated 2 years ago
- PaperPub is an academic arena where diverse AI Agents read papers daily, pick apart each other's arguments, and fiercely debate.☆43Apr 17, 2026Updated last month
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆18Mar 18, 2026Updated 2 months ago
- ☆15Oct 4, 2024Updated last year
- ☆23Jan 16, 2024Updated 2 years ago
- Code for "Mixture-based feature space learning for few-shot image classification"-ICCV'2021.☆14Oct 13, 2021Updated 4 years ago
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆83Oct 17, 2025Updated 7 months ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- ☆12Jun 13, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Easy Data Preparation with latest LLMs-based Operators and Pipelines.☆3,981Updated this week
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Apr 25, 2026Updated last month
- Code and data from the paper 'Human Feedback is not Gold Standard'☆21May 5, 2026Updated 3 weeks ago
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…☆68Oct 27, 2024Updated last year
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- ☆18Mar 2, 2026Updated 2 months ago
- Official code for ICLR 2024 paper "Do Generated Data Always Help Contrastive Learning?"☆31Apr 4, 2024Updated 2 years ago
- Code for our ICML'24 on multimodal dataset distillation☆43Oct 11, 2024Updated last year
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆42Nov 19, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A python script for downloading huggingface datasets and models.☆20Apr 10, 2025Updated last year
- [ECCV 2024] About The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network“.☆102Jul 8, 2025Updated 10 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆36Aug 28, 2025Updated 8 months ago
- Official repo for EscapeCraft (an 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.☆39Jul 7, 2025Updated 10 months ago
- ☆27Jul 10, 2025Updated 10 months ago
- awsome ai tools☆12Apr 21, 2023Updated 3 years ago
- ☆16Feb 4, 2025Updated last year