DataFlex is a data-centric training framework that enhances model performance by selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.
☆114 · Mar 17, 2026 · Updated this week
Alternatives and similar repositories for DataFlex
Users interested in DataFlex are comparing it to the libraries listed below.
- Dataflow-MM, multi-media operators for Dataflow. We aim to prepare data for Multimodal Large Language Models. ☆32 · Mar 10, 2026 · Updated last week
- Survey on Data-centric Large Language Models ☆93 · Jul 8, 2024 · Updated last year
- Easy Data Preparation with latest LLMs-based Operators and Pipelines. ☆2,992 · Mar 12, 2026 · Updated last week
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co… ☆13 · Jan 16, 2026 · Updated 2 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025) ☆11 · Apr 18, 2025 · Updated 11 months ago
- Just a demonstration of some sampling techniques (rejection sampling, importance sampling, sampling importance resampling, Metropolis sam… ☆11 · Aug 24, 2013 · Updated 12 years ago
- AlignX-Family is an open-source research suite for advancing personalization in large language models, spanning data, code, models, and be… ☆20 · Jan 12, 2026 · Updated 2 months ago
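- ⭐️ Course project: machine quantitative analysis based on toshare (includes data collection, preprocessing and modeling, simulated trading and backtesting, and visualization) ☆13 · Oct 6, 2019 · Updated 6 years ago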
- ☆33 · Jul 8, 2025 · Updated 8 months ago
- ☆176 · Apr 15, 2025 · Updated 11 months ago
- ☆19 · Mar 21, 2022 · Updated 4 years ago
- Official implementation of MC-LLaVA. ☆140 · Updated this week
- Generative Regional Editing (GRE) Benchmark ☆19 · Sep 10, 2024 · Updated last year
- Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data ☆21 · Aug 6, 2024 · Updated last year
- ACL'2023: Multi-Task Pre-Training of Modular Prompt for Few-Shot Learning ☆40 · Oct 24, 2022 · Updated 3 years ago
- The Implementation of "AutoNE: Hyperparameter Optimization for Massive Network Embedding" (KDD 2019) ☆17 · Jul 6, 2023 · Updated 2 years ago
- ☆13 · May 12, 2025 · Updated 10 months ago
- The official github repo for the open online courses: "Dive into LLMs". ☆10 · Mar 15, 2024 · Updated 2 years ago
- ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents, NeurIPS 2025 ☆34 · Nov 15, 2025 · Updated 4 months ago
- ☆42 · Mar 13, 2026 · Updated last week
- Official implementation of TINC: Tree-structured Implicit Neural Compression ☆22 · Sep 26, 2023 · Updated 2 years ago