DataFlex is a data-centric training framework that enhances model performance by selecting the most influential samples, optimizing sample weights, or adjusting data mixing ratios.
☆168Apr 8, 2026Updated this week
Alternatives and similar repositories for DataFlex
Users who are interested in DataFlex are comparing it to the libraries listed below.
- Survey on Data-centric Large Language Models☆94Jul 8, 2024Updated last year
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated 11 months ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Jun 13, 2025Updated 9 months ago
- [ICDE 2026] Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL☆30Mar 25, 2026Updated 2 weeks ago
- ☆111Sep 11, 2025Updated 6 months ago
- ☆14May 12, 2023Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 6 months ago
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization☆21May 26, 2025Updated 10 months ago
- ☆13Jan 22, 2025Updated last year
- ☆82Oct 13, 2025Updated 5 months ago
- ☆28May 24, 2025Updated 10 months ago
- ☆18Nov 28, 2022Updated 3 years ago
- A spoken version of the textual story cloze benchmark☆22Aug 6, 2023Updated 2 years ago
- (ACL2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation☆34May 28, 2025Updated 10 months ago
- Official code of MoSA (Mixture of Sparse Adapters).☆13Dec 14, 2023Updated 2 years ago
- ☆177Apr 15, 2025Updated 11 months ago
- ☆15Apr 11, 2024Updated last year
- A list of papers about data quality in Large Language Models (LLMs)☆27Dec 14, 2023Updated 2 years ago
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆128Dec 25, 2025Updated 3 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆20Oct 14, 2024Updated last year
- ☆13Mar 5, 2025Updated last year
- A PyTorch native library for large model training☆25Apr 1, 2026Updated last week
- ☆16Jul 23, 2024Updated last year
- AlignX-Family is an open-source research suite for advancing personalization in large language models-spanning data, code, models, and be…☆20Jan 12, 2026Updated 2 months ago
- An Open Source implementation of Notebook LM.☆51Apr 3, 2026Updated last week
- ☆38Jan 25, 2026Updated 2 months ago
- ☆15Jan 24, 2025Updated last year
- A lite version of a Douyin-style app built with KiteX+Gin+Gorm+MySQL+Redis+ProtoBuf☆11Aug 15, 2022Updated 3 years ago
- ☆19Mar 21, 2022Updated 4 years ago
- ☆20Nov 3, 2024Updated last year
- Blazingly fast neighborhood attention☆14Nov 28, 2023Updated 2 years ago
- ☆22May 3, 2025Updated 11 months ago
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆50Oct 18, 2024Updated last year
- Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data☆21Aug 6, 2024Updated last year
- Generative Regional Editing (GRE) Benchmark☆19Sep 10, 2024Updated last year
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆55Oct 29, 2024Updated last year
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- INFOCOM 2024: Online Resource Allocation for Edge Intelligence with Colocated Model Retraining and Inference☆34Oct 13, 2024Updated last year
- Transform audio files into mel spectrograms for text-to-speech model training☆12Aug 25, 2021Updated 4 years ago