☆43 Oct 13, 2023 Updated 2 years ago
Alternatives and similar repositories for d2pruning
Users that are interested in d2pruning are comparing it to the libraries listed below.
- Code for paper: “What Data Benefits My Classifier?” Enhancing Model Performance and Interpretability through Influence-Based Data Selecti… ☆23 May 17, 2024 Updated 2 years ago
- Data Valuation without Training of a Model, submitted to ICLR'23 ☆22 Dec 30, 2022 Updated 3 years ago
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning" (NeurIPS 2022 Outstanding Paper Award) ☆58 Apr 24, 2023 Updated 3 years ago
- Project for SNARE benchmark ☆11 Jun 5, 2024 Updated 2 years ago
- ☆11 Dec 20, 2020 Updated 5 years ago
- [ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha… ☆15 May 18, 2024 Updated 2 years ago
- ☆13 Dec 12, 2025 Updated 6 months ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024] ☆80 Nov 14, 2024 Updated last year
- [NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning. ☆14 Dec 12, 2024 Updated last year
- A Survey of Dataset Refinement for Problems in Computer Vision Datasets ☆34 Sep 12, 2025 Updated 9 months ago
- ☆10 Feb 6, 2025 Updated last year
- DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models ☆13 Nov 2, 2023 Updated 2 years ago
- ☆33 Mar 24, 2023 Updated 3 years ago
- Code and data from the paper 'Human Feedback is not Gold Standard' ☆21 May 5, 2026 Updated last month
- ☆28 Mar 21, 2024 Updated 2 years ago
- Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection ☆14 Jun 22, 2023 Updated 3 years ago
- [ECCV 2024] Official PyTorch implementation of "HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts" ☆20 Nov 22, 2024 Updated last year
- Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*… ☆14 Aug 8, 2025 Updated 10 months ago
- The official implementation of paper "Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset Pruning" (CVPR … ☆21 Aug 20, 2024 Updated last year
- Ant Financial natural language processing competition. ☆10 Sep 3, 2018 Updated 7 years ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions" ☆12 Aug 26, 2023 Updated 2 years ago
- Code for promptCSE, EMNLP 2022 ☆11 Apr 10, 2023 Updated 3 years ago
- ☆20 Mar 23, 2025 Updated last year
- A fast, effective data attribution method for neural networks in PyTorch ☆242 Nov 18, 2024 Updated last year
- Efficient Multi-Object Tracking for Edge devices ☆14 Nov 8, 2023 Updated 2 years ago
- AAAI 2024, M3D: Dataset Condensation by Minimizing Maximum Mean Discrepancy ☆26 Mar 2, 2024 Updated 2 years ago
- Source code for a LoRA-based continual relation extraction method. ☆14 Sep 25, 2023 Updated 2 years ago
- ☆28 Jul 10, 2025 Updated 11 months ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales" ☆16 Aug 11, 2023 Updated 2 years ago
- ☆32 May 24, 2023 Updated 3 years ago
- ☆15 Apr 13, 2023 Updated 3 years ago
- You Only Condense Once: Two Rules for Pruning Condensed Datasets (NeurIPS 2023) ☆16 Nov 18, 2023 Updated 2 years ago
- ☆22 Jul 20, 2022 Updated 3 years ago
- A Survey on Data Selection for Language Models ☆261 Apr 29, 2025 Updated last year
- ☆30 Apr 12, 2024 Updated 2 years ago
- Awesome-open-world-learning ☆26 Oct 19, 2021 Updated 4 years ago
- Bridging Large Language Models with Scala 3 Functions ☆11 Aug 31, 2024 Updated last year
- Code for KDD feasibility ☆12 Jul 17, 2023 Updated 2 years ago
- Papers about training data quality management for ML models. ☆124 Jun 13, 2026 Updated 2 weeks ago