Jiachen-T-Wang / GREATSView external linksLinks
β17Mar 23, 2025Updated 10 months ago
Alternatives and similar repositories for GREATS
Users that are interested in GREATS are comparing it to the libraries listed below
Sorting:
- Official Code Repository for [AutoScaleπ: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*β¦β13Aug 8, 2025Updated 6 months ago
- β10Oct 20, 2023Updated 2 years ago
- β32Feb 11, 2025Updated last year
- This is an official repository for "Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources" (β¦β14Oct 26, 2023Updated 2 years ago
- Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"β21Feb 29, 2024Updated last year
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"β25Dec 12, 2023Updated 2 years ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.β108Updated this week
- Aioli: A unified optimization framework for language model data mixingβ32Jan 17, 2025Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Modelsβ48Oct 31, 2023Updated 2 years ago
- Data Valuation without Training of a Model, submitted to ICLR'23β22Dec 30, 2022Updated 3 years ago
- A Survey on Data Selection for Language Modelsβ254Apr 29, 2025Updated 9 months ago
- β32May 24, 2023Updated 2 years ago
- This repository shares undergraduate course materials for the Electronic Information Engineering program at the University of Science andβ¦β63Oct 23, 2025Updated 3 months ago
- β43Oct 13, 2023Updated 2 years ago
- Trending projects & awesome papers about data-centric llm studies.β40May 20, 2025Updated 8 months ago
- Code for the paper "Efficient Dataset Distillation using Random Feature Approximation"β37Feb 24, 2023Updated 2 years ago
- OpenVLA for AIRBOTβ14Aug 15, 2024Updated last year
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.β13Jun 17, 2024Updated last year
- Debiasing Through Data Attributionβ12May 23, 2024Updated last year
- Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisitedβ37Dec 27, 2022Updated 3 years ago
- β43Aug 26, 2024Updated last year
- Example code for the NNGeometry PyTorch libraryβ10Aug 20, 2025Updated 5 months ago
- Code for using the Grasp Affordance Reasoning datasetβ10Sep 17, 2019Updated 6 years ago
- This repository is a version of https://github.com/dimatura/pypcd for Python 3β12Jun 26, 2020Updated 5 years ago
- 3D Scene Annotation and Dataset Toolkitβ10Jun 11, 2023Updated 2 years ago
- Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and β¦β12Sep 5, 2023Updated 2 years ago
- β11Oct 20, 2023Updated 2 years ago
- PyTorch implementation for the Neural Logic Machines (NLM).β11May 7, 2019Updated 6 years ago
- ACL24β11Jun 7, 2024Updated last year
- Code for RA-L paper "One-shot Learning for Task-oriented Grasping"β12May 9, 2024Updated last year
- β11Oct 20, 2023Updated 2 years ago
- A simple and efficient baseline for data attributionβ11Nov 10, 2023Updated 2 years ago
- [NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generationβ12Mar 5, 2025Updated 11 months ago
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedbackβ17Oct 15, 2025Updated 4 months ago
- β12Apr 22, 2024Updated last year
- A terminal plotter for tensorboard and csvβ10Jul 20, 2024Updated last year
- Exploring Pose-Guided Imitation Learning for Robotic Precise Insertionβ20May 15, 2025Updated 9 months ago
- Code for paper "Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs"β12Jun 11, 2025Updated 8 months ago
- KRF: Keypoint Refinement with Fusion Network for 6D Pose Estimationβ16Nov 9, 2024Updated last year