This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
☆13 · Jul 21, 2024 · Updated last year
Alternatives and similar repositories for TAGCOS
Users who are interested in TAGCOS are comparing it to the repositories listed below.
- ☆24 · Oct 14, 2024 · Updated last year
- ☆42 · Feb 12, 2026 · Updated 2 weeks ago
- Official pytorch implementation of ICML2025 "TAROT: Targeted Data Selection via Optimal Transport" · ☆28 · Dec 12, 2024 · Updated last year
- Debiasing Through Data Attribution · ☆12 · May 23, 2024 · Updated last year
- Use ChatGPT for free without registration; follow the WeChat official account 【胖竹同学】. · ☆10 · Apr 4, 2023 · Updated 2 years ago
- ☆44 · Oct 1, 2024 · Updated last year
- Repo allows users to test different DL architectures when applied to time series forecasting of weather data (TCN, LSTM, BiLSTM, GRU, Bi… · ☆19 · Mar 14, 2025 · Updated 11 months ago
- VQ-TR repository · ☆12 · Apr 18, 2024 · Updated last year
- A simple and efficient baseline for data attribution · ☆11 · Nov 10, 2023 · Updated 2 years ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba… · ☆15 · Aug 15, 2025 · Updated 6 months ago
- ACL24 · ☆11 · Jun 7, 2024 · Updated last year
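- SSM-based driving school appointment management system 1, with three roles: administrator, coach, and student. Functions are as follows. Administrator: student management, coach management, driving school vehicle management, appointment management, appointment cancellation management, announcement management. Coach: coach information lookup, appointment management, appointment cancellation management, registration, personal center. Student: view coach information, book a coach, cancel a coach booking, … · ☆13 · Jan 11, 2024 · Updated 2 years ago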
- ☆13 · Jan 22, 2025 · Updated last year
- Distilled Self-Critique refines the outputs of an LLM with only synthetic data · ☆11 · Apr 11, 2024 · Updated last year
- ☆10 · Feb 21, 2023 · Updated 3 years ago
- An automated feature engineering framework 'FETCH' accepted in ICLR 2023. · ☆11 · Jun 20, 2023 · Updated 2 years ago
- ☆10 · Jun 10, 2023 · Updated 2 years ago
- Example code for the NNGeometry PyTorch library · ☆10 · Aug 20, 2025 · Updated 6 months ago
- The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models · ☆12 · Oct 28, 2024 · Updated last year
- ☆10 · Oct 20, 2023 · Updated 2 years ago
- Graphical user interface for text-guided face editing · ☆11 · Jan 18, 2023 · Updated 3 years ago
- Code accompanying our ICML 2020 paper on choice set optimization in group decision-making. · ☆11 · Jun 27, 2020 · Updated 5 years ago
- Code repository for our paper, "Medical Large Language Models are Vulnerable to Data Poisoning Attacks" (Nature Medicine, 2024). · ☆12 · Jan 5, 2025 · Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems · ☆14 · Nov 28, 2024 · Updated last year
- ☆12 · Nov 2, 2021 · Updated 4 years ago
- [ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration" · ☆10 · Jul 30, 2024 · Updated last year
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration · ☆15 · Nov 18, 2025 · Updated 3 months ago
- Can Large Language Models Identify Authorship? (EMNLP 2024 Findings) · ☆12 · Feb 4, 2025 · Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning · ☆189 · Jun 25, 2025 · Updated 8 months ago
- RWKV6 in native pytorch and triton:) · ☆11 · Aug 4, 2024 · Updated last year
- Codes and models for "Semi-supervised machine-learning classification of materials synthesis procedures". (https://doi.org/10.1038/s41524… · ☆10 · Apr 24, 2022 · Updated 3 years ago
- B.Tech Thesis Code for RLCaR: Deep Reinforcement Learning Framework for Optimal and Adaptive Cache Replacement · ☆12 · Oct 25, 2020 · Updated 5 years ago
- ☆12 · Oct 2, 2023 · Updated 2 years ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library. · ☆13 · Jan 9, 2024 · Updated 2 years ago
- An Empirical Study of Memorization in NLP (ACL 2022) · ☆13 · Jun 22, 2022 · Updated 3 years ago
- ☆10 · May 1, 2023 · Updated 2 years ago
- Wrapper for Ckmeans.1d.dp. · ☆13 · Mar 20, 2025 · Updated 11 months ago
- ☆13 · Nov 22, 2024 · Updated last year
- ☆12 · Jul 16, 2025 · Updated 7 months ago