This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
☆14Jul 21, 2024Updated last year
Alternatives and similar repositories for TAGCOS
Users that are interested in TAGCOS are comparing it to the libraries listed below
- ☆24Oct 14, 2024Updated last year
- Code repository for our paper, "Medical Large Language Models are Vulnerable to Data Poisoning Attacks" (Nature Medicine, 2024).☆12Jan 5, 2025Updated last year
- Official pytorch implementation of ICML2025 "TAROT: Targeted Data Selection via Optimal Transport"☆28Dec 12, 2024Updated last year
- distilled Self-Critique refines the outputs of an LLM with only synthetic data☆11Apr 11, 2024Updated last year
- Repository for NLP project. Name to be changed when we decide on a project☆16Apr 19, 2022Updated 3 years ago
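- SSM-based driving school booking management system 1, with three roles (administrator, coach, and student) and the following functions. Administrator: student management, coach management, school vehicle management, booking management, booking cancellation management, announcement management. Coach: coach information lookup, booking management, booking cancellation management, registration, personal center. Student: view coach information, book a coach, cancel a coach booking, …☆13Jan 11, 2024Updated 2 years ago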
- ☆14Sep 27, 2022Updated 3 years ago
- ☆21Apr 5, 2025Updated 11 months ago
- Can Large Language Models Identify Authorship? (EMNLP 2024 Findings)☆12Feb 4, 2025Updated last year
- We systematically studied the factors that influence LLM-generated benchmarks. By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated 10 months ago
- Generating Annotation Spreadsheet for QA-SRL Scheme☆12Feb 14, 2017Updated 9 years ago
- This project is a system for managing driving schools and making it easy for students to book driving lessons☆15Dec 19, 2017Updated 8 years ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆188Jun 25, 2025Updated 8 months ago
- Python package to deal with PAN corpora and extract stylometric features from text documents.☆15Nov 11, 2022Updated 3 years ago
- Wrapper for Ckmeans.1d.dp.☆13Mar 20, 2025Updated last year
- Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific a…☆18Dec 25, 2024Updated last year
- ☆42Feb 12, 2026Updated last month
- ☆10May 1, 2019Updated 6 years ago
- B.Tech Thesis Code for RLCaR: Deep Reinforcement Learning Framework for Optimal and Adaptive Cache Replacement☆12Oct 25, 2020Updated 5 years ago
- Code for benchmarking single-cell foundation models (scGPT, scBERT, and Geneformer) on the cell-type annotation task using skewed single…☆15Dec 8, 2024Updated last year
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval☆26Aug 7, 2023Updated 2 years ago
- [ACL2025 Findings] Official code for MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Spac…☆28Aug 30, 2025Updated 6 months ago
- Repo that lets users test different DL architectures applied to time series forecasting of weather data (TCN, LSTM, BiLSTM, GRU, Bi…☆20Mar 14, 2025Updated last year
- AI Logging for Interpretability and Explainability🔬☆140Jun 7, 2024Updated last year
- ☆44Oct 1, 2024Updated last year
- Codes and models for "Semi-supervised machine-learning classification of materials synthesis procedures". (https://doi.org/10.1038/s41524…☆10Apr 24, 2022Updated 3 years ago
- ☆13Sep 30, 2022Updated 3 years ago
- Complete original Transformer program☆17Mar 5, 2025Updated last year
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆416Jun 25, 2025Updated 8 months ago
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- ☆13Jun 4, 2023Updated 2 years ago
- ☆10Feb 21, 2023Updated 3 years ago
- ☆64Apr 9, 2024Updated last year
- Python snippets☆21Mar 10, 2020Updated 6 years ago
- DNN_Partition helper tool for simple performance profiling of PyTorch models, with support for model partitioning☆14May 31, 2021Updated 4 years ago
- Using DDPG agent to control UAV system with energy efficiency☆16Jan 7, 2023Updated 3 years ago
- A mobile edge computing server placement algorithm, written from scratch for 5G server placement depending on various KPIs across a ar…☆12Sep 14, 2022Updated 3 years ago
- Proactive Content Caching with Deep Learning☆14Oct 17, 2022Updated 3 years ago
- EMNLP 2022 Demo "SynKB: Semantic Search for Chemical Synthesis Procedures"☆16Oct 31, 2022Updated 3 years ago