2003pro/TAGCOS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/2003pro/TAGCOS)

2003pro / TAGCOS

This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data

☆13

Alternatives and similar repositories for TAGCOS

Users that are interested in TAGCOS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Blue-Raincoat / SelectIT
View on GitHub
☆24Oct 14, 2024Updated last year
nyuolab / llm-knowledge-graphs
View on GitHub
Code repository for our paper, "Medical Large Language Models are Vulnerable to Data Poisoning Attacks" (Nature Medicine, 2024).
☆13Jan 5, 2025Updated last year
namiyousef / argument-mining
View on GitHub
Repository for NLP project. Name to be changed when we decide on a project
☆16Apr 19, 2022Updated 4 years ago
vicgalle / distilled-self-critique
View on GitHub
distilled Self-Critique refines the outputs of a LLM with only synthetic data
☆11Apr 11, 2024Updated 2 years ago
vita-epfl / TAROT
View on GitHub
Official pytorch implementation of ICML2025 "TAROT: Targeted Data Selection via Optimal Transport"
☆31Dec 12, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
baixianghuang / authorship-llm
View on GitHub
Can Large Language Models Identify Authorship? (EMNLP 2024 Findings)
☆13Feb 4, 2025Updated last year
jahuerta92 / authorship-embeddings
View on GitHub
☆15Sep 27, 2022Updated 3 years ago
uwnlp / qasrl_annotation
View on GitHub
Generating Annotation Spreadsheet for QA-SRL Scheme
☆12Feb 14, 2017Updated 9 years ago
JTWang2000 / NICE
View on GitHub
NICE: Non-differentiable evaluation metric-based InfluenCe Estimation
☆16Jul 7, 2025Updated last year
wangbuer1 / Driving-school-reservation-management-system
View on GitHub
基于SSM的驾校预约管理系统1拥有三种角色，分别为管理员、教练、学员，具体功能如下：管理员：学员管理、教练管理、驾校车辆管理、预约管理、取消预约管理、公告管理教练：教练信息查询、预约管理、取消预约管理、注册、个人中心学员：查看教练信息、预约教练、取消预约教练、评…
☆13Jan 11, 2024Updated 2 years ago
mullerpeter / authorstyle
View on GitHub
Python package to deal with PAN corpora and extract stylometric features from text documents.
☆15Nov 11, 2022Updated 3 years ago
tianyi-lab / Superfiltering
View on GitHub
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆189Jun 25, 2025Updated last year
liukai90 / diving-school
View on GitHub
本项目是一款管理驾校和方便学员预约学车的系统
☆15Dec 19, 2017Updated 8 years ago
djdt / ckwrap
View on GitHub
Wrapper for Ckmeans.1d.dp.
☆13Mar 20, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
listentm / CROWDSELECT
View on GitHub
We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…
☆20May 20, 2025Updated last year
robtacconelli / Nacrith-GPU
View on GitHub
Nacrith — Lossless text compression via ensemble neural arithmetic coding. Combines SmolLM2-135M language model with context mixing, adap…
☆22Mar 21, 2026Updated 3 months ago
olostep / olostep-mcp-server
View on GitHub
MCP server for Olostep — the web scraping, crawling, and search infrastructure used by top AI companies. Gives any MCP-compatible AI agen…
☆21Jul 7, 2026Updated last week
FreedomIntelligence / DPTDR
View on GitHub
Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
☆26Aug 7, 2023Updated 2 years ago
SabbaghCodes / ImbalancedLearningForSingleCellFoundationModels
View on GitHub
Code for the benchmarking single-cell foundation models (scGPT, scBERT, and Geneformer) for cell-type annotation task using skewed single…
☆16Dec 8, 2024Updated last year
xypan0 / G-DIG
View on GitHub
☆12Jun 30, 2024Updated 2 years ago
yichengchen24 / MIG
View on GitHub
[ACL2025 Findings] Official code for MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Spac…
☆28Aug 30, 2025Updated 10 months ago
zyxxmu / DSnoT
View on GitHub
Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…
☆50Apr 9, 2024Updated 2 years ago
CederGroupHub / synthesis-paragraph-classifier
View on GitHub
Codes and models for "Semi-supervised machine-learning classification of materials synthesis procedures". (https://doi.org/10.1038/s41524…
☆10Apr 24, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
UKPLab / acl2019-GPPL-humour-metaphor
View on GitHub
☆14Sep 30, 2022Updated 3 years ago
logix-project / logix
View on GitHub
AI Logging for Interpretability and Explainability🔬
☆147Jun 7, 2024Updated 2 years ago
zhiyuan1i / TorchRWKV
View on GitHub
RWKV6 in native pytorch and triton:)
☆11Aug 4, 2024Updated last year
princeton-nlp / benign-data-breaks-safety
View on GitHub
☆47Oct 1, 2024Updated last year
icantnamemyself / FormerTime
View on GitHub
☆10Feb 21, 2023Updated 3 years ago
coderfreestyle / Optimizing-LeCar-and-Convolutional-Neural-Network-Approaches-for-Cache-Replacement-Policy
View on GitHub
☆10May 1, 2019Updated 7 years ago
cx0 / geneformer-finetune
View on GitHub
☆13Jun 4, 2023Updated 3 years ago
juditacs / snippets
View on GitHub
Python snippets
☆21Mar 10, 2020Updated 6 years ago
bflashcp3f / SynKB
View on GitHub
EMNLP 2022 Demo "SynKB: Semantic Search for Chemical Synthesis Procedures"
☆17Oct 31, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pixas / TAIA_LLM
View on GitHub
☆17Nov 1, 2024Updated last year
locuslab / scaling_laws_data_filtering
View on GitHub
☆64Apr 9, 2024Updated 2 years ago
sumanyumuku98 / RL-CAR
View on GitHub
B.Tech Thesis Code for RLCaR: Deep Reinforcement Learning Framework for Optimal and Adaptive Cache Replacement
☆13Oct 25, 2020Updated 5 years ago
tianyi-lab / Cherry_LLM
View on GitHub
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆416Jun 25, 2025Updated last year
luisroque / bench
View on GitHub
☆14Jan 22, 2025Updated last year
aladinD / SafeMERGE
View on GitHub
Code for SafeMERGE (ICLR 2025).
☆15Apr 1, 2025Updated last year
OlegZero13 / Data-Science-Algorithm-Gallery
View on GitHub
Bare-bone implementation of algorithms and explanations.
☆17Jun 22, 2022Updated 4 years ago