kevinscaria/TarGEN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kevinscaria/TarGEN)

kevinscaria / TarGEN

Targeted Data Generation with Large Language Models

☆19

Alternatives and similar repositories for TarGEN

Users that are interested in TarGEN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HKUNLP / ProGen
View on GitHub
[EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.
☆27Feb 4, 2023Updated 3 years ago
ASTRAL-Group / LoRe
View on GitHub
When Reasoning Meets Its Laws
☆38Jan 2, 2026Updated 6 months ago
feiyang-k / AutoScale
View on GitHub
Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…
☆14Aug 8, 2025Updated 11 months ago
AI9Stars / AStar-Thought
View on GitHub
[NeurIPS 2025] A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings
☆16Jun 12, 2026Updated last month
Jikai0Wang / Speculative_CoT
View on GitHub
☆20May 14, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
SEU-WDS / MachineLearningCourses
View on GitHub
暑期机器学习讨论班是由张祥老师组织发起，全体学生参与的讨论交流活动。目的是让学生巩固机器学习基本算法，掌握基本原理和使用。组织形式为学生选题并制作PPT，采用演讲的形式授课给全体参与学生和导师。
☆10Sep 19, 2018Updated 7 years ago
him1411 / edgar10q-dataset
View on GitHub
EDGAR10-Q Dataset and implementation of the paper Context NER
☆17Sep 29, 2023Updated 2 years ago
MBZUAI-CLeaR / IoE-Prompting
View on GitHub
☆11Feb 28, 2024Updated 2 years ago
StefanHeng / ProgGen
View on GitHub
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
☆17Mar 29, 2024Updated 2 years ago
tianyi-lab / RuleR
View on GitHub
[NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling
☆14Sep 27, 2025Updated 10 months ago
OFA-Sys / DiverseEvol
View on GitHub
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
☆88Dec 14, 2023Updated 2 years ago
AI4fun / DQ-LoRe
View on GitHub
☆13Jun 26, 2024Updated 2 years ago
xiangyue9607 / Sentence-LDP
View on GitHub
Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"
☆12Feb 20, 2023Updated 3 years ago
yangyifei729 / LaCo
View on GitHub
Official implementation for LaCo (EMNLP 2024 Findings)
☆22Oct 3, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
euiin / SMART
View on GitHub
SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…
☆12Jul 9, 2025Updated last year
cxcscmu / Montessori-Instruct
View on GitHub
Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]
☆51Jan 24, 2025Updated last year
Xnhyacinth / NesyCD
View on GitHub
[AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks
☆12Jun 19, 2025Updated last year
wang8740 / MAP
View on GitHub
Documentation at
☆14Mar 27, 2025Updated last year
alon-albalak / FLAD
View on GitHub
Few-shot Learning with Auxiliary Data
☆31Dec 8, 2023Updated 2 years ago
aster2024 / SWIFT
View on GitHub
Source code for SWIFT, an efficient reward model.
☆21Jan 13, 2026Updated 6 months ago
Littleor / Personalized-DMER
View on GitHub
Source codes for the paper "Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning" (PDMER) which p…
☆14Mar 24, 2025Updated last year
F2-Song / Weak-to-Strong-Decoding
View on GitHub
The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"
☆22Jun 26, 2025Updated last year
iliaschalkidis / lmtc-emnlp2020
View on GitHub
An Empirical Study on Large-Scale Multi-Label Text Classification including Few and Zero-Shot Labels
☆19Jul 24, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
QwenLM / Self-Lengthen
View on GitHub
☆98Nov 6, 2024Updated last year
chuhac / Reasoning-to-Defend
View on GitHub
[EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
☆12Aug 22, 2025Updated 11 months ago
microsoft / dp-few-shot-generation
View on GitHub
☆28Nov 28, 2023Updated 2 years ago
hlml / fortuitous_forgetting
View on GitHub
☆19Apr 16, 2022Updated 4 years ago
IST-DASLab / SparseFinetuning
View on GitHub
Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry
☆43Jan 15, 2024Updated 2 years ago
LUMIA-Group / MemoryDecoder
View on GitHub
The official implementation of the paper "Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models" (NeurIPS 2025 Pos…
☆76Sep 29, 2025Updated 10 months ago
IreneZihuiLi / HiPool
View on GitHub
Hierarchical Models for long document encoding
☆22May 29, 2023Updated 3 years ago
LLM-MI-Research / Actionable-MI
View on GitHub
☆15Jan 20, 2026Updated 6 months ago
limenlp / safer-instruct
View on GitHub
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Feb 22, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
alycialee / beyond-scale-language-data-diversity
View on GitHub
☆13Jul 22, 2026Updated last week
yueyu1030 / ReGen
View on GitHub
[ACL'23 Findings] This is the code repo for our ACL'23 Findings paper "ReGen: Zero-Shot Text Classification via Training Data Generation …
☆24Sep 8, 2023Updated 2 years ago
lucidrains / transformer-lm-gan
View on GitHub
Explorations into adversarial losses on top of autoregressive loss for language modeling
☆41Dec 21, 2025Updated 7 months ago
coastalcph / zeroshot_lexglue
View on GitHub
Zero-shot evaluation on LEXGLUE tasks with GTP3.5
☆29Mar 11, 2023Updated 3 years ago
drogozhang / LED
View on GitHub
Source code of paper 'LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval' (WWW 2023)
☆22Aug 28, 2023Updated 2 years ago
SumilerGAO / SunGen
View on GitHub
☆28Feb 26, 2023Updated 3 years ago
AgMMU / AgMMU
View on GitHub
☆19Jul 28, 2025Updated last year