[ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning
☆44May 10, 2023Updated 2 years ago
Alternatives and similar repositories for FewGen
Users that are interested in FewGen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆69Sep 18, 2022Updated 3 years ago
- The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023.☆14May 27, 2023Updated 2 years ago
- ☆27Feb 26, 2023Updated 3 years ago
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"☆12Feb 20, 2023Updated 3 years ago
- Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds (NAACL'22)☆18Feb 18, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Hierarchical Metadata-Aware Document Categorization under Weak Supervision (WSDM'21)☆45Apr 2, 2024Updated 2 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆34Nov 21, 2021Updated 4 years ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding☆57Feb 14, 2021Updated 5 years ago
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆47Feb 18, 2022Updated 4 years ago
- Code and Data for our EMNLP-2020 paper Weakly-Supervised Aspect-Based Sentiment Analysis via Joint Aspect-Sentiment Topic Embedding.☆49Oct 23, 2020Updated 5 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- The 3 baseline methods for few-shot NER tasks☆57Dec 10, 2021Updated 4 years ago
- ☆12Apr 18, 2025Updated 11 months ago
- ☆24Jun 12, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆23Dec 30, 2025Updated 3 months ago
- The source code for SetExpan framework, published in ECML-PKDD 2017☆32Nov 22, 2021Updated 4 years ago
- [EMNLP 2021] Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training☆65Nov 12, 2021Updated 4 years ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆27Feb 4, 2023Updated 3 years ago
- [NeurIPS 2019] Spherical Text Embedding☆184Oct 29, 2023Updated 2 years ago
- ☆16Aug 14, 2022Updated 3 years ago
- The source code, dataset, and evaluation scripts used for SetRank, published in SIGIR 2018☆15Nov 26, 2021Updated 4 years ago
- Chrome extension for OA sites like arxiv, openreivew: 1. PDF back to abstract page, 2. Rename PDF page with paper title.☆18Oct 12, 2023Updated 2 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for paper OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision☆31May 9, 2022Updated 3 years ago
- An Empirical Study on Large-Scale Multi-Label Text Classification including Few and Zero-Shot Labels☆19Jul 24, 2023Updated 2 years ago
- Comprehensive evaluation framework for Open Information Extraction.☆40Jun 21, 2022Updated 3 years ago
- [ACL'23 Findings] This is the code repo for our ACL'23 Findings paper "ReGen: Zero-Shot Text Classification via Training Data Generation …☆24Sep 8, 2023Updated 2 years ago
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆64Nov 30, 2023Updated 2 years ago
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆41Jun 23, 2024Updated last year
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Hugging Face and Pyserini interoperability☆19May 18, 2023Updated 2 years ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆34Aug 9, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆21Dec 14, 2024Updated last year
- [EMNLP 2020] Text Classification Using Label Names Only: A Language Model Self-Training Approach☆299Feb 2, 2022Updated 4 years ago
- 暑期机器学习讨论班是由张祥老师组织发起,全体学生参与的讨论交流活动。目的是让学生巩固机器学习基本算法,掌握基本原理和使用。组织形式为学生选题并制作PPT,采用演讲的形式授课给全体参与学生和导师。☆10Sep 19, 2018Updated 7 years ago
- ☆20Apr 8, 2025Updated last year
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆21Feb 23, 2021Updated 5 years ago
- [ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners☆131Dec 7, 2022Updated 3 years ago
- Zero-shot evaluation on LEXGLUE tasks with GTP3.5☆29Mar 11, 2023Updated 3 years ago