Few-shot Learning with Auxiliary Data
☆31Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for FLAD
Users that are interested in FLAD are comparing it to the libraries listed below
Sorting:
- ☆11Apr 4, 2023Updated 2 years ago
- ☆19May 6, 2023Updated 2 years ago
- ☆12Apr 22, 2024Updated last year
- Implementation of Implicit Graphon Neural Representation☆12Sep 1, 2023Updated 2 years ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆79Nov 14, 2024Updated last year
- A library implementing the kernels for and experiments using extrinsic gauge equivariant vector field Gaussian Processes☆26Oct 28, 2021Updated 4 years ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆13Jan 9, 2024Updated 2 years ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…☆13Sep 8, 2022Updated 3 years ago
- ☆15Oct 4, 2024Updated last year
- KV Cache & LoRA for minGPT☆57Updated this week
- ☆27Jul 25, 2023Updated 2 years ago
- Code for the ACL 2023 paper: "Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Sc…☆35Sep 16, 2023Updated 2 years ago
- ☆15Sep 7, 2022Updated 3 years ago
- Emotion-Aware Dialogue Response Generation by Multi-Task Learning☆13Jan 22, 2022Updated 4 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- Code fo ICLR2025 paper "A Simple yet Effective DDG Predictor is An Unsupervised Antibody Optimizer and Explainer"☆22Jul 18, 2025Updated 7 months ago
- SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL)☆16Jul 27, 2024Updated last year
- Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection☆14Jun 22, 2023Updated 2 years ago
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 3 years ago
- Arabic edition of ALBERT pretrained language models☆16Apr 25, 2021Updated 4 years ago
- Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…☆13Aug 8, 2025Updated 7 months ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)☆30Oct 25, 2022Updated 3 years ago
- Fast singularity detection with kernel☆38Jan 4, 2024Updated 2 years ago
- https://footprints.baulab.info☆17Oct 4, 2024Updated last year
- ☆19Jan 17, 2024Updated 2 years ago
- Auxiliary tasks for task-oriented dialogue systems. Published in ICNLSP'22 and indexed in the ACL Anthology.☆17Feb 27, 2023Updated 3 years ago
- Targeted Data Generation with Large Language Models☆19Jun 25, 2024Updated last year
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆18Sep 23, 2023Updated 2 years ago
- ☆19Feb 20, 2023Updated 3 years ago
- ☆29Oct 30, 2023Updated 2 years ago
- ☆25May 7, 2025Updated 10 months ago
- ☆20Mar 4, 2025Updated last year
- Super fast implementations of common benchmark text world games☆52Aug 25, 2025Updated 6 months ago
- QLoRA for Masked Language Modeling☆23Sep 11, 2023Updated 2 years ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆24Mar 25, 2025Updated 11 months ago
- Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"☆25Nov 16, 2023Updated 2 years ago
- ☆24Sep 27, 2022Updated 3 years ago
- Code for paper Document-Level Paraphrase Generation with Sentence Rewriting and Reordering by Zhe Lin, Yitao Cai and Xiaojun Wan. This pa…☆26Nov 10, 2021Updated 4 years ago
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators☆26Jul 26, 2023Updated 2 years ago