[ACL 2025 Main] Official Repo for Paper "Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric"
☆36Feb 10, 2026Updated 3 weeks ago
Alternatives and similar repositories for NovelSum
Users that are interested in NovelSum are comparing it to the libraries listed below
Sorting:
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆30Oct 9, 2025Updated 5 months ago
- ☆16Jul 7, 2025Updated 8 months ago
- Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning☆86Dec 14, 2023Updated 2 years ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆90Nov 13, 2024Updated last year
- This repository contains codes for *Sem 2023 paper “Generative Data Augmentation for Aspect Sentiment Quad Prediction”.☆11May 30, 2023Updated 2 years ago
- Source code for SWIFT, an efficient reward model.☆18Jan 13, 2026Updated last month
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- EANN(Pytorch)☆10Mar 12, 2022Updated 3 years ago
- Complete set of English dialect transformation rules and evaluation code☆16Jun 7, 2024Updated last year
- [ECAI 2023] QCCDM: A Q-Augmented Causal Cognitive Diagnosis Model for Student Learning☆12Aug 4, 2023Updated 2 years ago
- Library for Financial Applications (WP5)☆10Feb 26, 2025Updated last year
- ☆12Nov 9, 2018Updated 7 years ago
- Chromosomes from karyotype images☆11May 29, 2019Updated 6 years ago
- 擂台赛3-大规模预训练调优比赛的示例代码与baseline实现☆37Sep 27, 2022Updated 3 years ago
- This is an official pytorch implementation of 'Group-wise Inhibition based Feature Regularization for Robust Classification' (ICCV 2021 a…☆10Dec 10, 2022Updated 3 years ago
- ☆11Mar 10, 2017Updated 8 years ago
- Scalable Quantum Neural Network builds and trains a large-scale QNN in a modular fashion. SQNN is evaluated with a binary classification …☆12Oct 4, 2023Updated 2 years ago
- Can Large Language Models Identify Authorship? (EMNLP 2024 Findings)☆12Feb 4, 2025Updated last year
- ☆13Mar 25, 2022Updated 3 years ago
- Code for the paper "Greed is All You Need: An Evaluation of Tokenizer Inference Methods"☆13Nov 26, 2024Updated last year
- Simulation code and data of the paper - cold start to improve market thickness☆12Jan 30, 2026Updated last month
- ☆12Jan 7, 2020Updated 6 years ago
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- [ICLR 2025 SCI-FM Workshop] Lemur: Log Parsing with Entropy Sampling and Chain-of-Thought Merging☆13Mar 27, 2025Updated 11 months ago
- Code for ICML 2022 paper: Achieving Fairness at No Utility Cost via Data Reweighing with Influence☆11Aug 3, 2022Updated 3 years ago
- [EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation☆14Aug 20, 2025Updated 6 months ago
- 蚂蚁金融自然语言处理竞赛。☆10Sep 3, 2018Updated 7 years ago
- ☆21Jul 12, 2025Updated 7 months ago
- Repository containing the group project Wind Power Forecasting for DTU's 02456 Deep Learning.☆13Apr 7, 2022Updated 3 years ago
- ☆12Nov 22, 2022Updated 3 years ago
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals☆11Jan 8, 2026Updated 2 months ago
- An example for deploying Tensorflow 2 models with Docker and Fast API☆10Sep 30, 2022Updated 3 years ago
- An experiment with modern C++, suffix trees, and Ukkonen's algorithm for suffix tree construction.☆12Mar 15, 2019Updated 6 years ago
- https://arxiv.org/abs/2502.08942☆17Mar 31, 2025Updated 11 months ago
- 2018BDCI汽车行业用户观点主题及情感识别rank27☆11Jan 23, 2019Updated 7 years ago
- ☆10Jun 21, 2021Updated 4 years ago
- ☆13Dec 13, 2023Updated 2 years ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆14Dec 30, 2024Updated last year
- This is a repository containing code for a hybrid quantum-classical transformer model from the paper: A Hybrid Transformer Architecture w…☆20Mar 6, 2025Updated last year