Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"
☆25Nov 16, 2023Updated 2 years ago
Alternatives and similar repositories for Active-IT
Users that are interested in Active-IT are comparing it to the libraries listed below
Sorting:
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection☆14Jun 22, 2023Updated 2 years ago
- ☆147Apr 16, 2024Updated last year
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆512Oct 20, 2024Updated last year
- Constrained Decoding Project☆20Nov 10, 2023Updated 2 years ago
- General system research material (not limited to paper) reading notes.☆22Mar 17, 2021Updated 4 years ago
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆29May 15, 2024Updated last year
- AI Logging for Interpretability and Explainability🔬☆140Jun 7, 2024Updated last year
- Source code for paper "Conversational Question Answering over Knowledge Graphs with Transformer and Graph Attention Networks"☆33Jul 23, 2023Updated 2 years ago
- ☆64Apr 9, 2024Updated last year
- An open-source session replay tool for single-page applications that uses AI analysis, aggregated trends, and a RAG chatbot to help devel…☆11Jan 23, 2026Updated last month
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Jul 2, 2024Updated last year
- Simplifies data migration between Apache Ignite clusters by relying on Apache Avro as an intermediate storage format☆13Jun 27, 2023Updated 2 years ago
- A 7B parameter model for mathematical reasoning☆42Feb 17, 2025Updated last year
- Python package designed for single-cell calling cards data☆16Jun 8, 2025Updated 8 months ago
- A Data-Driven Approach to Predict the Success of Bank Telemarketing☆10Apr 27, 2021Updated 4 years ago
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression"☆12Jun 14, 2023Updated 2 years ago
- Converter from UD-trees to BART representation☆36Mar 6, 2024Updated last year
- ☆11May 24, 2024Updated last year
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆11Apr 14, 2022Updated 3 years ago
- ☆10Jul 16, 2023Updated 2 years ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆90Nov 13, 2024Updated last year
- Guide to interviewing for industry machine learning roles (data/applied/research scientist, ML engineer, etc).☆11Dec 28, 2022Updated 3 years ago
- ☆10Aug 15, 2022Updated 3 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- Automaton & Cognition☆16Apr 14, 2024Updated last year
- ☆13May 9, 2024Updated last year
- ☆31Sep 19, 2025Updated 5 months ago
- ☆13Aug 11, 2024Updated last year
- AttentionDTA: prediction of drug–target binding affinity using attention model.https://ieeexplore.ieee.org/abstract/document/8983125☆13Aug 29, 2020Updated 5 years ago
- ElectronJS app to use Groq's Whisper model from a terminal on the desktop.☆11Feb 17, 2026Updated last week
- first attempt at description2code from 2016☆10Nov 15, 2018Updated 7 years ago
- [Ongoing Project] Codebase for network quantization study.☆12May 20, 2020Updated 5 years ago
- This is the code for reproducing the TABBIE baseline in our paper: "Retrieval-Based Transformer for Table Augmentation"☆12Sep 17, 2025Updated 5 months ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 2 years ago
- A smart distributed crawler that infers navigation models of structured websites, used to cluster pages based on their structure and extr…☆10Aug 17, 2025Updated 6 months ago