duykhuongnguyen / LASeR-MAB
Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"
☆13Updated 3 months ago
Alternatives and similar repositories for LASeR-MAB:
Users who are interested in LASeR-MAB are comparing it to the libraries listed below
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated 10 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆31Updated 8 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated 2 weeks ago
- ☆14Updated 10 months ago
- ☆15Updated 5 months ago
- ☆26Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs?☆26Updated 7 months ago
- ☆16Updated 6 months ago
- Adding new tasks to T0 without catastrophic forgetting☆32Updated 2 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆15Updated last year
- ☆12Updated 4 months ago
- Long Context Extension and Generalization in LLMs☆39Updated 3 months ago
- [ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning☆39Updated last year
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"☆11Updated 4 months ago
- ☆18Updated 7 months ago
- Data Valuation on In-Context Examples (ACL23)☆23Updated 2 months ago
- Codebase for Instruction Following without Instruction Tuning☆33Updated 3 months ago
- Repository for Skill Set Optimization☆12Updated 5 months ago
- Tasks for describing differences between text distributions.☆16Updated 5 months ago
- ☆46Updated 6 months ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆11Updated 5 months ago
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆22Updated last year
- official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization☆13Updated last year
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆22Updated 3 months ago
- The repository contains code for Adaptive Data Optimization☆21Updated last month
- Efficient Scaling laws and collaborative pretraining.☆13Updated last month
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆25Updated 8 months ago
- ☆15Updated 11 months ago
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Updated last year
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆19Updated 8 months ago