amazon-science / text_generation_diffusion_llm_topicLinks
Topic Embedding, Text Generation and Modeling using diffusion
☆15Updated 2 months ago
Alternatives and similar repositories for text_generation_diffusion_llm_topic
Users that are interested in text_generation_diffusion_llm_topic are comparing it to the libraries listed below
Sorting:
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 2 years ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated 11 months ago
- Finding semantically meaningful and accurate prompts.☆47Updated last year
- ☆21Updated 2 years ago
- Foundation Models for Data Tasks☆108Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated 2 years ago
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni…☆30Updated 2 years ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems."☆46Updated last year
- ☆21Updated this week
- [EMNLP 2022] Continual Training of Language Models for Few-Shot Learning☆45Updated 2 years ago
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆74Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆33Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆87Updated last year
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated last year
- RL algorithm: Advantage induced policy alignment☆65Updated 2 years ago
- Ensembling Hugging Face transformers made easy☆63Updated 2 years ago
- ☆54Updated 2 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Updated 2 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 11 months ago
- AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning (Zhou et al.; TACL 2024)☆46Updated last year
- Easy modernBERT fine-tuning and multi-task learning☆61Updated last month
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆55Updated 2 years ago
- ☆78Updated 10 months ago
- Confident Adaptive Transformers☆13Updated 4 years ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆24Updated last year
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆108Updated 2 years ago
- Ranking of fine-tuned HF models as base models.☆36Updated 3 months ago
- Few-shot Learning with Auxiliary Data☆31Updated last year
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆76Updated last year
- Hyperparameter tuning via uncertainty modeling☆47Updated last year