amazon-science / text_generation_diffusion_llm_topic
Topic Embedding, Text Generation and Modeling using diffusion
☆14Updated last week
Alternatives and similar repositories for text_generation_diffusion_llm_topic
Users who are interested in text_generation_diffusion_llm_topic are comparing it to the libraries listed below.
- Few-shot Learning with Auxiliary Data☆28Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆43Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 2 years ago
- ☆29Updated last year
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆55Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- Aioli: A unified optimization framework for language model data mixing☆27Updated 4 months ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated 8 months ago
- Simple and scalable tools for data-driven pretraining data selection.☆24Updated 3 months ago
- Retrieval as Attention☆82Updated 2 years ago
- Self-Supervised Alignment with Mutual Information☆19Updated last year
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni…☆29Updated last year
- A Toolkit for Distributional Control of Generative Models☆73Updated last year
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆22Updated last year
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Updated 3 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆20Updated 4 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Updated 2 years ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated 2 years ago
- This is the official project for our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers☆30Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆19Updated 2 months ago
- Entailment self-training☆25Updated 2 years ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Updated last year
- ☆75Updated 8 months ago
- ☆20Updated last month
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (arXiv 2401.01335)☆29Updated last year
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main)☆23Updated 3 weeks ago
- ☆13Updated last year
- Learning adapter weights from task descriptions☆18Updated last year