amazon-science / text_generation_diffusion_llm_topic
Topic Embedding, Text Generation and Modeling using diffusion
☆15Updated 8 months ago
Alternatives and similar repositories for text_generation_diffusion_llm_topic
Users who are interested in text_generation_diffusion_llm_topic are comparing it to the repositories listed below.
- Foundation Models for Data Tasks☆110Updated 2 years ago
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni…☆30Updated 2 years ago
- Finding semantically meaningful and accurate prompts.☆48Updated 2 years ago
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆77Updated last year
- ☆24Updated last month
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆39Updated 6 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆15Updated 2 years ago
- ☆38Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated last year
- Easy modernBERT fine-tuning and multi-task learning☆63Updated 7 months ago
- ACL 2023 (Findings) - BertNet: Harvesting Knowledge Graphs from Pretrained Language Models☆107Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Updated 2 years ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆40Updated 10 months ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems."☆46Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆44Updated 2 months ago
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Updated last year
- Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding☆88Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24Updated 3 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆57Updated 2 years ago
- 😜 Contrastive Learning of Sentence Embedding using LoRA (EECS487 final project)☆13Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 3 years ago
- ☆44Updated 2 years ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Updated 2 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆16Updated 2 years ago
- ☆13Updated 3 years ago
- [EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code☆80Updated last year
- [ACL 2023] Code for ContraCLM: Contrastive Learning For Causal Language Model☆35Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Updated 2 years ago
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆75Updated last year