zhichaoxu-shufe / context-aware-decoding-qfsLinks

☆12

Alternatives and similar repositories for context-aware-decoding-qfs

Users that are interested in context-aware-decoding-qfs are comparing it to the libraries listed below

Sorting:

kaistAI / InstructIR
IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32Updated last year
yxuansu / Contrastive_Search_versus_Contrastive_Decoding
An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation
☆27Updated last year
yuzhaouoe / pretraining-data-packing
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆22Updated 11 months ago
FreedomIntelligence / DPTDR
Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
☆25Updated last year
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆58Updated 2 years ago
Yuanhy1997 / HyPe
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14Updated 2 years ago
NonvolatileMemory / flash_attn_gqa
triton ver of gqa flash attn, based on the tutorial
☆11Updated 11 months ago
cooelf / CompassMTL
Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)
☆22Updated 2 years ago
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆30Updated 2 weeks ago
ThomasScialom / T0_continual_learning
Adding new tasks to T0 without catastrophic forgetting
☆33Updated 2 years ago
swarnaHub / SummarizationPrograms
[ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees
☆24Updated 2 years ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆43Updated last year
gmftbyGMFTBY / Rep-Dropout
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
☆33Updated last year
RUCAIBox / ELMER
This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…
☆26Updated 2 years ago
princeton-nlp / ShortcutGrammar
EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560
☆58Updated 4 months ago
kaistAI / factual-knowledge-acquisition
☆20Updated 2 months ago
whyNLP / Probabilistic-Transformer
A probabilitic model for contextual word representation. Accepted to ACL2023 Findings.
☆23Updated last year
RUCAIBox / BAMBOO
☆35Updated last year
LuLuLuyi / LongHeads
LongHeads: Multi-Head Attention is Secretly a Long Context Processor
☆29Updated last year
yixinL7 / SumLLM
Repo for "On Learning to Summarize with Large Language Models as References"
☆44Updated 2 years ago
Tiiiger / templm
Code release for "TempLM: Distilling Language Models into Template-Based Generators"
☆14Updated 2 years ago
lyutyuh / structured-span-selector
A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…
☆21Updated 3 years ago
GSYfate / knnlm-limits
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆23Updated 2 months ago
cliang1453 / CAMERO
CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing (ACL 2022)
☆10Updated 3 years ago
wyu97 / RACo
Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.
☆22Updated 2 years ago
orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆45Updated last year
psunlpgroup / XSemPLR
Data and code for ACL 2023 paper XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations
☆9Updated 2 years ago
yumeng5 / FewGen
[ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning
☆42Updated 2 years ago
jzbjyb / ReAtt
Retrieval as Attention
☆83Updated 2 years ago
dqxiu / KAssess
☆14Updated last year