yixuantt / PoolingAndAttnLinks

"Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?"

☆36

Alternatives and similar repositories for PoolingAndAttn

Users that are interested in PoolingAndAttn are comparing it to the libraries listed below

Sorting:

orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆81Updated 2 months ago
jakespringer / echo-embeddings
☆151Updated last year
voidism / Lookback-Lens
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
☆128Updated 11 months ago
orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆45Updated last year
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80Updated last year
tianyi-lab / MoE-Embedding
Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"
☆74Updated 9 months ago
giangdip2410 / HyperRouter
Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
☆33Updated last year
dwzhu-pku / LongEmbed
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
☆138Updated 8 months ago
ytyz1307zzh / RefAug
Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"
☆55Updated 9 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 10 months ago
trapoom555 / Language-Model-STS-CFT
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆64Updated 11 months ago
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79Updated 9 months ago
declare-lab / flacuna
Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…
☆111Updated last year
liuqi6777 / llm4ranking
Large language models for document ranking.
☆62Updated 2 months ago
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆116Updated last year
Raibows / Learn-to-Reason
Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023
☆35Updated last year
kaistAI / LangBridge
[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision
☆90Updated 8 months ago
jeffreysijuntan / lloco
The official repo for "LLoCo: Learning Long Contexts Offline"
☆117Updated last year
SALT-NLP / demonstrated-feedback
☆124Updated 9 months ago
snu-mllab / Context-Memory
Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)
☆61Updated last year
David-Li0406 / Preference-Leakage
☆45Updated last month
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆112Updated 5 months ago
tianyang-x / SaySelf
Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"
☆107Updated 9 months ago
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆55Updated 2 weeks ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆86Updated last year
rosewang2008 / backtracing
Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.
☆89Updated 11 months ago
nlp-uoregon / ullme
☆20Updated 3 months ago
ahans30 / goldfish-loss
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆90Updated 8 months ago
bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆133Updated 6 months ago
tigerchen52 / awesome_role_of_small_models
a curated list of the role of small models in the LLM era
☆102Updated 9 months ago