"Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?"
☆39Nov 13, 2024Updated last year
Alternatives and similar repositories for PoolingAndAttn
Users that are interested in PoolingAndAttn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The first dense retrieval model that can be prompted like an LM☆92May 8, 2025Updated last year
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆23Jun 2, 2025Updated 11 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆62Feb 10, 2025Updated last year
- ☆21Apr 3, 2026Updated last month
- ☆18Sep 1, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture☆213Jan 6, 2025Updated last year
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆38Jul 19, 2024Updated last year
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Nov 29, 2023Updated 2 years ago
- “Generate to Understand for Representation”☆14Apr 18, 2024Updated 2 years ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆127May 7, 2024Updated 2 years ago
- ☆47Aug 25, 2024Updated last year
- official repository for ListT5☆49Nov 27, 2025Updated 5 months ago
- Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…☆14Mar 30, 2026Updated last month
- Adversaial attack comparative assessment Large Language Model☆13May 21, 2025Updated 11 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- An Infr app that helps you replay & talk to everything you've ever seen.☆15Sep 19, 2023Updated 2 years ago
- XmodelLM☆38Nov 19, 2024Updated last year
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆14Jan 16, 2025Updated last year
- ☆22Oct 22, 2025Updated 6 months ago
- Mixture of Lora Experts☆10Apr 7, 2024Updated 2 years ago
- EnsLoss: Stochastic Calibrated Loss Ensembles for Preventing Overfitting in Classification☆34Nov 1, 2025Updated 6 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Mar 27, 2024Updated 2 years ago
- ☆11Oct 8, 2023Updated 2 years ago
- Code for the paper "Closing the Curious Case of Neural Text Degeneration"☆12Apr 9, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆149Nov 9, 2024Updated last year
- ☆17Apr 9, 2025Updated last year
- The dataset consists of public social media url pairs and the corresponding entailment label for an external conference (ACL 2021). Each …☆14Aug 16, 2021Updated 4 years ago
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts☆18Mar 11, 2025Updated last year
- ☆15Aug 26, 2024Updated last year
- Survey of Learning To Rank☆15Nov 13, 2025Updated 5 months ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 5 months ago
- Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"☆19Mar 18, 2025Updated last year
- PySpark-based causal inference package.☆13Aug 20, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆15Apr 6, 2026Updated last month
- [ACCV 2024 ] Official code for "DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention"☆32Jan 8, 2025Updated last year
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- ☆21Mar 26, 2025Updated last year
- Toy O☆16Sep 21, 2024Updated last year
- AceParse: A Comprehensive Dataset with Diverse Structured Texts for Academic Literature Parsing☆44Sep 17, 2024Updated last year
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Sep 24, 2022Updated 3 years ago