"Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?"
☆39 · Nov 13, 2024 · Updated last year
Alternatives and similar repositories for PoolingAndAttn
Users that are interested in PoolingAndAttn are comparing it to the libraries listed below.
- The first dense retrieval model that can be prompted like an LM · ☆92 · May 8, 2025 · Updated last year
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025) · ☆23 · Jun 2, 2025 · Updated 11 months ago
- ☆21 · Apr 3, 2026 · Updated last month
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S… · ☆38 · Jul 19, 2024 · Updated last year
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork" · ☆33 · Nov 29, 2023 · Updated 2 years ago
- “Generate to Understand for Representation” · ☆14 · Apr 18, 2024 · Updated 2 years ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models · ☆127 · May 7, 2024 · Updated 2 years ago
- official repository for ListT5 · ☆49 · Nov 27, 2025 · Updated 6 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings. · ☆90 · Jul 21, 2024 · Updated last year
- Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan… · ☆14 · Mar 30, 2026 · Updated last month
- XmodelLM · ☆38 · Nov 19, 2024 · Updated last year
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le… · ☆14 · Jan 16, 2025 · Updated last year
- ☆23 · Oct 22, 2025 · Updated 7 months ago
- Mixture of Lora Experts · ☆11 · Apr 7, 2024 · Updated 2 years ago
- ☆19 · Oct 28, 2025 · Updated 7 months ago
- ☆11 · Oct 8, 2023 · Updated 2 years ago
- Code for the paper "Closing the Curious Case of Neural Text Degeneration" · ☆12 · Apr 9, 2025 · Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) · ☆149 · Nov 9, 2024 · Updated last year
- ☆17 · Apr 9, 2025 · Updated last year
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals · ☆11 · Jan 8, 2026 · Updated 4 months ago
- The dataset consists of public social media url pairs and the corresponding entailment label for an external conference (ACL 2021). Each … · ☆14 · Aug 16, 2021 · Updated 4 years ago
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts · ☆18 · Mar 11, 2025 · Updated last year
- Code for the paper "Cottention: Linear Transformers With Cosine Attention" · ☆20 · Nov 15, 2025 · Updated 6 months ago
- Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays · ☆20 · May 28, 2024 · Updated 2 years ago
- PySpark-based causal inference package. · ☆13 · Aug 20, 2021 · Updated 4 years ago
- ☆15 · Apr 6, 2026 · Updated last month
- Official implementation of the paper: "A deeper look at depth pruning of LLMs" · ☆15 · Jul 24, 2024 · Updated last year
- Graph Diffusion Policy Optimization · ☆43 · Mar 17, 2024 · Updated 2 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval · ☆19 · Sep 24, 2022 · Updated 3 years ago
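- A Hermes-based agent skill: automatically fetches papers from arXiv every day, uses AI to generate Chinese summaries and author affiliations, pushes them to Feishu, and provides a local static reading website. · ☆77 · May 23, 2026 · Updated last week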
- DALI Multi Agent System Framework · ☆42 · Mar 24, 2026 · Updated 2 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity". · ☆30 · Nov 12, 2024 · Updated last year
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models" · ☆18 · Oct 1, 2024 · Updated last year
- Code to reproduce results of our experiments using LoRe · ☆17 · Apr 8, 2026 · Updated last month
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning" · ☆21 · Oct 28, 2024 · Updated last year
- FinMTEB: Finance Massive Text Embedding Benchmark (EMNLP 2025 Main) · ☆55 · Nov 15, 2025 · Updated 6 months ago
- successor to RollingFunctions.jl · ☆10 · Aug 9, 2023 · Updated 2 years ago
- ☆56 · Nov 6, 2024 · Updated last year
- ☆16 · Apr 11, 2022 · Updated 4 years ago