"Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?"
☆38Nov 13, 2024Updated last year
Alternatives and similar repositories for PoolingAndAttn
Users that are interested in PoolingAndAttn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The first dense retrieval model that can be prompted like an LM☆91May 8, 2025Updated 10 months ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆21Jun 2, 2025Updated 9 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆59Feb 10, 2025Updated last year
- ☆21Jul 21, 2025Updated 8 months ago
- ☆16Sep 1, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture☆213Jan 6, 2025Updated last year
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆38Jul 19, 2024Updated last year
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Nov 29, 2023Updated 2 years ago
- “Generate to Understand for Representation”☆14Apr 18, 2024Updated last year
- Simple program to manually caption your images (or any other file types) so you can use them for AI training☆37Mar 20, 2023Updated 3 years ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆126May 7, 2024Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆92Jul 21, 2024Updated last year
- ☆46Aug 25, 2024Updated last year
- Adversaial attack comparative assessment Large Language Model☆13May 21, 2025Updated 10 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- An Infr app that helps you replay & talk to everything you've ever seen.☆15Sep 19, 2023Updated 2 years ago
- XmodelLM☆38Nov 19, 2024Updated last year
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Jan 16, 2025Updated last year
- Mixture of Lora Experts☆10Apr 7, 2024Updated last year
- ☆19Oct 28, 2025Updated 5 months ago
- EnsLoss: Stochastic Calibrated Loss Ensembles for Preventing Overfitting in Classification☆34Nov 1, 2025Updated 4 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Mar 27, 2024Updated 2 years ago
- ☆11Oct 8, 2023Updated 2 years ago
- Code for the paper "Closing the Curious Case of Neural Text Degeneration"☆12Apr 9, 2025Updated 11 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆17Apr 9, 2025Updated 11 months ago
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals☆11Jan 8, 2026Updated 2 months ago
- The dataset consists of public social media url pairs and the corresponding entailment label for an external conference (ACL 2021). Each …☆14Aug 16, 2021Updated 4 years ago
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts☆17Mar 11, 2025Updated last year
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 4 months ago
- Danmuku dataset☆11Jul 7, 2023Updated 2 years ago
- PySpark-based causal inference package.☆13Aug 20, 2021Updated 4 years ago
- ☆15Jan 12, 2026Updated 2 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Graph Diffusion Policy Optimization☆42Mar 17, 2024Updated 2 years ago
- AceParse: A Comprehensive Dataset with Diverse Structured Texts for Academic Literature Parsing☆44Sep 17, 2024Updated last year
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- DALI Multi Agent System Framework☆42Jan 30, 2026Updated last month
- DEF CON Hacker Tracker☆14Jul 30, 2025Updated 8 months ago
- [NeurIPS 2024] RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models☆31Nov 12, 2024Updated last year
- This is the repository for "SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Recognition"☆16Oct 8, 2024Updated last year