Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embeddings. Compatible with π€ transformers.
β65Dec 12, 2024Updated last year
Alternatives and similar repositories for BiLLM
Users that are interested in BiLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β20Apr 8, 2025Updated last year
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"β20Mar 31, 2025Updated last year
- Hugging Face RoBERTa with Flash Attention 2β24Sep 14, 2025Updated 6 months ago
- code for piccolo embedding model from SenseTimeβ145May 21, 2024Updated last year
- Leveraging passage embeddings for efficient listwise reranking with large language models.β51Dec 7, 2024Updated last year
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- β11Nov 21, 2024Updated last year
- Train and Infer Powerful Sentence Embeddings with AnglE | π₯ SOTA on STS and MTEB Leaderboardβ567Mar 22, 2026Updated 2 weeks ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuningβ63Aug 2, 2024Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API includedβ17Oct 2, 2024Updated last year
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)β17May 15, 2025Updated 10 months ago
- ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architectureβ25Feb 3, 2026Updated 2 months ago
- Cascade bert+word vec and one layer FLAT, trained by adversarial FGM and Stochastic Weight Averagingβ23Nov 4, 2021Updated 4 years ago
- Model implementation for the contextual embeddings projectβ47Jun 2, 2025Updated 10 months ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learningβ14Oct 27, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLPβ10Oct 27, 2023Updated 2 years ago
- β60Jan 26, 2025Updated last year
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integrationβ15Jun 4, 2024Updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedbackβ40Aug 14, 2023Updated 2 years ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Promptingβ27Oct 19, 2025Updated 5 months ago
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'β1,672Apr 4, 2026Updated last week
- β13Feb 17, 2025Updated last year
- β25Oct 20, 2022Updated 3 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.β733Jan 26, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"β28Sep 25, 2023Updated 2 years ago
- A collection of scripts for retrieving, storing, and querying SureChEMBL data.β42Jul 11, 2024Updated last year
- Unified Learned Sparse Retrieval Frameworkβ68May 13, 2024Updated last year
- A python wrapper for API Offres d'emploi v2, the job offers API by Emploi store (Pole Emploi)β14Jun 7, 2022Updated 3 years ago
- Code for embedding and retrieval research.β16Oct 24, 2023Updated 2 years ago
- β14Jan 6, 2025Updated last year
- Generative Representational Instruction Tuningβ689Jun 25, 2025Updated 9 months ago
- LLM for NERβ81Jul 29, 2024Updated last year
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"β15Aug 26, 2025Updated 7 months ago
- NordVPN Special Discount Offer β’ AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- β13Mar 22, 2023Updated 3 years ago
- Convert Bert TF-checkpoint to Pytorchβ15Sep 23, 2020Updated 5 years ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmarkβ165Mar 29, 2026Updated last week
- This is the github repository for the paper at NAACL 2024: Self-Improving for Zero-Shot Named Entity Recognition with Large Language Modeβ¦β52Mar 17, 2024Updated 2 years ago
- β54Sep 11, 2024Updated last year
- SimXNS is a research project for information retrieval. This repo contains official implementations by MSRA NLC team.β116Jan 9, 2024Updated 2 years ago
- This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searcβ¦β27Mar 2, 2025Updated last year