Tool for converting LLMs from uni-directional to bi-directional by removing the causal mask, for tasks like classification and sentence embeddings. Compatible with 🤗 transformers.
★67 · Dec 12, 2024 · Updated last year
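As a rough illustration of what "removing the causal mask" means, the sketch below compares attention weights with and without a causal mask. It is a minimal pure-Python example and not BiLLM's API: the `attention_weights` helper and the toy `scores` matrix are hypothetical names introduced only for illustration.

```python
import math

def attention_weights(scores, causal):
    """Row-wise softmax over raw attention scores, optionally causally masked.

    Hypothetical helper for illustration; not part of BiLLM or transformers.
    """
    n = len(scores)
    weights = []
    for i in range(n):
        # With a causal mask, position i may only attend to positions j <= i.
        row = [scores[i][j] if (not causal or j <= i) else -math.inf
               for j in range(n)]
        peak = max(row)  # always finite, because j == i is never masked
        exps = [math.exp(x - peak) for x in row]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights

# Toy scores: every token pair is equally relevant.
scores = [[0.0] * 3 for _ in range(3)]

causal = attention_weights(scores, causal=True)
bidirectional = attention_weights(scores, causal=False)

print(causal[0])         # the first token attends only to itself
print(bidirectional[0])  # the first token attends to all three tokens
```

With the mask in place, earlier tokens never see later ones, which suits left-to-right generation. Dropping the mask lets every token use the full context, which is the property that classification and sentence-embedding tasks tend to benefit from.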
Alternatives and similar repositories for BiLLM
Users that are interested in BiLLM are comparing it to the libraries listed below.
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning · ★152 · Mar 17, 2024 · Updated 2 years ago
- ★20 · Apr 8, 2025 · Updated last year
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs" · ★20 · Mar 31, 2025 · Updated last year
- code for piccolo embedding model from SenseTime · ★145 · May 21, 2024 · Updated 2 years ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. · ★594 · Updated this week
- Leveraging passage embeddings for efficient listwise reranking with large language models. · ★51 · Dec 7, 2024 · Updated last year
- ★11 · Nov 21, 2024 · Updated last year
- Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard · ★569 · Mar 22, 2026 · Updated last month
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning · ★63 · Aug 2, 2024 · Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included · ★17 · Oct 2, 2024 · Updated last year
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code) · ★17 · May 15, 2025 · Updated last year
- [Findings of EMNLP 2022] Holistic Sentence Embeddings for Better Out-of-Distribution Detection · ★18 · Jun 14, 2023 · Updated 2 years ago
- ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture · ★28 · Feb 3, 2026 · Updated 3 months ago
- Cascade bert+word vec and one layer FLAT, trained by adversarial FGM and Stochastic Weight Averaging · ★23 · Nov 4, 2021 · Updated 4 years ago
- Model implementation for the contextual embeddings project · ★47 · Jun 2, 2025 · Updated 11 months ago
- ★63 · Jul 21, 2024 · Updated last year
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP · ★10 · Oct 27, 2023 · Updated 2 years ago
- ★63 · Jan 26, 2025 · Updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback · ★40 · Aug 14, 2023 · Updated 2 years ago
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking · ★13 · Feb 5, 2023 · Updated 3 years ago
- Chinese maritime large language model collection (Zh-LLM) · ★23 · Dec 18, 2023 · Updated 2 years ago
- Parameter-efficient Fine Tuning for Clinical LLMs · ★17 · Apr 23, 2024 · Updated 2 years ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning" · ★11 · Jan 16, 2021 · Updated 5 years ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting · ★27 · Oct 19, 2025 · Updated 7 months ago
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders' · ★1,689 · Apr 4, 2026 · Updated last month
- ★25 · Oct 20, 2022 · Updated 3 years ago
- ★13 · Feb 17, 2025 · Updated last year
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025. · ★737 · May 2, 2026 · Updated 2 weeks ago
- ★44 · Apr 22, 2025 · Updated last year
- CCKS 2020: Entity linking task for Chinese short text · ★43 · Mar 27, 2021 · Updated 5 years ago
- A collection of scripts for retrieving, storing, and querying SureChEMBL data. · ★42 · Jul 11, 2024 · Updated last year
- Unified Learned Sparse Retrieval Framework · ★68 · May 13, 2024 · Updated 2 years ago
- Code for embedding and retrieval research. · ★16 · Oct 24, 2023 · Updated 2 years ago
- Generative Representational Instruction Tuning · ★691 · Jun 25, 2025 · Updated 10 months ago
- LLM for NER · ★82 · Jul 29, 2024 · Updated last year
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation" · ★15 · Aug 26, 2025 · Updated 8 months ago
- Datasets and Evaluation Scripts for CompHRDoc · ★59 · Feb 25, 2025 · Updated last year
- ★13 · Mar 22, 2023 · Updated 3 years ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark · ★165 · Mar 29, 2026 · Updated last month