Tool for converting LLMs from uni-directional to bi-directional by removing the causal mask, for tasks like classification and sentence embeddings. Compatible with 🤗 transformers.
☆65 · Dec 12, 2024 · Updated last year
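As a rough illustration of what removing the causal mask means, the toy sketch below computes attention weights with and without the mask. It does not use BiLLM's API; `attention_weights` and the `scores` matrix are hypothetical names made up for this example.

```python
import math

def attention_weights(scores, causal):
    """Row-wise softmax over scores, optionally hiding future positions."""
    n = len(scores)
    weights = []
    for i in range(n):
        # With a causal mask, token i may only attend to positions j <= i.
        # Without it, every position is visible (bi-directional attention).
        allowed = [j for j in range(n) if (not causal) or j <= i]
        m = max(scores[i][j] for j in allowed)
        exps = {j: math.exp(scores[i][j] - m) for j in allowed}
        total = sum(exps.values())
        weights.append([exps.get(j, 0.0) / total for j in range(n)])
    return weights

# Hypothetical 3-token sequence with uniform raw attention scores.
scores = [[0.0] * 3 for _ in range(3)]
causal = attention_weights(scores, causal=True)
bidirectional = attention_weights(scores, causal=False)
```

In the causal case the first token attends only to itself, while in the bi-directional case it spreads attention over all three tokens, which is why the latter is often preferred for classification and embedding tasks.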
Alternatives and similar repositories for BiLLM
Users that are interested in BiLLM are comparing it to the libraries listed below.
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning ☆152 · Mar 17, 2024 · Updated 2 years ago
- ☆20 · Apr 8, 2025 · Updated last year
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs" ☆20 · Mar 31, 2025 · Updated last year
- Hugging Face RoBERTa with Flash Attention 2 ☆24 · Sep 14, 2025 · Updated 7 months ago
- code for piccolo embedding model from SenseTime ☆144 · May 21, 2024 · Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. ☆592 · Updated this week
- Leveraging passage embeddings for efficient listwise reranking with large language models. ☆51 · Dec 7, 2024 · Updated last year
- ☆11 · Nov 21, 2024 · Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆63 · Aug 2, 2024 · Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included ☆17 · Oct 2, 2024 · Updated last year
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code) ☆17 · May 15, 2025 · Updated 11 months ago
- [Findings of EMNLP 2022] Holistic Sentence Embeddings for Better Out-of-Distribution Detection ☆18 · Jun 14, 2023 · Updated 2 years ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks". ☆227 · Apr 8, 2026 · Updated 3 weeks ago
- Cascade bert+word vec and one layer FLAT, trained by adversarial FGM and Stochastic Weight Averaging ☆23 · Nov 4, 2021 · Updated 4 years ago
- Model implementation for the contextual embeddings project ☆47 · Jun 2, 2025 · Updated 10 months ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning ☆14 · Oct 27, 2021 · Updated 4 years ago
- Easy modernBERT fine-tuning and multi-task learning ☆65 · Mar 13, 2026 · Updated last month
- ☆62 · Jan 26, 2025 · Updated last year
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration ☆15 · Jun 4, 2024 · Updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback ☆40 · Aug 14, 2023 · Updated 2 years ago
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking ☆13 · Feb 5, 2023 · Updated 3 years ago
- Source code for SummaReranker (ACL 2022) ☆24 · Jan 7, 2024 · Updated 2 years ago
- Zheng He (Zh-LLM), a Chinese maritime large language model ☆21 · Dec 18, 2023 · Updated 2 years ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting ☆27 · Oct 19, 2025 · Updated 6 months ago
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders' ☆1,686 · Apr 4, 2026 · Updated 3 weeks ago
- ☆25 · Oct 20, 2022 · Updated 3 years ago
- ☆13 · Feb 17, 2025 · Updated last year
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025. ☆734 · Jan 26, 2026 · Updated 3 months ago
- Streamlit UI to remove duplicate or near duplicate images ☆12 · Mar 25, 2023 · Updated 3 years ago
- ☆43 · Apr 22, 2025 · Updated last year
- CCKS 2020: Entity linking task for Chinese short texts ☆43 · Mar 27, 2021 · Updated 5 years ago
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning" ☆28 · Sep 25, 2023 · Updated 2 years ago
- A collection of scripts for retrieving, storing, and querying SureChEMBL data. ☆42 · Jul 11, 2024 · Updated last year
- Unified Learned Sparse Retrieval Framework ☆68 · May 13, 2024 · Updated last year
- Code for embedding and retrieval research. ☆16 · Oct 24, 2023 · Updated 2 years ago
- ☆14 · Jan 6, 2025 · Updated last year
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation" ☆15 · Aug 26, 2025 · Updated 8 months ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆165 · Mar 29, 2026 · Updated last month
- This is the github repository for the paper at NAACL 2024: Self-Improving for Zero-Shot Named Entity Recognition with Large Language Mode… ☆52 · Mar 17, 2024 · Updated 2 years ago