Tool for converting LLMs from uni-directional to bi-directional by removing the causal mask, for tasks like classification and sentence embeddings. Compatible with 🤗 transformers.
☆63 · Dec 12, 2024 · Updated last year
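As a rough illustration of what "removing the causal mask" means, the sketch below computes attention weights for a toy sequence with and without the mask. It is not BiLLM's code or API: the function `attention_weights` and the toy scores are hypothetical, written in plain Python only to show the general idea.

```python
import math

def attention_weights(scores, causal):
    """Row-wise softmax over raw attention scores, optionally causally masked.

    Hypothetical helper for illustration; not part of BiLLM or transformers.
    """
    n = len(scores)
    weights = []
    for i in range(n):
        # With a causal mask, position i may only attend to positions j <= i.
        allowed = [j for j in range(n) if (not causal) or j <= i]
        row_max = max(scores[i][j] for j in allowed)
        exps = {j: math.exp(scores[i][j] - row_max) for j in allowed}
        total = sum(exps.values())
        # Masked-out positions receive zero weight.
        weights.append([exps.get(j, 0.0) / total for j in range(n)])
    return weights

# Toy 3-token example with uniform raw scores (made-up values).
scores = [[0.0] * 3 for _ in range(3)]
uni = attention_weights(scores, causal=True)   # uni-directional (decoder-style)
bi = attention_weights(scores, causal=False)   # bi-directional (mask removed)

for name, w in (("causal", uni), ("bidirectional", bi)):
    print(name)
    for row in w:
        print("  ", [round(x, 3) for x in row])
```

With the mask on, the first token can only attend to itself; with it removed, every token attends to the whole sequence, which is the property that classification and sentence-embedding tasks tend to benefit from.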
Alternatives and similar repositories for BiLLM
Users who are interested in BiLLM are comparing it to the libraries listed below.
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning ☆152 · Mar 17, 2024 · Updated last year
- Hugging Face RoBERTa with Flash Attention 2 ☆24 · Sep 14, 2025 · Updated 5 months ago
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included ☆17 · Oct 2, 2024 · Updated last year
- code for piccolo embedding model from SenseTime ☆144 · May 21, 2024 · Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆64 · Aug 2, 2024 · Updated last year
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs" ☆20 · Mar 31, 2025 · Updated 11 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models. ☆50 · Dec 7, 2024 · Updated last year
- ☆57 · Jan 26, 2025 · Updated last year
- Code for Robust Fine-tuning (RbFT) ☆17 · Jan 31, 2025 · Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. ☆579 · Updated this week
- coded with and corrected by Google Anti-Gravity ☆13 · Nov 23, 2025 · Updated 3 months ago
- Model implementation for the contextual embeddings project ☆40 · Jun 2, 2025 · Updated 8 months ago
- Code for KaLM-Embedding models ☆114 · Jun 30, 2025 · Updated 8 months ago
- Source code for SummaReranker (ACL 2022) ☆25 · Jan 7, 2024 · Updated 2 years ago
- Label shift estimation for transfer difficulty with Familiarity. ☆10 · Feb 4, 2025 · Updated last year
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning" ☆11 · Jan 16, 2021 · Updated 5 years ago
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP ☆10 · Oct 27, 2023 · Updated 2 years ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting ☆27 · Oct 19, 2025 · Updated 4 months ago
- ☆26 · May 11, 2025 · Updated 9 months ago
- Cascade bert+word vec and one layer FLAT, trained by adversarial FGM and Stochastic Weight Averaging ☆23 · Nov 4, 2021 · Updated 4 years ago
- A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using … ☆56 · Nov 14, 2025 · Updated 3 months ago
- Code for the MTEB leaderboard ☆30 · Feb 4, 2025 · Updated last year
- ☆11 · Nov 21, 2024 · Updated last year
- Generative Reranker PyTerrier ☆18 · Dec 1, 2025 · Updated 2 months ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code) ☆17 · May 15, 2025 · Updated 9 months ago
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation ☆15 · Apr 23, 2025 · Updated 10 months ago
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning" ☆28 · Sep 25, 2023 · Updated 2 years ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval ☆63 · Updated this week
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking ☆13 · Feb 5, 2023 · Updated 3 years ago
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration ☆15 · Jun 4, 2024 · Updated last year
- ☆34 · Jan 19, 2026 · Updated last month
- Machine translated multilingual STS benchmark dataset. ☆33 · Dec 21, 2023 · Updated 2 years ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica… ☆15 · Sep 4, 2025 · Updated 5 months ago
- ☆12 · Aug 21, 2024 · Updated last year
- ☆12 · Nov 5, 2024 · Updated last year
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models. ☆19 · Feb 6, 2025 · Updated last year
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang ☆61 · Nov 8, 2024 · Updated last year
- Finetune mistral-7b-instruct for sentence embeddings ☆88 · May 2, 2024 · Updated last year
- Code for embedding and retrieval research. ☆16 · Oct 24, 2023 · Updated 2 years ago