WhereIsAI / BiLLMLinks
Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embeddings. Compatible with π€ transformers.
β59Updated 6 months ago
Alternatives and similar repositories for BiLLM
Users that are interested in BiLLM are comparing it to the libraries listed below
Sorting:
- β46Updated 4 months ago
- Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extractionβ52Updated last year
- [ACL-24 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"β52Updated last year
- Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Rerankingβ68Updated 2 years ago
- β21Updated 2 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"β101Updated 2 years ago
- This is a meta-model distilled from LLMs for information extraction. This is an intermediate checkpoint that can be well-transferred to aβ¦β27Updated 3 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.β44Updated 6 months ago
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"β53Updated last year
- Finetune mistral-7b-instruct for sentence embeddingsβ83Updated last year
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samplesβ75Updated 2 years ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrievalβ50Updated this week
- Source code for ACL 2023 paper Decoder Tuning: Efο¬cient Language Understanding as Decodingβ50Updated last year
- A toolkit for building dense retrievers with deep language models.β60Updated 3 years ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"β56Updated 2 years ago
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Modelsβ18Updated 7 months ago
- β35Updated last year
- β33Updated last year
- β45Updated 3 years ago
- β7Updated 2 years ago
- Code for ACL2023 paper: Pre-Training to Learn in Contextβ108Updated 10 months ago
- Collections of IR Researchβ35Updated last month
- β68Updated 2 years ago
- Unofficial implementation of AlpaGasusβ91Updated last year
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.β90Updated 3 months ago
- Official Code for "PPT: Pre-trained Prompt Tuning for Few-shot Learning". ACL 2022β108Updated 2 years ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memoryβ59Updated 2 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"β80Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Modelsβ40Updated last year
- β35Updated last year