WhereIsAI / BiLLM
Tool for converting LLMs from uni-directional to bi-directional by removing the causal mask, for tasks like classification and sentence embeddings. Compatible with 🤗 transformers.
☆57 Updated 2 months ago
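As a rough illustration of what "removing the causal mask" means, the toy sketch below computes attention weights with and without the mask. It does not use BiLLM's API; the function `attention_weights` and the toy `scores` matrix are hypothetical names chosen for this example only.

```python
import math

def attention_weights(scores, causal):
    """Row-wise softmax over a square score matrix, optionally causally masked."""
    n = len(scores)
    weights = []
    for i in range(n):
        # Under a causal mask, token i may only attend to positions 0..i.
        # Without it, every position is visible (bi-directional attention).
        visible = [j for j in range(n) if (not causal) or j <= i]
        m = max(scores[i][j] for j in visible)  # subtract max for stability
        exps = {j: math.exp(scores[i][j] - m) for j in visible}
        total = sum(exps.values())
        weights.append([exps.get(j, 0.0) / total for j in range(n)])
    return weights

# Hypothetical toy input: three tokens with uniform attention scores.
scores = [[0.0] * 3 for _ in range(3)]
causal = attention_weights(scores, causal=True)
bidirectional = attention_weights(scores, causal=False)
```

With the mask, the first token puts all of its weight on itself; without it, the same token spreads its weight evenly over all three positions, so its representation can draw on later context.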
Alternatives and similar repositories for BiLLM:
Users that are interested in BiLLM are comparing it to the libraries listed below
- Rethinking Negative Instances for Generative Named Entity Recognition ☆49 Updated 11 months ago
- ☆30 Updated last year
- ☆21 Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆141 Updated 5 months ago
- Code and dataset for the EMNLP paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction ☆49 Updated last year
- This is a meta-model distilled from LLMs for information extraction. This is an intermediate checkpoint that can be well-transferred to a… ☆24 Updated 3 months ago
- Dual Cross Encoder for Dense Retrieval ☆16 Updated last year
- A framework for editing the CoTs for better factuality ☆48 Updated last year
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context" ☆66 Updated 6 months ago
- EMNLP'2023 (Findings): Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples! ☆41 Updated 10 months ago
- [NeurIPS 2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory ☆58 Updated last year
- ☆95 Updated 4 months ago
- ☆30 Updated last year
- Leveraging passage embeddings for efficient listwise reranking with large language models. ☆36 Updated 2 months ago
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models ☆14 Updated 3 months ago
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering" ☆38 Updated 4 months ago
- Towards Systematic Measurement for Long Text Quality ☆31 Updated 5 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆72 Updated 2 weeks ago
- ☆34 Updated 8 months ago
- A toolkit for building dense retrievers with deep language models. ☆57 Updated 3 years ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆40 Updated 3 months ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024) ☆47 Updated 10 months ago
- ☆7 Updated last year
- Code and Data Repo for [ACL 2023] Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)" ☆53 Updated last year
- ☆14 Updated 8 months ago
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models ☆95 Updated 2 months ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve… ☆33 Updated 2 months ago
- Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking ☆67 Updated 2 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation" ☆79 Updated last year
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples ☆75 Updated 2 years ago