OpenNLG / OpenBA-v2
OpenBA-V2: a 3B-parameter LLM (large language model) with a T5 architecture, built by applying model pruning to OpenBA-15B and continuing pretraining from it.
☆23 Updated 6 months ago
Related projects
Alternatives and complementary repositories for OpenBA-v2
- L-CITEEVAL: Do Long-Context Models Truly Leverage Context for Responding? ☆17 Updated 3 weeks ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs. ☆47 Updated 2 weeks ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆53 Updated 3 months ago
- Towards Systematic Measurement for Long Text Quality ☆28 Updated 2 months ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection" ☆16 Updated 5 months ago
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration ☆32 Updated 8 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning ☆38 Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models ☆72 Updated 8 months ago
- BeHonest: Benchmarking Honesty in Large Language Models ☆29 Updated 2 months ago
- SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration ☆22 Updated last month
- Code and data for the ACL 2023 Findings paper "Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning" ☆15 Updated 8 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆83 Updated 4 months ago
- Counting-Stars (★) ☆76 Updated 2 months ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts ☆34 Updated last month
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". ☆36 Updated this week
- Code for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings) ☆33 Updated 11 months ago
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024) ☆13 Updated 5 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" ☆59 Updated 8 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆37 Updated 4 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings ☆143 Updated 4 months ago
- Self-adaptive in-context learning ☆41 Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆96 Updated 7 months ago