OpenNLG/OpenBA-v2

OpenBA-V2: a 3B-parameter LLM (Large Language Model) with a T5 architecture, built by applying model pruning to OpenBA-15B and then continuing pretraining.

☆25

Alternatives and similar repositories for OpenBA-v2

Users who are interested in OpenBA-v2 are comparing it to the repositories listed below.

LCM-Lab / L-CITEEVAL
Evaluating the faithfulness of long-context language models
☆30 · Oct 21, 2024 · Updated last year
dropreg / efficient_alpaca
The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca
☆98 · Apr 5, 2023 · Updated 3 years ago
jordddan / Pruning-LLMs
The framework to prune LLMs to any size and any config.
☆94 · Mar 1, 2024 · Updated 2 years ago
ZetangForward / CMD-Context-aware-Model-self-Detoxification
CMD: a framework for Context-aware Model self-Detoxification (EMNLP2024 Long Paper)
☆17 · Feb 10, 2025 · Updated last year
LCM-Lab / Bridge_Gap_Diffusion
Diffusion Model Improvement Method
☆35 · Sep 4, 2023 · Updated 2 years ago
yyDing1 / GNER
[ACL 2024 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"
☆60 · Mar 20, 2024 · Updated 2 years ago
ProjectNUWA / StrokeNUWA
☆27 · Feb 18, 2024 · Updated 2 years ago
LCM-Lab / LongRM
Revealing and unlocking the context boundary of reward models
☆21 · May 10, 2026 · Updated 2 months ago
OpenNLG / OpenBA
☆95 · Oct 8, 2023 · Updated 2 years ago
LitterBrother-Xiao / Overview-of-Non-autoregressive-Applications
☆188 · Jul 22, 2024 · Updated last year
megagonlabs / holobench
🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…
☆12 · Feb 25, 2025 · Updated last year
ProjectNUWA / LayoutNUWA
☆152 · Jan 31, 2024 · Updated 2 years ago
jordddan / GameEval
Using conversational games to evaluate powerful LLMs
☆18 · Sep 3, 2023 · Updated 2 years ago
Coling2022-DePro / DePro
Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective
☆11 · Nov 16, 2022 · Updated 3 years ago
zhaochen0110 / Timo
Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)
☆26 · Oct 23, 2024 · Updated last year
WowCZ / LongMIT
LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets
☆43 · Sep 30, 2024 · Updated last year
LCM-Lab / LOOM-Eval
A comprehensive and efficient long-context model evaluation framework
☆31 · Feb 25, 2026 · Updated 4 months ago
Jikai0Wang / OPT-Tree
☆30 · May 24, 2025 · Updated last year
lijuntaopku / UFD
Code for Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model, IJCAI 2020
☆12 · Nov 26, 2020 · Updated 5 years ago
LCM-Lab / Elastic-Attention
Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers
☆23 · May 26, 2026 · Updated last month
mjy1111 / BAKE
This is the repository for our paper: Untying the Reversal Curse via Bidirectional Language Model Editing
☆11 · May 25, 2025 · Updated last year
amosproj / amos2022ws02-automotive-test-app
Android Automotive Testapp
☆13 · Feb 10, 2023 · Updated 3 years ago
zhengzx-nlp / REDER
[NeurIPS 2021] Duplex Sequence-to-Sequence Learning for Reversible Machine Translation
☆15 · Jun 7, 2022 · Updated 4 years ago
dropreg / R-Drop
☆880 · May 24, 2024 · Updated 2 years ago
zhaochen0110 / Cotempqa
Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)
☆31 · Jul 3, 2024 · Updated 2 years ago
princeton-pli / PruLong
Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"
☆48 · Jul 29, 2025 · Updated 11 months ago
wutaiqiang / LLM_KD_AKL
☆22 · Oct 22, 2024 · Updated last year
LLMkvsys / rethink-kv-compression
☆24 · Mar 7, 2025 · Updated last year
bigai-nlco / CREAM
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆22 · Oct 10, 2024 · Updated last year
RAIVNLab / MatFormer-OLMo
Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…
☆31 · Nov 14, 2023 · Updated 2 years ago
zhaochen0110 / LMLM
Code and data for "Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change" (EMNLP2022)
☆17 · Dec 8, 2022 · Updated 3 years ago
U-C4N / Deepseek-CoT
Deepseek-CoT
☆10 · Oct 6, 2024 · Updated last year
qtli / Papers-on-Dialogue-System
A Survey of Neural Dialogue Systems
☆19 · Dec 31, 2021 · Updated 4 years ago
chtmp223 / suri
Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]
☆27 · Oct 3, 2025 · Updated 9 months ago
yanlinf / UXSenti
Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019)
☆10 · Nov 4, 2019 · Updated 6 years ago
Leey21 / CipherBank
☆13 · Jun 13, 2025 · Updated last year
arnab-api / romba
Applies ROME and MEMIT on Mamba-S4 models
☆16 · Apr 5, 2024 · Updated 2 years ago
swtheing / WizardCoder_Instruct_Generator
Generate the WizardCoder Instruct from the CodeAlpaca
☆21 · Jun 27, 2023 · Updated 3 years ago
GATECH-EIC / LaCache
[ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
☆17 · Nov 4, 2025 · Updated 8 months ago