OpenMOSS/Thus-Spake-Long-Context-LLM

A survey of long-context LLMs from four perspectives: architecture, infrastructure, training, and evaluation

☆62

Alternatives and similar repositories for Thus-Spake-Long-Context-LLM

Users interested in Thus-Spake-Long-Context-LLM are comparing it to the repositories listed below.

tengxiaoliu / LM_skip
[NeurIPS 2024] Can Language Models Learn to Skip Steps?
☆21 · Jan 25, 2025 · Updated last year
OpenLMLab / scaling-rope
Code for Scaling Laws of RoPE-based Extrapolation
☆73 · Oct 16, 2023 · Updated 2 years ago
OpenLMLab / ParallelTokenizer
Use the tokenizer in parallel to achieve superior acceleration
☆20 · Mar 21, 2024 · Updated 2 years ago
OpenLMLab / LongWanjuan
Towards Systematic Measurement for Long Text Quality
☆39 · Sep 5, 2024 · Updated last year
open-nlplab / fastchatgpt
A Python tool to help interact with ChatGPT.
☆10 · Dec 11, 2022 · Updated 3 years ago
OpenMOSS / rope_pp
[ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs
☆33 · Dec 9, 2025 · Updated 7 months ago
OpenMOSS / CoLLiE
Collaborative Training of Large Language Models in an Efficient Way
☆419 · Aug 28, 2024 · Updated last year
KaiLv69 / DuoDecoding
DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting
☆19 · Mar 4, 2025 · Updated last year
OpenLMLab / Sniffer
☆27 · Jun 5, 2023 · Updated 3 years ago
OpenMOSS / claude-codex-handoff
Drop-in async file-based handoff protocol for two AI coding agents (Claude Code + Codex), installed as one shared .handoff/ in your proje…
☆30 · Jul 4, 2026 · Updated 2 weeks ago
xinghaow99 / pbs-attn
[ICML 2026] Sparser Block-Sparse Attention via Token Permutation
☆31 · May 22, 2026 · Updated 2 months ago
choosewhatulike / cluster-clip
Multi-GPU supported kmeans clustering for cluster-clip
☆15 · Jun 3, 2024 · Updated 2 years ago
OpenMOSS / LongLLaDA
[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
☆55 · Dec 7, 2025 · Updated 7 months ago
tengxiaoliu / RLET
[EMNLP 2022] RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees
☆11 · Jul 15, 2023 · Updated 3 years ago
HKUNLP / STRING
[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
☆82 · Nov 25, 2024 · Updated last year
KaiLv69 / UDR
ACL'23: Unified Demonstration Retriever for In-Context Learning
☆38 · Dec 2, 2023 · Updated 2 years ago
ayyyq / TARA
Code for [ACL23] An AMR-based Link Prediction Approach for Document-level Event Argument Extraction
☆24 · Oct 2, 2023 · Updated 2 years ago
GAIR-NLP / weak-to-strong-reasoning
☆59 · Sep 2, 2024 · Updated last year
OpenIXCLab / CODA
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning
☆37 · Aug 28, 2025 · Updated 10 months ago
GAIR-NLP / alignment-for-honesty
☆78 · May 22, 2024 · Updated 2 years ago
InternLM / Spark
An official implementation of "SPARK: Synergistic Policy And Reward Co-Evolving Framework"
☆25 · Oct 23, 2025 · Updated 9 months ago
OpenMOSS / Embodied-Planner-R1
Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning
☆27 · Mar 30, 2026 · Updated 3 months ago
OpenMOSS / SpeechGPT-2.0-preview
GPT-4o-level, real-time spoken dialogue system.
☆375 · Jan 27, 2025 · Updated last year
hmwang2002 / CTRL-S
[ECCV 2026] Official repository of "Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning".
☆22 · Updated this week
yegcjs / mixinglaws
☆113 · Jul 15, 2025 · Updated last year
OpenMOSS / HalluQA
Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"
☆139 · Jun 5, 2024 · Updated 2 years ago
open-nlplab / fastIE
Information Extraction related tools and models
☆10 · Mar 16, 2023 · Updated 3 years ago
Shark-NLP / CoNT
[NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation
☆152 · May 10, 2023 · Updated 3 years ago
Liuziyu77 / gene-skill
Gene-skill: Throw a few Skills into the “gene blender” and shake out a new Skill that gets more done.
☆59 · Apr 17, 2026 · Updated 3 months ago
OpenLMLab / LEval
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
☆406 · Jul 9, 2024 · Updated 2 years ago
OpenMOSS / Say-I-Dont-Know
[ICML'2024] Can AI Assistants Know What They Don't Know?
☆86 · Feb 5, 2024 · Updated 2 years ago
InternLM / InternBootcamp
Official implementation of InternBootCamp
☆348 · Updated this week
xinghaow99 / DenoSent
[AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
☆15 · Apr 29, 2024 · Updated 2 years ago
OpenMOSS / Sparse-dLLM
☆29 · Oct 16, 2025 · Updated 9 months ago
yhcc / utcie
This is the code repo for the paper "UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction"
☆15 · Aug 10, 2023 · Updated 2 years ago
pzs19 / LEMMA
☆16 · Sep 4, 2025 · Updated 10 months ago
mlpc-ucsd / BERT_Convolutions
(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.
☆21 · Jul 13, 2022 · Updated 4 years ago
InternLM / EndoCoT
[ECCV 2026] An official implementation of "EndoCoT". Scaling endogenous Chain-of-Thought (CoT) reasoning in diffusion models for complex …
☆43 · Jun 26, 2026 · Updated 3 weeks ago
artpli / CodeIE
[ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors
☆42 · Dec 14, 2025 · Updated 7 months ago