☆18Dec 2, 2024Updated last year
Alternatives and similar repositories for UltraGist
Users that are interested in UltraGist are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Oct 3, 2024Updated last year
- ☆36Oct 4, 2025Updated 8 months ago
- ☆12Apr 29, 2024Updated 2 years ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆153Dec 22, 2025Updated 5 months ago
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆29Jul 3, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The repo for In-context Autoencoder☆172May 11, 2024Updated 2 years ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆24Apr 30, 2025Updated last year
- KV cache compression via sparse coding☆18Oct 26, 2025Updated 7 months ago
- ☆24Jan 16, 2025Updated last year
- Official code and resources for the paper "EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation."☆24Dec 23, 2024Updated last year
- [COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs☆65Mar 9, 2026Updated 3 months ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆33Jun 10, 2024Updated 2 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated 2 years ago
- Neural ODE Transformers (ICLR 2025)☆21Sep 6, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…☆14May 4, 2024Updated 2 years ago
- ☆13Oct 19, 2023Updated 2 years ago
- [NeurIPS 2024] Advancing Training Efficiency of Deep Spiking Neural Networks through Rate-based Backpropagation☆20Jan 16, 2025Updated last year
- Executive Memory for Coherent Long-Horizon Reasoning!☆84Jan 14, 2026Updated 5 months ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 10 months ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆32Apr 8, 2024Updated 2 years ago
- [ICCV2025] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding☆60Apr 4, 2026Updated 2 months ago
- Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs☆16Feb 10, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Aug 4, 2024Updated last year
- Synthetic Alphabet Dataset☆19Mar 27, 2025Updated last year
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Oct 11, 2023Updated 2 years ago
- ☆12Apr 25, 2024Updated 2 years ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆55Jul 3, 2024Updated last year
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"☆450Oct 16, 2024Updated last year
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆254Sep 12, 2025Updated 9 months ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆134Jun 24, 2025Updated 11 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆170Jun 13, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆196Oct 8, 2024Updated last year
- some minitools for linux os that are program with python☆13Jun 20, 2017Updated 8 years ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Apr 25, 2026Updated last month
- KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion☆12Oct 21, 2021Updated 4 years ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Jun 25, 2024Updated last year
- ☆11Jun 7, 2023Updated 3 years ago
- ☆29Apr 7, 2024Updated 2 years ago