yurakuratov/hidden_capacity

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yurakuratov/hidden_capacity)

yurakuratov / hidden_capacity

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral)

☆35

Alternatives and similar repositories for hidden_capacity

Users that are interested in hidden_capacity are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Cognitive-AI-Systems / pogema-toolbox
View on GitHub
☆23Apr 17, 2026Updated 3 months ago
jina-ai / textbook
View on GitHub
distill chatGPT coding ability into small model (1b)
☆31Sep 7, 2023Updated 2 years ago
juliagusak / neural-ode-norm
View on GitHub
Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs"
☆16Apr 27, 2020Updated 6 years ago
v-gen-ai / Marchuk
View on GitHub
Global Weather Forecasting from Mid-Range to Subseasonal Scale
☆19Mar 26, 2026Updated 4 months ago
john-hewitt / implicit-ins
View on GitHub
Codebase for Instruction Following without Instruction Tuning
☆36Sep 24, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
juliagusak / neural-ode-metasolver
View on GitHub
Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561
☆25Mar 30, 2021Updated 5 years ago
BernhardLinz / zabbix-ldap-sync-bash
View on GitHub
Sync Zabbix User with Active Directory Group via LDAP with a pure Bash script
☆11Feb 13, 2024Updated 2 years ago
getao / icae
View on GitHub
The repo for In-context Autoencoder
☆174May 11, 2024Updated 2 years ago
WSNLP / uncertainty_transformers
View on GitHub
☆35May 30, 2022Updated 4 years ago
booydar / babilong
View on GitHub
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆251Jun 1, 2026Updated last month
AIRI-Institute / LLM-Microscope
View on GitHub
☆62Mar 3, 2025Updated last year
OpenNLPLab / HGRN2
View on GitHub
HGRN2: Gated Linear RNNs with State Expansion
☆58Aug 20, 2024Updated last year
RodkinIvan / associative-recurrent-memory-transformer
View on GitHub
[ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation
☆67Mar 12, 2026Updated 4 months ago
open-compass / ANAH
View on GitHub
[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO
☆66Apr 30, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tianyi-lab / R2-T2
View on GitHub
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19Mar 10, 2025Updated last year
AIRI-Institute / xland-minigrid-datasets
View on GitHub
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
☆14Jun 19, 2024Updated 2 years ago
shengliu66 / FractionalReason
View on GitHub
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17Jun 30, 2025Updated last year
sail-sg / SimLayerKV
View on GitHub
The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.
☆54Oct 18, 2024Updated last year
facebookresearch / SecureFLCompression
View on GitHub
Compression primitives for uplink compression in Federated Learning that are compatible with Secure Aggregation.
☆11Jul 27, 2022Updated 4 years ago
LMCache / lmcache-agent-trace
View on GitHub
Agent application/benchmark/workload traces should be placed here.
☆15Apr 13, 2026Updated 3 months ago
WLS04 / EOPD
View on GitHub
☆20May 17, 2026Updated 2 months ago
ZongqianLi / 500xCompressor
View on GitHub
[ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models
☆64Mar 9, 2026Updated 4 months ago
recursal / GoldFinch-paper
View on GitHub
GoldFinch and other hybrid transformer components
☆46Jul 20, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ahans30 / goldfish-loss
View on GitHub
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆98Nov 17, 2024Updated last year
qizhangli / Gradient-based-Jailbreak-Attacks
View on GitHub
Code for our NeurIPS 2024 paper Improved Generation of Adversarial Examples Against Safety-aligned LLMs
☆12Nov 7, 2024Updated last year
AlirezaMorsali / MLP-Attention
View on GitHub
☆17Dec 19, 2024Updated last year
FusionBrainLab / Guide-and-Rescale
View on GitHub
Official Implementation for "Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing"
☆55Sep 12, 2024Updated last year
thunlp / EREN
View on GitHub
Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1
☆14Mar 27, 2024Updated 2 years ago
Adaxry / Unified_Layer_Skipping
View on GitHub
☆15Apr 11, 2024Updated 2 years ago
OmniMMI / M4
View on GitHub
[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
☆19Apr 2, 2025Updated last year
FusionBrainLab / Vision_GRPO
View on GitHub
☆91Mar 5, 2025Updated last year
skolai / fewbit
View on GitHub
Compression schema for gradients of activations in backward pass
☆45Jul 26, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
googleinterns / localizing-paragraph-memorization
View on GitHub
☆15Feb 21, 2024Updated 2 years ago
mlbio-epfl / joint-inference
View on GitHub
[ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners
☆22Jun 6, 2025Updated last year
zhixuan-lin / forgetting-transformer
View on GitHub
[ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning
☆150Feb 25, 2026Updated 5 months ago
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated 2 years ago
erdavids / Hex_Map_Tutorial
View on GitHub
☆10Apr 28, 2020Updated 6 years ago
bazenkov / neuro-raai
View on GitHub
Materials for the course on neuromorphic computing at RAAI Summer School 2021
☆14Jul 5, 2022Updated 4 years ago
milas / rock5-talos
View on GitHub
[**DEPRECATED** - see link for replacement!] Friendly fork of Talos Linux for the Radxa Rock 5 SBCs
☆18May 31, 2024Updated 2 years ago