A benchmark for testing memorization abilities of LMs
☆24Oct 15, 2024Updated last year
Alternatives and similar repositories for ForgettingCurve
Users that are interested in ForgettingCurve are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation☆28Jun 30, 2025Updated 10 months ago
- Self-hosted AI assistant with tool use, multi-agent orchestration, coding copilot and a lightweight Flask + vanilla JS stack.☆118May 9, 2026Updated last week
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆47Aug 7, 2025Updated 9 months ago
- The implementation of "Shallow-to-Deep Training for Neural Machine Translation"☆10Oct 26, 2020Updated 5 years ago
- [EMNLP 2022] This is the code repo for our EMNLP‘22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…☆13Oct 20, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆42Feb 8, 2026Updated 3 months ago
- HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization☆17May 29, 2025Updated 11 months ago
- The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction"☆12Nov 25, 2023Updated 2 years ago
- ☆31Feb 10, 2025Updated last year
- Official repository for the paper "COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis".☆17Feb 19, 2025Updated last year
- ☆17Apr 9, 2025Updated last year
- This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc…☆28Mar 2, 2025Updated last year
- This repository contains the source code for the Saving 77% of the Parameters in Large Language Models Technical Report☆57Dec 2, 2025Updated 5 months ago
- Code for Parametric RAG, SIGIR 2025 Full Paper☆230May 1, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆23Jan 16, 2025Updated last year
- [COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs☆65Mar 9, 2026Updated 2 months ago
- [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".☆205Jun 18, 2025Updated 11 months ago
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- ☆78Nov 5, 2024Updated last year
- ☆57Dec 27, 2025Updated 4 months ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆43Oct 31, 2025Updated 6 months ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.☆16May 30, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICML 2025] EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning☆16May 24, 2025Updated 11 months ago
- ☆29Jan 23, 2024Updated 2 years ago
- High-performance tokenized language data-loader for Python C++ extension☆15Jul 22, 2024Updated last year
- Pictures of Mahiro and Mihari Oyama and a basic webserver written in JS (targeting Bun) to serve them☆11Aug 10, 2024Updated last year
- Federated Transformer (NeurIPS 24): a framework to enhance the performance of multi-party Vertical Federated Learning involving fuzzy ide…☆44Dec 14, 2024Updated last year
- Fork of HyenaDNA, a long-range genomic foundation model built with Hyena☆10Aug 14, 2023Updated 2 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- ☆16Feb 4, 2025Updated last year
- ☆52Jan 28, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ECAI 2025☆20May 4, 2026Updated 2 weeks ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated last year
- [ICCV 2023] Code for "Multi-task View Synthesis with Neural Radiance Fields"☆11Oct 2, 2023Updated 2 years ago
- ☆10Jun 12, 2019Updated 6 years ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated 2 years ago
- Semantic emoji finder. Python/dash UI. Uses sentence transformer embeddings and duckdb☆19Sep 15, 2025Updated 8 months ago