A benchmark for testing memorization abilities of LMs
☆24Oct 15, 2024Updated last year
Alternatives and similar repositories for ForgettingCurve
Users that are interested in ForgettingCurve are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation☆30Jun 30, 2025Updated last year
- An introduction to basic concepts of Transformers and key techniques of their recent advances.☆52Dec 21, 2023Updated 2 years ago
- A list of conferences and journals relevant to machine translation☆33Mar 17, 2022Updated 4 years ago
- [EMNLP 2022] This is the code repo for our EMNLP‘22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…☆13Oct 20, 2022Updated 3 years ago
- This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".☆24Oct 28, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆45Jun 4, 2026Updated 3 weeks ago
- HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization☆18May 29, 2025Updated last year
- The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction"☆12Nov 25, 2023Updated 2 years ago
- ☆33Feb 10, 2025Updated last year
- Official repository for the paper "COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis".☆18Feb 19, 2025Updated last year
- ☆17Apr 9, 2025Updated last year
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc…☆27Mar 2, 2025Updated last year
- Code for Parametric RAG, SIGIR 2025 Full Paper☆234May 1, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆24Jan 16, 2025Updated last year
- [COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs☆65Mar 9, 2026Updated 3 months ago
- [EMNLP '25] Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"☆40Aug 13, 2025Updated 10 months ago
- [ICML 2026] The official implementation of paper "Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation…☆80May 25, 2026Updated last month
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)☆19Jun 12, 2024Updated 2 years ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆43Oct 31, 2025Updated 8 months ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Feb 13, 2025Updated last year
- A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.☆17May 30, 2025Updated last year
- [ICML 2025] EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning☆16May 24, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆29Jan 23, 2024Updated 2 years ago
- Federated Transformer (NeurIPS 24): a framework to enhance the performance of multi-party Vertical Federated Learning involving fuzzy ide…☆44Dec 14, 2024Updated last year
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆122Jun 18, 2025Updated last year
- ☆17Feb 4, 2025Updated last year
- ECAI 2025☆20May 4, 2026Updated last month
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated last year
- ☆10Jun 12, 2019Updated 7 years ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆15Jul 24, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated 2 years ago
- ☆10Dec 18, 2023Updated 2 years ago
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆38Jul 19, 2024Updated last year
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆293Sep 25, 2025Updated 9 months ago
- ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation☆58Feb 2, 2026Updated 4 months ago
- [EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs☆61Aug 25, 2025Updated 10 months ago
- This code repository is for the accepted ACL2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and…☆44Sep 16, 2022Updated 3 years ago