[ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"
☆60May 9, 2025Updated 10 months ago
Alternatives and similar repositories for spurious-forgetting
Users that are interested in spurious-forgetting are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP2022] Released code for paper "Distilling Causal Effect from Miscellaneous Other-Class for Continual Named Entity Recognition"☆22Feb 9, 2023Updated 3 years ago
- [ACL2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference☆24Dec 25, 2023Updated 2 years ago
- This is a curated list of "Continual Learning with Pretrained Models" research.☆19May 29, 2025Updated 10 months ago
- ☆18Aug 19, 2024Updated last year
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024.☆21Jun 17, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The official pytorch implementation of our proposed model MISSL (ICDE-24).☆13Dec 8, 2023Updated 2 years ago
- ☆19May 3, 2025Updated 10 months ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- The paper "Triple-shapelet Networks for Time Series Classification"☆15Feb 8, 2020Updated 6 years ago
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization☆16May 14, 2025Updated 10 months ago
- The code for paper "Convolutional Multi-timescale Echo State Network"☆13Jul 24, 2019Updated 6 years ago
- Code and resources for the NeurIPS 2025 Paper "BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset" by Zhiheng X…☆19Oct 14, 2025Updated 5 months ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆32Jul 11, 2022Updated 3 years ago
- This is an official pytorch implementation for paper "Temporal-Frequency Co-training for Time Series Semi-supervised Learning" (AAAI-23)…☆15May 17, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [NeurIPS 2023] "Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules"☆40Mar 16, 2024Updated 2 years ago
- Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication☆21Mar 21, 2024Updated 2 years ago
- This project contains the necessary files to reproduce the paper: "Explaining Character-Aware Neural Networks for Word-Level Prediction: …☆12Nov 15, 2018Updated 7 years ago
- This is an official pytorch implementation for paper "Scale-teaching: Robust Multi-scale Training for Time Series Classification with Noi…☆16Nov 3, 2023Updated 2 years ago
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 5 months ago
- ☆14Jun 8, 2018Updated 7 years ago
- A method of ensemble learning for heterogeneous large language models.☆64Aug 7, 2024Updated last year
- [AAAI 2024] SiMA-Hand: Boosting 3D Hand-Mesh Reconstruction by Single-to-Multi-view Adaptation, Pytorch implementation.☆11Feb 6, 2024Updated 2 years ago
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆37Feb 22, 2025Updated last year
- Code for the paper "Spectrum Guided Topology Augmentation for Graph Contrastive Learning"☆11Jul 18, 2023Updated 2 years ago
- Official PyTorch implementation of our ECCV2024 paper “Rethinking Few-shot Class-incremental Learning: Learning from Yourself”☆21Jan 12, 2025Updated last year
- Web app created to collect audios for course project☆10Apr 6, 2018Updated 7 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- 本项目提供了基 于910B的huggingface LLM模型的Tensor Parallel(TP)部署教程,同时也可以作为一份极简的TP学习代码。☆32Jan 6, 2026Updated 2 months ago
- Due to the huge vocaburary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this projec…☆35Jan 6, 2026Updated 2 months ago
- [CSUR 2025] Continual Learning of Large Language Models: A Comprehensive Survey☆534Dec 23, 2025Updated 3 months ago
- [ICLR 25] A novel framework for building intrinsically interpretable LLMs with human-understandable concepts to ensure safety, reliabilit…☆31Feb 5, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [CVPR'23] Instance-specific and Model-adaptive Supervision for Semi-supervised Semantic Segmentation☆41Nov 16, 2023Updated 2 years ago
- Official code for PLoP☆18Mar 6, 2026Updated 3 weeks ago
- direct preference optimization with only 1 model copy :)☆14Oct 2, 2023Updated 2 years ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆13Nov 1, 2025Updated 4 months ago
- Test-time preferenece optimization (ICML 2025).☆181May 8, 2025Updated 10 months ago
- The source code for the paper: Yirong Mao, Ruiping Wang, Shiguang Shan, Xilin Chen. COSONet: Compact Second-Order Network for Video Face …☆12Dec 27, 2018Updated 7 years ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆24Mar 6, 2026Updated 3 weeks ago