☆29Apr 7, 2024Updated 2 years ago
Alternatives and similar repositories for SLM
Users that are interested in SLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper: Rehearsal-free Continual Language Learning via Efficient Parameter Isolation☆13May 16, 2023Updated 3 years ago
- [EMNLP 2024 Findings] Unlocking Continual Learning Abilities in Language Models☆26Oct 8, 2024Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆40Jan 13, 2025Updated last year
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆57Jul 9, 2024Updated last year
- ☆12Apr 29, 2024Updated 2 years ago
- ☆14Oct 3, 2024Updated last year
- ☆205Jul 13, 2024Updated last year
- ✌[ICLR 2024] Class Incremental Learning via Likelihood Ratio Based Task Prediction☆31Oct 29, 2024Updated last year
- ☆33Feb 24, 2025Updated last year
- The Official Code of CPR (ICLR 2021)☆15Oct 14, 2021Updated 4 years ago
- Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models☆109Mar 5, 2024Updated 2 years ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆24Apr 30, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Must-read Papers on Large Language Model (LLM) Continual Learning☆150Nov 14, 2023Updated 2 years ago
- ☆24Jan 16, 2025Updated last year
- ☆18Dec 2, 2024Updated last year
- New Modeling The Background CodeBase☆15Jan 7, 2022Updated 4 years ago
- Code for NeurIPS 2021 paper "Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning".☆16Oct 18, 2021Updated 4 years ago
- A local search system implementation using Elasticsearch for Wikipedia data indexing and retrieval.☆14May 17, 2025Updated last year
- This is a public repository for:☆38Aug 11, 2021Updated 4 years ago
- Code for paper "MoE-Adapters" CVPR2024 and "MoE-Adapters++" TPAMI2025☆275Sep 18, 2025Updated 9 months ago
- ☆23Nov 29, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The implementation code for Uncertainty-based Continual Learning with Adaptive Regularization (Neurips 2019)☆37May 25, 2021Updated 5 years ago
- [ICLR 2022] Official Code Repository for "TRGP: TRUST REGION GRADIENT PROJECTION FOR CONTINUAL LEARNING"☆22Oct 5, 2022Updated 3 years ago
- External Knowledge Injection for CLIP-Based Class-Incremental Learning (ICCV 2025)☆60Nov 19, 2025Updated 7 months ago
- Code for MERL's ECCV 2022 paper on Cross-Modal Knowledge Transfer Without Task-Relevant Source Data☆11Jul 19, 2022Updated 3 years ago
- ☆11Nov 13, 2024Updated last year
- SPGCL: Mining Spatio-Temporal Relations via Self-Paced Graph Contrastive Learning☆14Feb 16, 2023Updated 3 years ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- The PyTorch re-implement of a branch-aware coronary centerline extraction in CT Angiography images. (paper: 'Branch-Aware Double DQN for …☆13Jul 25, 2025Updated 11 months ago
- Code for the paper "SizeShiftReg: a Regularization Method for Improving Size-Generalization in Graph Neural Networks"☆12Jan 17, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICML 2022] Generalizing to Evolving Domains with Latent Structure-Aware Sequential Autoencoder☆24Feb 25, 2024Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- ☆10Feb 6, 2025Updated last year
- NLP/ML面试各类资料链接 汇总(主要Github收集)☆11Mar 3, 2020Updated 6 years ago
- ☆16May 20, 2025Updated last year
- ☆13May 21, 2024Updated 2 years ago
- ☆19Jul 3, 2025Updated 11 months ago