Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.
☆25 · Jan 23, 2024 · Updated 2 years ago
Alternatives and similar repositories for CaMeLS
Users that are interested in CaMeLS are comparing it to the libraries listed below.
- Code for "Unlearning Traces the Influential Training Data of Language Models" ☆13 · Jun 13, 2024 · Updated last year
- ☆13 · Mar 25, 2022 · Updated 3 years ago
- Official PyTorch implementation of “Flexible Dataset Distillation: Learn Labels Instead of Images” ☆41 · Oct 21, 2020 · Updated 5 years ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023). ☆26 · Aug 25, 2024 · Updated last year
- ☆23 · Nov 1, 2022 · Updated 3 years ago
- ☆16 · May 30, 2019 · Updated 6 years ago
- code for training and using chess embeddings models ☆13 · Jun 9, 2024 · Updated last year
- ☆19 · Mar 31, 2024 · Updated last year
- a clean blog ☆10 · Apr 20, 2020 · Updated 5 years ago
- Tensorflow implementation of deformable conv and pooling operations. ☆10 · Jul 17, 2017 · Updated 8 years ago
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024) ☆75 · Aug 3, 2024 · Updated last year
- ☆13 · May 9, 2024 · Updated last year
- JAX implementation of Kolmogorov Arnold Networks (KANs). ☆10 · May 7, 2024 · Updated last year
- ☆38 · Nov 4, 2024 · Updated last year
- Official code for the paper "Attention as a Hypernetwork" ☆55 · Feb 24, 2026 · Updated 3 weeks ago
- Official Pytorch Implementation of "Outlier-weighed Layerwise Sampling for LLM Fine-tuning" by Pengxiang Li, Lu Yin, Xiaowei Gao, Shiwei … ☆35 · Jun 3, 2025 · Updated 9 months ago
- Public Inflection Benchmarks ☆68 · Mar 6, 2024 · Updated 2 years ago
- ☆21 · Jul 28, 2022 · Updated 3 years ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22) ☆12 · Aug 28, 2023 · Updated 2 years ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals". ☆17 · Dec 15, 2021 · Updated 4 years ago
- Official implementation of Tabular Transfer Learning via Prompting LLMs (COLM 2024). ☆13 · Aug 6, 2024 · Updated last year
- Parkar and Kim et al.'s paper on "Can LLMs Select Important Instructions to Annotate?" ☆13 · Jul 4, 2024 · Updated last year
- [ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs ☆44 · Updated this week
- yet another anki app ☆14 · Sep 9, 2024 · Updated last year
- ☆14 · Feb 12, 2024 · Updated 2 years ago
- Neural Module Network for Reasoning over Text, ICLR 2020 ☆119 · Oct 6, 2020 · Updated 5 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks ☆13 · Jun 12, 2023 · Updated 2 years ago
- Official implementation for Sparse MetA-Tuning (SMAT) ☆17 · Jul 29, 2025 · Updated 7 months ago
- Create shortcuts to Homebrew formula app bundles ☆16 · May 6, 2024 · Updated last year
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou… ☆28 · May 3, 2025 · Updated 10 months ago
- Kinetics: Rethinking Test-Time Scaling Laws ☆86 · Jul 11, 2025 · Updated 8 months ago
- ☆25 · Oct 5, 2020 · Updated 5 years ago
- An implementation for dynamic conversation recommendation ☆17 · Apr 23, 2020 · Updated 5 years ago
- Standardizing environment infrastructure with Strands Agents: step, observe, reward. ☆43 · Mar 17, 2026 · Updated last week
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆19 · Mar 10, 2025 · Updated last year
- Implementation of the Paper "Goal-Driven Explainable Clustering via Language Descriptions" ☆40 · May 24, 2023 · Updated 2 years ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆33 · May 9, 2024 · Updated last year
- Code for ICML 2024 paper ☆35 · Sep 18, 2025 · Updated 6 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]