☆216Oct 10, 2022Updated 3 years ago
Alternatives and similar repositories for RHO-Loss
Users that are interested in RHO-Loss are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- A fast, effective data attribution method for neural networks in PyTorch☆241Nov 18, 2024Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data"☆96May 25, 2023Updated 3 years ago
- [ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration"☆10Jul 30, 2024Updated last year
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- Implementation of the paper: Selective_Backpropagation from paper Accelerating Deep Learning by Focusing on the Biggest Losers☆15Feb 2, 2020Updated 6 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- Implementation of the spotlight: a method for discovering systematic errors in deep learning models☆11Oct 5, 2021Updated 4 years ago
- D-Adaptation for SGD, Adam and AdaGrad☆531Jan 22, 2025Updated last year
- Debiasing Through Data Attribution☆13May 23, 2024Updated 2 years ago
- source code for ICLR'22 paper "VOS: Learning What You Don’t Know by Virtual Outlier Synthesis"☆325Oct 1, 2023Updated 2 years ago
- Data Valuation on In-Context Examples (ACL23)☆24Jan 12, 2025Updated last year
- ☆21Mar 15, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 6 years ago
- Fast and simple stream processing of files in tar files, useful for deep learning, big data, and many other applications.☆137Dec 10, 2023Updated 2 years ago
- A simple and efficient baseline for data attribution☆11Nov 10, 2023Updated 2 years ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 3 years ago
- Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection☆14Jun 22, 2023Updated 2 years ago
- Train very large language models in Jax.☆208Oct 21, 2023Updated 2 years ago
- ☆13Aug 20, 2021Updated 4 years ago
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆463May 9, 2022Updated 4 years ago
- A simple Jax implementation of influence functions.☆21Apr 9, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code release for "Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data".☆13Apr 11, 2022Updated 4 years ago
- Code for NeurIPS 2020 Paper --- Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks☆21Oct 24, 2022Updated 3 years ago
- A Survey of Dataset Refinement for Problems in Computer Vision Datasets☆34Sep 12, 2025Updated 9 months ago
- ☆19Feb 25, 2024Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆67Aug 4, 2022Updated 3 years ago
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- atmaCup #11 の Public 4th / Private 5th Solution のリポジトリです。☆12Aug 3, 2021Updated 4 years ago
- This is a pytorch version for Non-local Neural Networks(onging)☆27May 18, 2019Updated 7 years ago
- AdaCat☆48Aug 4, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- DSIR large-scale data selection framework for language model training☆274Apr 7, 2024Updated 2 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆54Jun 5, 2024Updated 2 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss☆12Oct 10, 2022Updated 3 years ago
- ☆43Oct 13, 2023Updated 2 years ago
- Exploration of automated dataset selection approaches at large scales.☆55Mar 4, 2025Updated last year
- ☆32May 24, 2023Updated 3 years ago