☆214Oct 10, 2022Updated 3 years ago
Alternatives and similar repositories for RHO-Loss
Users that are interested in RHO-Loss are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- A fast, effective data attribution method for neural networks in PyTorch☆237Nov 18, 2024Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data"☆96May 25, 2023Updated 2 years ago
- [ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration"☆10Jul 30, 2024Updated last year
- ☆54Jan 18, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of the paper: Selective_Backpropagation from paper Accelerating Deep Learning by Focusing on the Biggest Losers☆15Feb 2, 2020Updated 6 years ago
- ☆10Oct 20, 2023Updated 2 years ago
- Implementation of the spotlight: a method for discovering systematic errors in deep learning models☆11Oct 5, 2021Updated 4 years ago
- D-Adaptation for SGD, Adam and AdaGrad☆532Jan 22, 2025Updated last year
- source code for ICLR'22 paper "VOS: Learning What You Don’t Know by Virtual Outlier Synthesis"☆323Oct 1, 2023Updated 2 years ago
- ☆21Mar 15, 2023Updated 3 years ago
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 5 years ago
- Fast and simple stream processing of files in tar files, useful for deep learning, big data, and many other applications.☆135Dec 10, 2023Updated 2 years ago
- A simple and efficient baseline for data attribution☆11Nov 10, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28May 2, 2022Updated 4 years ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection☆14Jun 22, 2023Updated 2 years ago
- Train very large language models in Jax.☆209Oct 21, 2023Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Dec 27, 2022Updated 3 years ago
- ☆13Aug 20, 2021Updated 4 years ago
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆463May 9, 2022Updated 4 years ago
- More interactive weak supervision with FlyingSquid☆315Sep 1, 2020Updated 5 years ago
- A simple Jax implementation of influence functions.☆20Apr 9, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code release for "Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data".☆13Apr 11, 2022Updated 4 years ago
- Code for NeurIPS 2020 Paper --- Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks☆21Oct 24, 2022Updated 3 years ago
- A Survey of Dataset Refinement for Problems in Computer Vision Datasets☆34Sep 12, 2025Updated 7 months ago
- ☆19Feb 25, 2024Updated 2 years ago
- trying to make WebGPU a bit easier to use☆19Jan 9, 2024Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆67Aug 4, 2022Updated 3 years ago
- Flux reconstruction fluid flow solver for 1D PDEs written in Julia. Linear advection, Burgers, viscous Burgers, and Euler equations.☆14Apr 28, 2022Updated 4 years ago
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- This is a pytorch version for Non-local Neural Networks(onging)☆27May 18, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DSIR large-scale data selection framework for language model training☆273Apr 7, 2024Updated 2 years ago
- AdaCat☆48Aug 4, 2022Updated 3 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆53Jun 5, 2024Updated last year
- ☆13Feb 14, 2022Updated 4 years ago
- Exploration of automated dataset selection approaches at large scales.☆54Mar 4, 2025Updated last year
- ☆32May 24, 2023Updated 2 years ago
- Visual Taste Approximator (VTA) is a very simple tool that helps anyone create an automatic replica of themselves that can approximate th…☆40Sep 11, 2022Updated 3 years ago