☆215Oct 10, 2022Updated 3 years ago
Alternatives and similar repositories for RHO-Loss
Users that are interested in RHO-Loss are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- A fast, effective data attribution method for neural networks in PyTorch☆240Nov 18, 2024Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data"☆96May 25, 2023Updated 3 years ago
- [ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration"☆10Jul 30, 2024Updated last year
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- Implementation of the paper: Selective_Backpropagation from paper Accelerating Deep Learning by Focusing on the Biggest Losers☆15Feb 2, 2020Updated 6 years ago
- ☆10May 21, 2026Updated last week
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- D-Adaptation for SGD, Adam and AdaGrad☆532Jan 22, 2025Updated last year
- Debiasing Through Data Attribution☆13May 23, 2024Updated 2 years ago
- source code for ICLR'22 paper "VOS: Learning What You Don’t Know by Virtual Outlier Synthesis"☆325Oct 1, 2023Updated 2 years ago
- Data Valuation on In-Context Examples (ACL23)☆24Jan 12, 2025Updated last year
- ☆21Mar 15, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 5 years ago
- Fast and simple stream processing of files in tar files, useful for deep learning, big data, and many other applications.☆136Dec 10, 2023Updated 2 years ago
- A simple and efficient baseline for data attribution☆11Nov 10, 2023Updated 2 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28May 2, 2022Updated 4 years ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 3 years ago
- Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection☆14Jun 22, 2023Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆40Dec 27, 2022Updated 3 years ago
- ☆13Aug 20, 2021Updated 4 years ago
- More interactive weak supervision with FlyingSquid☆315Sep 1, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A simple Jax implementation of influence functions.☆20Apr 9, 2024Updated 2 years ago
- Implementation of AugMix (2020) in TensorFlow☆16May 27, 2022Updated 4 years ago
- Code for NeurIPS 2020 Paper --- Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks☆21Oct 24, 2022Updated 3 years ago