☆213Oct 10, 2022Updated 3 years ago
Alternatives and similar repositories for RHO-Loss
Users that are interested in RHO-Loss are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- A fast, effective data attribution method for neural networks in PyTorch☆233Nov 18, 2024Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data"☆96May 25, 2023Updated 2 years ago
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆13Jan 9, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆54Jan 18, 2023Updated 3 years ago
- Implementation of the paper: Selective_Backpropagation from paper Accelerating Deep Learning by Focusing on the Biggest Losers☆15Feb 2, 2020Updated 6 years ago
- ☆10Oct 20, 2023Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- Implementation of the spotlight: a method for discovering systematic errors in deep learning models☆11Oct 5, 2021Updated 4 years ago
- D-Adaptation for SGD, Adam and AdaGrad☆530Jan 22, 2025Updated last year
- Debiasing Through Data Attribution☆12May 23, 2024Updated last year
- Data Valuation on In-Context Examples (ACL23)☆24Jan 12, 2025Updated last year
- ☆21Mar 15, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 5 years ago
- Fast and simple stream processing of files in tar files, useful for deep learning, big data, and many other applications.☆135Dec 10, 2023Updated 2 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28May 2, 2022Updated 3 years ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection☆14Jun 22, 2023Updated 2 years ago
- Train very large language models in Jax.☆210Oct 21, 2023Updated 2 years ago
- A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, ect...) on a GPU. It makes use of auto-di…☆10Aug 28, 2018Updated 7 years ago
- ☆13Aug 20, 2021Updated 4 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Dec 27, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆464May 9, 2022Updated 3 years ago
- More interactive weak supervision with FlyingSquid☆316Sep 1, 2020Updated 5 years ago
- A simple Jax implementation of influence functions.☆20Apr 9, 2024Updated last year
- Implementation of AugMix (2020) in TensorFlow☆16May 27, 2022Updated 3 years ago
- Code release for "Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data".☆13Apr 11, 2022Updated 3 years ago
- Code for NeurIPS 2020 Paper --- Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks☆21Oct 24, 2022Updated 3 years ago
- ☆17May 19, 2023Updated 2 years ago
- ☆19Feb 25, 2024Updated 2 years ago
- A Survey of Dataset Refinement for Problems in Computer Vision Datasets☆34Sep 12, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A case study of efficient training of large language models using commodity hardware.☆68Aug 4, 2022Updated 3 years ago
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- atmaCup #11 の Public 4th / Private 5th Solution のリポジトリです。☆12Aug 3, 2021Updated 4 years ago
- This is a pytorch version for Non-local Neural Networks(onging)☆27May 18, 2019Updated 6 years ago
- DSIR large-scale data selection framework for language model training☆271Apr 7, 2024Updated last year
- AdaCat☆48Aug 4, 2022Updated 3 years ago
- ☆13Feb 14, 2022Updated 4 years ago