☆214 · Oct 10, 2022 · Updated 3 years ago
Alternatives and similar repositories for RHO-Loss
Users that are interested in RHO-Loss are comparing it to the libraries listed below.
- A fast, effective data attribution method for neural networks in PyTorch ☆236 · Nov 18, 2024 · Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data" ☆96 · May 25, 2023 · Updated 2 years ago
- [ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration" ☆10 · Jul 30, 2024 · Updated last year
- Pretraining summarization models using a corpus of nonsense ☆13 · Sep 28, 2021 · Updated 4 years ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library. ☆14 · Jan 9, 2024 · Updated 2 years ago
- ☆54 · Jan 18, 2023 · Updated 3 years ago
- ☆10 · Oct 20, 2023 · Updated 2 years ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆51 · Jan 20, 2024 · Updated 2 years ago
- Implementation of the spotlight: a method for discovering systematic errors in deep learning models ☆11 · Oct 5, 2021 · Updated 4 years ago
- D-Adaptation for SGD, Adam and AdaGrad ☆531 · Jan 22, 2025 · Updated last year
- Debiasing Through Data Attribution ☆13 · May 23, 2024 · Updated last year
- source code for ICLR'22 paper "VOS: Learning What You Don’t Know by Virtual Outlier Synthesis" ☆323 · Oct 1, 2023 · Updated 2 years ago
- Data Valuation on In-Context Examples (ACL23) ☆24 · Jan 12, 2025 · Updated last year
- ☆21 · Mar 15, 2023 · Updated 3 years ago
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020) ☆11 · Jun 12, 2020 · Updated 5 years ago
- A simple and efficient baseline for data attribution ☆11 · Nov 10, 2023 · Updated 2 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 · May 2, 2022 · Updated 3 years ago
- Official code for the paper: "Metadata Archaeology" ☆19 · May 10, 2023 · Updated 2 years ago
- Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection ☆14 · Jun 22, 2023 · Updated 2 years ago
- Train very large language models in Jax. ☆209 · Oct 21, 2023 · Updated 2 years ago
- A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, etc.) on a GPU. It makes use of auto-di… ☆10 · Aug 28, 2018 · Updated 7 years ago
- ☆13 · Aug 20, 2021 · Updated 4 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆39 · Dec 27, 2022 · Updated 3 years ago
- More interactive weak supervision with FlyingSquid ☆315 · Sep 1, 2020 · Updated 5 years ago
- A simple Jax implementation of influence functions. ☆20 · Apr 9, 2024 · Updated 2 years ago
- Code release for "Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data". ☆13 · Apr 11, 2022 · Updated 4 years ago
- Code for NeurIPS 2020 Paper --- Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks ☆21 · Oct 24, 2022 · Updated 3 years ago
- ☆19 · Feb 25, 2024 · Updated 2 years ago
- A Survey of Dataset Refinement for Problems in Computer Vision Datasets ☆34 · Sep 12, 2025 · Updated 7 months ago
- A case study of efficient training of large language models using commodity hardware. ☆68 · Aug 4, 2022 · Updated 3 years ago
- Aioli: A unified optimization framework for language model data mixing ☆32 · Jan 17, 2025 · Updated last year
- DSIR large-scale data selection framework for language model training ☆272 · Apr 7, 2024 · Updated 2 years ago
- AdaCat ☆48 · Aug 4, 2022 · Updated 3 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023). ☆52 · Jun 5, 2024 · Updated last year
- ☆13 · Feb 14, 2022 · Updated 4 years ago
- ☆44 · Oct 13, 2023 · Updated 2 years ago
- ☆32 · May 24, 2023 · Updated 2 years ago
- NeurIPS 2024 tutorial on LLM Inference ☆49 · Dec 10, 2024 · Updated last year
- Visual Taste Approximator (VTA) is a very simple tool that helps anyone create an automatic replica of themselves that can approximate th… ☆40 · Sep 11, 2022 · Updated 3 years ago