google-parfait / dataset_grouper
Libraries for efficient and scalable group-structured dataset pipelines.
☆26 Updated 3 weeks ago
Alternatives and similar repositories for dataset_grouper
Users who are interested in dataset_grouper are comparing it to the libraries listed below.
- Recycling diverse models ☆45 Updated 2 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆80 Updated last year
- Latest Weight Averaging (NeurIPS HITY 2022) ☆30 Updated 2 years ago
- A simple Jax implementation of influence functions. ☆16 Updated last year
- ☆37 Updated 3 years ago
- Repo to reproduce results for Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning ☆25 Updated 2 years ago
- ☆23 Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms ☆59 Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆40 Updated 9 months ago
- Privacy backdoors ☆51 Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data" ☆97 Updated 2 years ago
- Federated posterior averaging implemented in JAX ☆51 Updated 2 years ago
- Model Fusion via Optimal Transport, NeurIPS 2020 ☆148 Updated 2 years ago
- ☆22 Updated 2 years ago
- Official PyTorch Implementation for Meaning Representations from Trajectories in Autoregressive Models (ICLR 2024) ☆21 Updated last year
- Private Adaptive Optimization with Side Information (ICML '22) ☆16 Updated 3 years ago
- Repo for the paper: PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees (CVPR 2024) ☆19 Updated 11 months ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024) ☆34 Updated last year
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆43 Updated last year
- DP-FTRL from "Practical and Private (Deep) Learning without Sampling or Shuffling" for centralized training. ☆29 Updated last month
- Can GPT-4 Perform Neural Architecture Search? ☆87 Updated last year
- The repository contains code for Adaptive Data Optimization ☆25 Updated 7 months ago
- ☆30 Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆37 Updated 2 years ago
- ☆31 Updated last year
- ☆20 Updated 2 years ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers ☆47 Updated 2 years ago
- ☆55 Updated 11 months ago
- ☆27 Updated 5 months ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So… ☆16 Updated 2 months ago