NVIDIA / Diversity-Sampling
GPU-accelerated algorithm for subsampling datasets while preserving diversity
☆21 Updated last year
Alternatives and similar repositories for Diversity-Sampling
Users who are interested in Diversity-Sampling are comparing it to the libraries listed below.
- ☆14 Updated last month
- Official implementation of "GPT or BERT: why not both?" ☆53 Updated 2 weeks ago
- Solution of Kaggle competition: Feedback Prize - Evaluating Student Writing ☆16 Updated 3 years ago
- ☆74 Updated 2 years ago
- 2nd Place Solution - Kaggle Challenge: Learning Equality - Curriculum Recommendations ☆13 Updated 2 years ago
- some common Huggingface transformers in maximal update parametrization (µP) ☆81 Updated 3 years ago
- ☆21 Updated 3 years ago
- ☆17 Updated 2 years ago
- Fast, Modern, and Low Precision PyTorch Optimizers ☆94 Updated this week
- ☆13 Updated 3 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆66 Updated 9 months ago
- Various transformers for FSDP research ☆37 Updated 2 years ago
- VS Code Extension for Kaggle ☆17 Updated 6 months ago
- Efficient Transformers with Dynamic Token Pooling ☆61 Updated 2 years ago
- Randomized Positional Encodings Boost Length Generalization of Transformers ☆81 Updated last year
- A fast implementation of T5/UL2 in PyTorch using Flash Attention ☆105 Updated 3 months ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆50 Updated last year
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch ☆39 Updated 3 years ago
- Implementation of Bitune: Bidirectional Instruction-Tuning ☆19 Updated last week
- A case study of efficient training of large language models using commodity hardware. ☆69 Updated 2 years ago
- ☆81 Updated last year
- Implementation of Mixout with PyTorch ☆75 Updated 2 years ago
- Official code release for "SuperBPE: Space Travel for Language Models" ☆54 Updated 2 weeks ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆74 Updated 7 months ago
- Early solution for Google AI4Code competition ☆76 Updated 3 years ago
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton. ☆68 Updated 10 months ago
- ☆53 Updated 8 months ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models" ☆32 Updated 2 weeks ago
- ☆37 Updated last year
- Embedding Recycling for Language models ☆38 Updated last year