[KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arxiv.org/pdf/2202.02842.pdf
☆12Oct 17, 2022Updated 3 years ago
Alternatives and similar repositories for Generalization_metrics_for_NLP
Users that are interested in Generalization_metrics_for_NLP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Apr 7, 2025Updated 11 months ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆20Jan 7, 2022Updated 4 years ago
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆33Jun 9, 2025Updated 9 months ago
- Open source code for ICML 2025 Paper: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias☆43Nov 14, 2025Updated 4 months ago
- Localizing Memorized Sequences in Language Models☆21Oct 15, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆18Nov 10, 2024Updated last year
- ☆18Mar 25, 2021Updated 5 years ago
- ☆12Aug 21, 2020Updated 5 years ago
- Welcome to the 'In Context Learning Theory' Reading Group☆30Nov 8, 2024Updated last year
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 10 months ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11Oct 10, 2025Updated 5 months ago
- 南昌大学研究生学位论文LaTex模板☆11Jan 17, 2022Updated 4 years ago
- The official repository for AdaMuon☆35Aug 27, 2025Updated 6 months ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Machine Learning from Human Preferences☆30Feb 13, 2026Updated last month
- H3M-SSMoEs: Hypergraph-based Multimodal Learning with LLM Reasoning and Style-Structured Mixture of Experts☆26Feb 20, 2026Updated last month
- Code for Characterizing Scaling and Transfer Learning Behavior of FNO in SciML☆53May 31, 2023Updated 2 years ago
- V-Mapper -☆10Aug 6, 2023Updated 2 years ago
- PyTorch Implementation of GPT-2☆32Sep 4, 2024Updated last year
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated last year
- Implementation of Paper: Long-term Forecasting with TiDE: Time-series Dense Encoder☆21Nov 1, 2024Updated last year
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps☆25Jan 22, 2026Updated 2 months ago
- Course repository for the Spring COMP790 course "Deep Learning" at UNC☆23Feb 2, 2022Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Acquire features from a 3D object using a ray-cast approach.☆20Mar 31, 2025Updated 11 months ago
- rest2vec: Vectorizing the resting-state functional connectome using graph embedding☆10Jul 6, 2023Updated 2 years ago
- fast trainer for educational purposes☆24Mar 12, 2026Updated 2 weeks ago
- Scaling Sparse Fine-Tuning to Large Language Models☆18Jan 31, 2024Updated 2 years ago
- ☆12Mar 4, 2024Updated 2 years ago
- Landscaper is a comprehensive Python framework designed for exploring the loss landscapes of deep learning models.☆41Jan 27, 2026Updated 2 months ago
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 11 months ago
- Matlab Notebook for visualizing random matrix theory results and their applications to machine learning☆136May 14, 2023Updated 2 years ago
- Tools for generating and comparing Decorated Merge Trees, enriched persistence-based topological data descriptors.☆17Aug 27, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Mapper Interactive is a customizable visualization framework for the analysis and visualization of high-dimensional point cloud data usin…☆26Jul 1, 2023Updated 2 years ago
- Provide implementations and pre-trained models of MobileNet-v1, v2, and v3☆16Dec 11, 2020Updated 5 years ago
- Neural network approximators of linear algebra operations on GPU with PyTorch☆17May 30, 2022Updated 3 years ago
- A Wasserstein Subsequence Kernel for Time Series.☆21Jun 17, 2024Updated last year
- ☆18Jan 17, 2024Updated 2 years ago
- 🔋 Utilities for scientific python☆19Oct 16, 2025Updated 5 months ago
- Implementation of Poincare Embedding in PyTorch☆13Jul 27, 2017Updated 8 years ago