[KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arxiv.org/pdf/2202.02842.pdf
☆12Oct 17, 2022Updated 3 years ago
Alternatives and similar repositories for Generalization_metrics_for_NLP
Users that are interested in Generalization_metrics_for_NLP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Apr 7, 2025Updated last year
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆20Jan 7, 2022Updated 4 years ago
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆34Jun 9, 2025Updated 11 months ago
- Open source code for ICML 2025 Paper: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias☆43Nov 14, 2025Updated 6 months ago
- ☆19Nov 10, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Localizing Memorized Sequences in Language Models☆22Oct 15, 2025Updated 7 months ago
- ☆18Mar 25, 2021Updated 5 years ago
- ☆12Aug 21, 2020Updated 5 years ago
- Welcome to the 'In Context Learning Theory' Reading Group☆31Nov 8, 2024Updated last year
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 11 months ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11May 1, 2026Updated 3 weeks ago
- 南昌大学研究生学位论文LaTex模板☆11Jan 17, 2022Updated 4 years ago
- The official repository for AdaMuon☆39Aug 27, 2025Updated 8 months ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- H3M-SSMoEs: Hypergraph-based Multimodal Learning with LLM Reasoning and Style-Structured Mixture of Experts☆29Feb 20, 2026Updated 3 months ago
- Code for Characterizing Scaling and Transfer Learning Behavior of FNO in SciML☆54May 31, 2023Updated 2 years ago
- V-Mapper -☆10Aug 6, 2023Updated 2 years ago
- Machine Learning from Human Preferences☆33Mar 23, 2026Updated 2 months ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…☆20Apr 17, 2026Updated last month
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated last year
- PyTorch Implementation of GPT-2☆33Sep 4, 2024Updated last year
- Implementation of Paper: Long-term Forecasting with TiDE: Time-series Dense Encoder☆21Nov 1, 2024Updated last year
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps☆26May 4, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- rest2vec: Vectorizing the resting-state functional connectome using graph embedding☆10Jul 6, 2023Updated 2 years ago
- Acquire features from a 3D object using a ray-cast approach.☆21Mar 31, 2025Updated last year
- Course repository for the Spring COMP790 course "Deep Learning" at UNC☆23Feb 2, 2022Updated 4 years ago
- fast trainer for educational purposes☆26May 4, 2026Updated 3 weeks ago
- Scaling Sparse Fine-Tuning to Large Language Models☆19Jan 31, 2024Updated 2 years ago
- ☆12Mar 4, 2024Updated 2 years ago
- Landscaper is a comprehensive Python framework designed for exploring the loss landscapes of deep learning models.☆43Jan 27, 2026Updated 3 months ago
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated last year
- Matlab Notebook for visualizing random matrix theory results and their applications to machine learning☆138May 14, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Tools for generating and comparing Decorated Merge Trees, enriched persistence-based topological data descriptors.☆17Aug 27, 2022Updated 3 years ago
- Mapper Interactive is a customizable visualization framework for the analysis and visualization of high-dimensional point cloud data usin…☆26Mar 31, 2026Updated last month
- Provide implementations and pre-trained models of MobileNet-v1, v2, and v3☆16Dec 11, 2020Updated 5 years ago
- Neural network approximators of linear algebra operations on GPU with PyTorch☆17May 30, 2022Updated 3 years ago
- A Wasserstein Subsequence Kernel for Time Series.☆21Jun 17, 2024Updated last year
- ☆18Jan 17, 2024Updated 2 years ago
- 🔋 Utilities for scientific python☆19Oct 16, 2025Updated 7 months ago