[NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training
☆37Apr 7, 2025Updated last year
Alternatives and similar repositories for TempBalance
Users that are interested in TempBalance are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Nov 10, 2024Updated last year
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆34Jun 9, 2025Updated last year
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆20Jan 7, 2022Updated 4 years ago
- Open source code for ICML 2025 Paper: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias☆44Nov 14, 2025Updated 7 months ago
- Dataset and code for the paper MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations (ACL'24).☆26May 2, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR23] "Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations" by Lei Hsi…☆23Sep 17, 2025Updated 9 months ago
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".☆34Mar 11, 2025Updated last year
- ☆17Feb 3, 2022Updated 4 years ago
- Welcome to the 'In Context Learning Theory' Reading Group☆31Nov 8, 2024Updated last year
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated last year
- Debiasing Through Data Attribution☆13May 23, 2024Updated 2 years ago
- The official repository for AdaMuon☆39Aug 27, 2025Updated 10 months ago
- Code associated with ICML (2024). "Defense against Backdoor Attack on Pre-trained Language Models via Head Pruning and Attention Normaliz…☆10Feb 22, 2026Updated 4 months ago
- ☆46Oct 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Two-party Privacy-preserving Neural Network Training using Split Learning and Homomorphic Encryption (CKKS Scheme)☆12Sep 23, 2025Updated 9 months ago
- [NeurIPS'24] Official implement of "PrivCirNet: Efficient Private Inference via Block Circulant Transformation"☆15Feb 26, 2026Updated 4 months ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆23Jun 13, 2025Updated last year
- [NeurIPS'24] "NeuralFuse: Learning to Recover the Accuracy of Access-Limited Neural Network Inference in Low-Voltage Regimes" by Hao-Lun …☆10Sep 18, 2025Updated 9 months ago
- Python 3 support for the MS COCO caption evaluation tools☆14Jun 14, 2024Updated 2 years ago
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated 2 years ago
- Implementation of Paper: Long-term Forecasting with TiDE: Time-series Dense Encoder☆21Nov 1, 2024Updated last year
- Implementation for the protocols described in https://eprint.iacr.org/2023/1700☆14Apr 29, 2026Updated 2 months ago
- Code release for MPCViT accepted by ICCV 2023☆16Jan 6, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps☆27May 4, 2026Updated last month
- [ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluat…☆46Jun 11, 2025Updated last year
- ☆11Dec 10, 2019Updated 6 years ago
- The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…☆79Jan 16, 2025Updated last year
- ☆116Jan 21, 2025Updated last year
- iDeepV: predicting RBP binding sites using vector representation learned from sequences with a CNN.☆10Jul 16, 2019Updated 6 years ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated last year
- ☆10Nov 6, 2024Updated last year
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Towards Better Graph Representation Learning with Parameterized Decomposition & Filtering☆13Aug 22, 2023Updated 2 years ago
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated last year
- Official implementation of the USENIX Security 2024 paper ModelGuard: Information-Theoretic Defense Against Model Extraction Attacks.☆25Dec 6, 2023Updated 2 years ago
- ☆18Aug 15, 2022Updated 3 years ago
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion"☆73Nov 3, 2023Updated 2 years ago
- BH hackathon☆14Apr 4, 2024Updated 2 years ago
- ☆11Nov 8, 2023Updated 2 years ago