☆18Nov 10, 2024Updated last year
Alternatives and similar repositories for alphalora
Users that are interested in alphalora are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Apr 7, 2025Updated 11 months ago
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆33Jun 9, 2025Updated 9 months ago
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".☆29Mar 11, 2025Updated last year
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆20Jan 7, 2022Updated 4 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 9 months ago
- ☆177Jul 22, 2024Updated last year
- Dataset and code for the paper MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations (ACL'24).☆25May 2, 2025Updated 10 months ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11Oct 10, 2025Updated 5 months ago
- Debiasing Through Data Attribution☆12May 23, 2024Updated last year
- Experiments with reasoning models, training techniques, papers☆26Updated this week
- Implementation of the Paper "Entity Linking in Web Tables with Multiple Linked Knowledge Bases"☆10Oct 27, 2017Updated 8 years ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 9 months ago
- Code for the ICLR'24 paper: MT-RANKER : Reference-free machine translation evaluation by inter-system ranking☆10Feb 29, 2024Updated 2 years ago
- [NeurIPS'24] "NeuralFuse: Learning to Recover the Accuracy of Access-Limited Neural Network Inference in Low-Voltage Regimes"☆10Sep 18, 2025Updated 6 months ago
- ☆23Feb 3, 2026Updated last month
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- ☆18Mar 25, 2021Updated 4 years ago
- ☆14Jan 30, 2021Updated 5 years ago
- Zero-shot entity linking with less data☆15Aug 1, 2022Updated 3 years ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated 11 months ago
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 11 months ago
- Code for NeurIPS 2022 Spotlight paper " Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"☆20Nov 16, 2022Updated 3 years ago
- Code for "Domain Adaptive Meta-learning for Dialogue State Tracking"(TASLP2021)☆10Sep 14, 2021Updated 4 years ago
- ☆12Feb 26, 2020Updated 6 years ago
- Neural network approximators of linear algebra operations on GPU with PyTorch☆17May 30, 2022Updated 3 years ago
- 赛题的解题思路描述和项目源代码☆16Jan 31, 2024Updated 2 years ago
- Independent implementation of DBCA method from http://arxiv.org/abs/1912.09713☆11Nov 25, 2020Updated 5 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Jun 5, 2018Updated 7 years ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated 2 weeks ago
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022).☆15Dec 13, 2022Updated 3 years ago
- Recording and processing Tobii eyeX and 4C with the standard SDK☆14Apr 12, 2018Updated 7 years ago
- Code for "Context-Aware Recurrent Encoder for Neural Machine Translation" (TASLP 2017)☆12Oct 29, 2018Updated 7 years ago
- PyTorch implementation of LAMB for ImageNet/ResNet-50 training☆13May 13, 2021Updated 4 years ago
- Code for "Variational Neural Discourse Relation Recognizer" (EMNLP 2016)☆16Dec 29, 2017Updated 8 years ago
- [ICLR24] AutoVP: An Automated Visual Prompting Framework and Benchmark☆22Sep 18, 2025Updated 6 months ago
- ☆25Oct 20, 2022Updated 3 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated last year