☆19Nov 10, 2024Updated last year
Alternatives and similar repositories for alphalora
Users who are interested in alphalora are comparing it to the libraries listed below.
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆34Jun 9, 2025Updated last year
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆20Jan 7, 2022Updated 4 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated last year
- ☆179Jul 22, 2024Updated last year
- Dataset and code for the paper MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations (ACL'24).☆26May 2, 2025Updated last year
- Debiasing Through Data Attribution☆13May 23, 2024Updated 2 years ago
- Experiments with reasoning models, training techniques, papers☆30Jun 2, 2026Updated last week
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- [ICLR 2025] On Evaluating the Durability of Safeguards for Open-Weight LLMs☆13Jun 20, 2025Updated 11 months ago
- ☆11Apr 24, 2018Updated 8 years ago
- Code for the ICLR'24 paper: MT-RANKER : Reference-free machine translation evaluation by inter-system ranking☆10Feb 29, 2024Updated 2 years ago
- Answering Ambiguous Questions via Iterative Prompting☆14May 25, 2024Updated 2 years ago
- ☆13Jan 30, 2021Updated 5 years ago
- ☆19Mar 28, 2022Updated 4 years ago
- Zero-shot entity linking with less data☆15Aug 1, 2022Updated 3 years ago
- Code for NeurIPS 2022 Spotlight paper "Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"☆20Nov 16, 2022Updated 3 years ago
- ☆12Feb 26, 2020Updated 6 years ago
- (CVPR 2023) TokenHPE: Learning Orientation Tokens for Efficient Head Pose Estimation via Transformers☆12Oct 29, 2023Updated 2 years ago
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 5 years ago
- Independent implementation of DBCA method from http://arxiv.org/abs/1912.09713☆11Nov 25, 2020Updated 5 years ago
- ☆13Jun 6, 2022Updated 4 years ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆34Mar 5, 2024Updated 2 years ago
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022).☆15Dec 13, 2022Updated 3 years ago
- Code for "Context-Aware Recurrent Encoder for Neural Machine Translation" (TASLP 2017)☆12Oct 29, 2018Updated 7 years ago
- PyTorch implementation of LAMB for ImageNet/ResNet-50 training☆13May 13, 2021Updated 5 years ago
- This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.☆19Apr 27, 2025Updated last year
- Code for "Variational Neural Discourse Relation Recognizer" (EMNLP 2016)☆16Dec 29, 2017Updated 8 years ago
- semi-autoregressive neural machine translation☆23Sep 9, 2018Updated 7 years ago
- The code implementation of "M2DF: Multi-grained Multi-curriculum Denoising Framework for Multimodal Aspect-based Sentiment Analysis"☆17Dec 8, 2023Updated 2 years ago
- ☆18Jan 17, 2024Updated 2 years ago
- [ICLR24] "AutoVP: An Automated Visual Prompting Framework and Benchmark" by Hsi-Ai Tsao*, Lei Hsiung*, Pin-Yu Chen, Sijia Liu, and Tsung-…☆23Sep 18, 2025Updated 8 months ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated 2 years ago
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago
- Code for "Multi-Modal Neural Machine Translation with Deep Semantic Interactions" (Information Sciences)☆16May 21, 2021Updated 5 years ago
- Vite + Mantine + Vanilla extract template☆12Updated this week
- Code for "An AST Structure Enhanced Decoder for Code Generation"☆15Oct 14, 2021Updated 4 years ago
- Code for "A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation" (ACL 2020)☆13Sep 14, 2021Updated 4 years ago
- Codes for Merging Large Language Models☆37Aug 7, 2024Updated last year
- The official implementation for SETA (TIP 2024).☆11Feb 17, 2025Updated last year