☆19Nov 10, 2024Updated last year
Alternatives and similar repositories for alphalora
Users that are interested in alphalora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆34Jun 9, 2025Updated 11 months ago
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".☆33Mar 11, 2025Updated last year
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 11 months ago
- ☆179Jul 22, 2024Updated last year
- Dataset and code for the paper MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations (ACL'24).☆26May 2, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Debiasing Through Data Attribution☆13May 23, 2024Updated last year
- Experiments with reasoning models, training techniques, papers☆29Updated this week
- The official repository for AdaMuon☆38Aug 27, 2025Updated 8 months ago
- Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023☆12Jun 20, 2025Updated 11 months ago
- ☆24Apr 19, 2026Updated last month
- A basic implementation of a SAT attack on logic locking.☆13Jun 30, 2021Updated 4 years ago
- Code for NeurIPS 2022 Spotlight paper " Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"☆20Nov 16, 2022Updated 3 years ago
- Code for "Exploiting reverse target-side contexts for neural machine translation via asynchronous bidirectional decoding" (Artificial Int…☆11Dec 27, 2022Updated 3 years ago
- Neural network approximators of linear algebra operations on GPU with PyTorch☆17May 30, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- [ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling☆17Jun 6, 2024Updated last year
- A CUDA kernel optimization toolkit for validation, benchmarking, Nsight Compute profiling, bottleneck analysis, and iterative tuning. It …☆162Apr 22, 2026Updated 3 weeks ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Jun 5, 2018Updated 7 years ago
- CAKE Library for constant-bandwidth matrix multiplication on CPUs☆14Apr 6, 2024Updated 2 years ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆34Mar 5, 2024Updated 2 years ago
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022).☆15Dec 13, 2022Updated 3 years ago
- Open source code for ICML 2025 Paper: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias☆43Nov 14, 2025Updated 6 months ago
- Code for "Context-Aware Recurrent Encoder for Neural Machine Translation" (TASLP 2017)☆12Oct 29, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- PyTorch implementation of LAMB for ImageNet/ResNet-50 training☆13May 13, 2021Updated 5 years ago
- semi-autoregressive neural machine translation☆23Sep 9, 2018Updated 7 years ago
- ☆25Oct 20, 2022Updated 3 years ago
- Code for “An Iterative Multi-Source Mutual Knowledge Transfer Framework for Machine Reading Comprehension” (IJCAI2020)☆17Mar 18, 2021Updated 5 years ago
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago
- Code for "A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation"(ACL2020)☆13Sep 14, 2021Updated 4 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Codes for Merging Large Language Models☆36Aug 7, 2024Updated last year
- The official implementation for SETA (TIP 2024).☆11Feb 17, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2025] Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents☆32Jun 3, 2025Updated 11 months ago
- A PyTorch Implementation of Neural Turing Machine☆14Jul 24, 2020Updated 5 years ago
- ☆21Feb 13, 2023Updated 3 years ago
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!☆13Oct 22, 2023Updated 2 years ago
- Codes for reproducing the white-box adversarial attacks in “EAD: Elastic-Net Attacks to Deep Neural Networks via Adversarial Examples,” …☆21Sep 22, 2018Updated 7 years ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆31Jul 11, 2025Updated 10 months ago