[ICLR 2025] CAMEx: Curvature-Aware Merging of Experts
☆24Mar 1, 2025Updated last year
Alternatives and similar repositories for CAMEx
Users that are interested in CAMEx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "A Framework for Controllable Pareto Front Learning with Completed Scalarization Functions and its Applications"☆16Aug 11, 2024Updated last year
- Build a Recurrent Neural Network solving Optimization Problems☆10Nov 17, 2021Updated 4 years ago
- Inverse Discriminative Networks for Handwritten Signature Verification☆56Sep 29, 2022Updated 3 years ago
- Code for the paper "Rethinking Importance Weighting for Deep Learning under Distribution Shift".☆32Apr 9, 2021Updated 5 years ago
- Using Spectral Temporal Graph Neural Network model for the major assignment of Time Series Analysis and Forecasting course, with multivar…☆10Mar 27, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Giải tích số: Các phương pháp xấp xỉ nghiệm: chia đôi, lặp đơn, đa thức đặc trưng, newton, Cholesky_LU, Danilevski, Gauss, Gauss-Jordan, …☆11Jul 21, 2023Updated 2 years ago
- Recommendation system with actor and critic☆18Aug 10, 2022Updated 3 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 6 months ago
- code release for "Unrolling SGD: Understanding Factors Influencing Machine Unlearning" published at EuroS&P'22☆25Mar 13, 2022Updated 4 years ago
- Code for CVPR22 paper "Deep Unlearning via Randomized Conditionally Independent Hessians"☆25Jul 9, 2022Updated 3 years ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆17Sep 2, 2024Updated last year
- [ICLR 2024] Official implementation of Bellman Optimal Stepsize Straightening of Flow-Matching Models☆37Feb 25, 2024Updated 2 years ago
- ☆16Apr 30, 2026Updated 3 weeks ago
- Adaptive gradient descent without descent☆53Oct 12, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- PyTorch implementation of LIMoE☆52Apr 1, 2024Updated 2 years ago
- ☆16Apr 1, 2023Updated 3 years ago
- MNIST experiment from Tensorizing neural networks (Novikov et al. 2015)☆14Oct 22, 2019Updated 6 years ago
- CLUE: A Clinical Language Understanding Evaluation for LLMs☆21Jan 22, 2025Updated last year
- u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab…☆19Jul 2, 2020Updated 5 years ago
- ☆22Oct 14, 2021Updated 4 years ago
- End-to-end training of Retrieval-Augmented LMs (REALM, RAG)☆23Nov 22, 2023Updated 2 years ago
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated last year
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆46Aug 25, 2025Updated 8 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A collection of research on specialized medical LLMs for specific diseases and distinct medical specialties, organized by ICD-10 chapters…☆51Oct 10, 2025Updated 7 months ago
- Code for the paper "Tensor Networks for Maching Learning"☆17Nov 7, 2019Updated 6 years ago
- pytorch implementation of Structured Bayesian Pruning☆19Jul 13, 2018Updated 7 years ago
- Metamodeling, sensitivity analysis and visualization using the tensor train format☆21Sep 8, 2022Updated 3 years ago
- Code and resources for the Lorenz et al. (2021) QNLP paper☆29Jul 20, 2023Updated 2 years ago
- Ongoing research training transformer models at scale☆40Updated this week
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- ☆60Jun 17, 2020Updated 5 years ago
- GPU methods for alpha matting, including cutting edge research algorithms by Philip G. Lee.☆12Jan 8, 2014Updated 12 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The latest research progress of Contrastive Learning(CL), Data Augmentation(DA) and Self-Supervised Learning(SSL) in Recommender Systems☆433Sep 2, 2025Updated 8 months ago
- Code for "Variational Reasoning for Language Models"☆60Sep 29, 2025Updated 7 months ago
- [EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answering☆38Sep 20, 2024Updated last year
- Approximate the product between infinite functional objects on a manifold -- i.e. belief products☆12Updated this week
- Certified Removal from Machine Learning Models☆69Aug 23, 2021Updated 4 years ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆11Oct 9, 2024Updated last year
- Integration examples and utilities for VOT toolkit☆10Feb 18, 2026Updated 3 months ago