☆12 · Feb 11, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for realistic_evaluation_of_model_merging_for_compositional_generalization
Users who are interested in realistic_evaluation_of_model_merging_for_compositional_generalization are comparing it to the repositories listed below.
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization ☆31 · Jan 31, 2023 · Updated 3 years ago
- This tool uses Ethereum's peer-discovery protocol to measure the size of the Ethereum network (testnets included) ☆16 · Dec 15, 2022 · Updated 3 years ago
- ☆19 · Jul 31, 2025 · Updated 7 months ago
- Codes for Merging Large Language Models ☆35 · Aug 7, 2024 · Updated last year
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging" ☆23 · Oct 11, 2025 · Updated 4 months ago
- ☆18 · Aug 19, 2024 · Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆22 · Nov 8, 2023 · Updated 2 years ago
- Code for the paper "Mehta, S. V., Patil, D., Chandar, S., & Strubell, E. (2023). An Empirical Investigation of the Role of Pre-training i… ☆17 · Mar 18, 2024 · Updated last year
- [NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling" ☆27 · Feb 24, 2025 · Updated last year
- ☆28 · May 4, 2023 · Updated 2 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging" ☆32 · Nov 4, 2024 · Updated last year
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers ☆77 · Oct 28, 2025 · Updated 4 months ago
- You should use PySR to find scaling laws. Here's an example. ☆33 · Sep 30, 2023 · Updated 2 years ago
- Context is Key: A Benchmark for Forecasting with Essential Textual Information ☆87 · Feb 11, 2026 · Updated 3 weeks ago
- A Data-Driven Approach to Predict the Success of Bank Telemarketing ☆10 · Apr 27, 2021 · Updated 4 years ago
- [CVPR 2020] A generative model with latent factors that are independent and localized. ☆12 · Mar 27, 2025 · Updated 11 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆144 · Sep 10, 2023 · Updated 2 years ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data ☆45 · Feb 18, 2025 · Updated last year
- Patching open-vocabulary models by interpolating weights ☆91 · Sep 28, 2023 · Updated 2 years ago
- Active Inference & Category Theory ☆10 · Mar 11, 2024 · Updated last year
- Tools and models for estimating Filecoin energy use from on-chain proofs ☆11 · Jun 14, 2024 · Updated last year
- ☆10 · Jul 16, 2023 · Updated 2 years ago
- Internet never forgots and now thought police never fails ☆14 · Jul 27, 2025 · Updated 7 months ago
- A collection of demos and utilities prepared ahead of the Vector Institute Privacy Enhancing Techniques (PETs) Bootcamp. ☆15 · Sep 22, 2022 · Updated 3 years ago
- ☆12 · Jul 4, 2024 · Updated last year
- Guide to interviewing for industry machine learning roles (data/applied/research scientist, ML engineer, etc). ☆11 · Dec 28, 2022 · Updated 3 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning ☆10 · Apr 28, 2023 · Updated 2 years ago
- Code used in the analyses described in "Personalized brain circuit scores identify clinically distinct biotypes in depression and anxiety… ☆11 · May 4, 2024 · Updated last year
- ☆47 · Nov 8, 2024 · Updated last year
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion ☆205 · Feb 6, 2026 · Updated 3 weeks ago
- Official Repository for Dataset Inference for LLMs ☆42 · Jul 25, 2024 · Updated last year
- Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models. ☆113 · Nov 22, 2023 · Updated 2 years ago
- ☆13 · Feb 18, 2026 · Updated 2 weeks ago
- The official implementation of Hard Negative Sampling via Large Language Models for Recommendation. ☆11 · Jan 17, 2026 · Updated last month
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆45 · Oct 1, 2025 · Updated 5 months ago
- Code for "What really matters in matrix-whitening optimizers?" ☆22 · Oct 31, 2025 · Updated 4 months ago
- Synthetic graph generator ☆12 · Nov 7, 2023 · Updated 2 years ago
- Reference implementation of models from Nyonic Model Factory ☆12 · May 13, 2024 · Updated last year
- The Structure and Interpretation of Deep Networks Handbook ☆14 · Dec 14, 2024 · Updated last year