☆12Feb 11, 2026Updated 2 months ago
Alternatives and similar repositories for realistic_evaluation_of_model_merging_for_compositional_generalization
Users that are interested in realistic_evaluation_of_model_merging_for_compositional_generalization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Codes for Merging Large Language Models☆36Aug 7, 2024Updated last year
- ☆19Jul 31, 2025Updated 9 months ago
- You should use PySR to find scaling laws. Here's an example.☆33Sep 30, 2023Updated 2 years ago
- This tool uses Ethereum's peer-discovery protocol to measure the size of the Ethereum network (testnets included)☆15Dec 15, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆18Aug 19, 2024Updated last year
- ☆14Oct 7, 2024Updated last year
- Official Repository for "LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions"☆15Apr 20, 2025Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆25Oct 11, 2025Updated 6 months ago
- Code for the paper "Mehta, S. V., Patil, D., Chandar, S., & Strubell, E. (2023). An Empirical Investigation of the Role of Pre-training i…☆17Mar 18, 2024Updated 2 years ago
- Course website static sources.☆11Dec 7, 2022Updated 3 years ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆23Nov 8, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ECIR'21: Simplified TinyBERT: Knowledge Distillation for Document Retrieval☆17Apr 25, 2021Updated 5 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- ☆12Jan 1, 2024Updated 2 years ago
- [NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"☆28Feb 24, 2025Updated last year
- Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"☆20Oct 26, 2024Updated last year
- Ready to go, downloadable models for Keras☆11Oct 28, 2019Updated 6 years ago
- ☆14Jan 3, 2025Updated last year
- Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models"☆11Oct 15, 2024Updated last year
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆79Oct 28, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- An implementation of torchngp + semantic-nerf☆13Sep 10, 2023Updated 2 years ago
- Github for "Reduced, Reused and Recycled" (NeurIPS 2021 Best Paper, D&B Track)☆17Jan 8, 2022Updated 4 years ago
- Code for using TOTEM on EEG data☆15Sep 24, 2025Updated 7 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆150Sep 10, 2023Updated 2 years ago
- ☆15Apr 2, 2024Updated 2 years ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆48Oct 10, 2024Updated last year
- Patching open-vocabulary models by interpolating weights☆91Sep 28, 2023Updated 2 years ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆216Apr 27, 2026Updated last week
- ☆12May 28, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence"☆23Nov 19, 2025Updated 5 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆33Nov 4, 2024Updated last year
- ☆19Feb 6, 2026Updated 2 months ago
- A website for viewing statistics of various areas in the United States.☆24Updated this week
- 🚀 First survey on Attention Sink in Transformers — 180+ papers on utilization, interpretation, and mitigation.☆69Apr 16, 2026Updated 2 weeks ago
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023)☆19Dec 8, 2023Updated 2 years ago