☆17Feb 4, 2025Updated last year
Alternatives and similar repositories for Self-MoA
Users that are interested in Self-MoA are comparing it to the libraries listed below.
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 6 months ago
- Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images☆18Jun 4, 2025Updated 10 months ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Feb 13, 2025Updated last year
- ☆18Mar 2, 2026Updated last month
- ☆39Jan 25, 2026Updated 2 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆20Oct 14, 2024Updated last year
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…☆19Jan 11, 2025Updated last year
- [ICML 2023] Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optim…☆10Dec 19, 2023Updated 2 years ago
- Stick-breaking attention☆63Jul 1, 2025Updated 9 months ago
- The official source code for Self-Guided Robust Graph Structure Refinement (SG-GSR) at WWW 2024 Research Track.☆17Apr 23, 2024Updated last year
- ☆19Jan 10, 2025Updated last year
- ☆12Dec 4, 2023Updated 2 years ago
- ☆20Apr 16, 2025Updated last year
- The official source code for "Subgraph Federated Learning for Local Generalization (FedLoG)" at ICLR 2025 (Oral).☆15May 6, 2025Updated 11 months ago
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)☆83Oct 23, 2024Updated last year
- ☆15Feb 21, 2024Updated 2 years ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- [ECCV 2024] Code for the paper "Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network"☆17Jul 27, 2024Updated last year
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)☆40Sep 8, 2025Updated 7 months ago
- ☆19Nov 4, 2025Updated 5 months ago
- A Domain-Specific Language, Jailbreak Attack Synthesizer and Dynamic LLM Redteaming Toolkit☆27Dec 5, 2024Updated last year
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10May 8, 2018Updated 7 years ago
- ☆22May 2, 2025Updated 11 months ago
- Some starter code for training/testing some basic CNN models given our data.☆10Feb 15, 2017Updated 9 years ago
- [CIKM 2023 Short] Code for the paper "S-Mixup: Structural Mixup for Graph Neural Networks"☆18Aug 21, 2023Updated 2 years ago
- A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.☆16May 30, 2025Updated 10 months ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Apr 13, 2026Updated last week
- ☆27May 13, 2025Updated 11 months ago
- Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning".☆18Mar 13, 2023Updated 3 years ago
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 6 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"☆53Apr 7, 2026Updated last week
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model☆25Oct 12, 2024Updated last year
- Simple MoE - Day 17 of 365 Days of Repos☆18Jan 17, 2025Updated last year
- [ICML 2025] EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning☆16May 24, 2025Updated 10 months ago
- The code implementation of Symbolic-MoE☆46Sep 2, 2025Updated 7 months ago
- ☆16Nov 15, 2023Updated 2 years ago
- A self-hosted version of WaterCrawl, a powerful web crawling and data extraction platform.☆13Jul 27, 2025Updated 8 months ago