☆23 · Mar 18, 2024 · Updated 2 years ago
Alternatives and similar repositories for ModelGPT
Users that are interested in ModelGPT are comparing it to the libraries listed below.
- [WWW2023] PyTorch implementation of "DUET: A Tuning-Free Device-Cloud Collaborative Parameters Generation Framework for Efficient Device … ☆25 · Mar 3, 2025 · Updated last year
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023] ☆39 · Aug 27, 2024 · Updated last year
- Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners" ☆64 · Dec 4, 2025 · Updated 3 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆45 · Jun 30, 2024 · Updated last year
- ☆73 · May 23, 2025 · Updated 10 months ago
- Curated LLM (ICML 2024) ☆14 · Oct 23, 2024 · Updated last year
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23) ☆14 · Nov 1, 2023 · Updated 2 years ago
- DELTA-pytorch:DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation ☆12 · Apr 16, 2024 · Updated last year
- ☆54 · May 8, 2023 · Updated 2 years ago
- The code for the paper 'Heterogeneous Risk Minimization' of ICML2021. ☆25 · Sep 11, 2021 · Updated 4 years ago
- This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral). ☆390 · Aug 16, 2025 · Updated 7 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆27 · Apr 17, 2024 · Updated last year
- The official implementation for Common Sense Enhanced Knowledge-based Recommendation with Large Language Model ☆14 · Apr 21, 2024 · Updated last year
- ☆66 · Sep 6, 2025 · Updated 6 months ago
- ☆15 · Dec 11, 2023 · Updated 2 years ago
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp… ☆14 · Nov 17, 2025 · Updated 4 months ago
- This repository contains the official implementation of the paper entitled with "FedAPEN: Personalized Cross-silo Federated Learning with… ☆14 · Dec 4, 2023 · Updated 2 years ago
- This repository is the official implementation of "DG-Mamba: Robust and Efficient Dynamic Graph Structure Learning with Selective State S… ☆22 · Apr 17, 2025 · Updated 11 months ago
- ☆47 · May 16, 2023 · Updated 2 years ago
- Paper: "Aggregating Capacity in FL through Successive Layer Training for Computationally-Constrained Devices" ☆18 · Jan 10, 2024 · Updated 2 years ago
- Code Repository for the ICML 2024 paper: "Towards Scalable and Versatile Weight Space Learning". ☆30 · Sep 9, 2024 · Updated last year
- Clustered Compositional Embeddings ☆11 · Oct 25, 2023 · Updated 2 years ago
- Residual vector quantization for KV cache compression in large language model ☆12 · Oct 22, 2024 · Updated last year
- A PyTorch native platform for training generative AI models ☆16 · Nov 18, 2025 · Updated 4 months ago
- [WWW'25] The official implementation of Graph Representation Learning via Causal Diffusion for Out-of-Distribution Recommendation ☆18 · Mar 29, 2025 · Updated last year
- The implementation of TNNLS paper "Multi-agent Continual Coordination via Progressive Task Contextualization". ☆15 · Dec 24, 2024 · Updated last year
- Deep Networks Grok All the Time and Here is Why ☆38 · May 18, 2024 · Updated last year
- ☆22 · May 3, 2025 · Updated 10 months ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent ☆16 · Sep 8, 2022 · Updated 3 years ago
- A novel Hybrid-Domain Network (HDNet) for Infrared small target detection task. ☆22 · Nov 4, 2025 · Updated 4 months ago
- ☆13 · Oct 29, 2021 · Updated 4 years ago
- Official code for the paper "Attention as a Hypernetwork" ☆55 · Feb 24, 2026 · Updated last month
- ☆16 · Dec 9, 2023 · Updated 2 years ago
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-… ☆17 · Oct 4, 2025 · Updated 5 months ago
- SparCL: Sparse Continual Learning on the Edge @ NeurIPS 22 ☆30 · May 29, 2023 · Updated 2 years ago
- ☆21 · Oct 22, 2025 · Updated 5 months ago
- EduAgent: Generative Student Agents in Learning ☆32 · Feb 14, 2026 · Updated last month
- Official repo for BWLer: Barycentric Weight Layer ☆30 · Mar 20, 2026 · Updated last week
- ☆13 · Aug 19, 2024 · Updated last year