☆20May 28, 2025Updated last year
Alternatives and similar repositories for awesome_papers
Users that are interested in awesome_papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆43Mar 11, 2025Updated last year
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆38Jun 5, 2026Updated 3 weeks ago
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- ☆23Nov 26, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm☆83Feb 24, 2025Updated last year
- [ICML 2023] FedBR: Improving Federated Learning on Heterogeneous Data via Local Learning Bias Reduction☆29Mar 7, 2024Updated 2 years ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆162Jul 9, 2025Updated 11 months ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆14Mar 11, 2025Updated last year
- Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights☆32Jan 9, 2026Updated 5 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆20Apr 5, 2025Updated last year
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Dec 27, 2024Updated last year
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆32Aug 4, 2024Updated last year
- ☆29Mar 12, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆56Oct 12, 2025Updated 8 months ago
- Official implementation of our paper "Bidirectional Consistency Models"; and reproduced Improved Consistency Models (iCT).☆27May 10, 2025Updated last year
- A curated collection of research and techniques for protecting intellectual property of large language models, including watermarking, fi…☆51Jun 10, 2026Updated 3 weeks ago
- 机器学习乐园:主要包括机器学习基础,深度学习实践,工业应用。☆15Nov 14, 2022Updated 3 years ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆69Dec 9, 2024Updated last year
- 河海大学每日健康打卡☆12Dec 4, 2021Updated 4 years ago
- [CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding☆84Jul 4, 2025Updated 11 months ago
- 😎 基于知识的文本生成相关文章总结与个人笔记☆20Oct 5, 2024Updated last year
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆30Apr 2, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An LLM leaderboard for stateful agents☆21Oct 16, 2025Updated 8 months ago
- Pocket-sized digital musical instrument inspired by the piano and the accordion☆13Oct 9, 2024Updated last year
- 中南大学机器学习2023年秋季学期作业☆21Dec 12, 2024Updated last year
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]☆39May 28, 2024Updated 2 years ago
- 实践番茄工作法:工作时屏蔽浪费时间的网站,休息时允许访问。A Chrome/Edge extension that helps you stay focused by blocking sites during work timers and letting you bro…☆13Jul 26, 2022Updated 3 years ago
- Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"☆19Dec 16, 2024Updated last year
- 大三下Web课设 - 中南大学主页 - JavaWeb☆13Dec 6, 2019Updated 6 years ago
- [ICLR 2026] Any-step Generation via N-th Order Recursive Consistent Velocity Field Estimation☆36Feb 4, 2026Updated 4 months ago
- ☆11Sep 16, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆54Mar 6, 2025Updated last year
- ☆43Jul 1, 2024Updated 2 years ago
- Code release for "Generative Modeling of Weights: Generalization or Memorization?"☆22Apr 9, 2026Updated 2 months ago
- ☆38Dec 25, 2025Updated 6 months ago
- [ACM MM 2025] Phys4DGen: Physics-Compliant 4D Generation with Multi-Material Composition Perception☆13Apr 18, 2026Updated 2 months ago
- [WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Feb 24, 2023Updated 3 years ago
- ☆10Feb 12, 2024Updated 2 years ago