☆12May 7, 2026Updated last month
Alternatives and similar repositories for Working
Users who are interested in Working are comparing it to the libraries listed below.
- PyTorch implementation of various token mixers (attention mechanisms, MLPs, etc.) for understanding computer vision papers and other tas…☆17Mar 11, 2026Updated 3 months ago
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- ☆13Aug 13, 2024Updated last year
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge☆11Aug 10, 2024Updated last year
- ☆17Nov 29, 2023Updated 2 years ago
- ☆17Sep 17, 2023Updated 2 years ago
- A benchmark for evaluating logical reasoning of LLMs☆23Jan 25, 2024Updated 2 years ago
- ☆55Dec 31, 2025Updated 5 months ago
- Code for ECML-PKDD 2022 paper "GraphMixup: Improving Class-Imbalanced Node Classification by Reinforcement Mixup and Self-supervised Cont…☆24Jun 7, 2023Updated 3 years ago
- An official implementation of the paper "How Sparse Can We Prune A Deep Network: A Fundamental Limit Viewpoint".☆29Nov 13, 2024Updated last year
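- Final exam review materials for the first semester of the first graduate year at the College of Intelligence and Computing, Tianjin University, covering engineering mathematics, Theory of Socialism with Chinese Characteristics (中特), and Dialectics of Nature (自辩)☆18Jun 15, 2019Updated 7 years ago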
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation☆29Apr 18, 2024Updated 2 years ago
- LLaMA: Open and Efficient Foundation Language Models☆19Apr 21, 2023Updated 3 years ago
- Chinese text error correction based on the T5 model☆34Nov 3, 2024Updated last year
- ☆36Mar 25, 2024Updated 2 years ago
- SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022)☆41May 22, 2022Updated 4 years ago
- ☆42May 4, 2026Updated last month
- [CVPR 2025] Official implementation of paper "MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders".☆52Jun 7, 2025Updated last year
- Inference code for LLaMA models☆41Mar 13, 2023Updated 3 years ago
- Self-Controlled Memory System for LLMs☆50Apr 26, 2024Updated 2 years ago
- Metacognitive Prompting Improves Understanding in Large Language Models (NAACL 2024)☆46Nov 8, 2023Updated 2 years ago
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆54Dec 30, 2024Updated last year
- ☆32Oct 22, 2023Updated 2 years ago
- Train an LLM LoRA using a specific dataset to enable the LLM to continue stories in a specific style based on the plot and background.通过特…☆51Oct 6, 2024Updated last year
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆82Mar 1, 2025Updated last year
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆82Apr 12, 2024Updated 2 years ago
- Temporal Commonsense Reasoning in Dialog☆72Jun 9, 2021Updated 5 years ago
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆58Aug 24, 2023Updated 2 years ago
- Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering☆69Mar 4, 2025Updated last year
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆83Oct 5, 2023Updated 2 years ago
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆119Aug 17, 2025Updated 9 months ago
- [NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)☆113Aug 5, 2025Updated 10 months ago
- Code for "ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning" (ICLR 2020)☆82Jul 2, 2024Updated last year
- Do Large Language Models Know What They Don’t Know?☆102Nov 8, 2024Updated last year
- [ICRA 2026] UniFuture: A 4D Driving World Model for Future Generation and Perception☆158Feb 26, 2026Updated 3 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆117Jun 17, 2024Updated last year
- ☆142Aug 11, 2022Updated 3 years ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆136Mar 21, 2025Updated last year
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding☆177Jul 7, 2025Updated 11 months ago