ssmmtt / Working
☆11 Jul 9, 2025 Updated 7 months ago
Alternatives and similar repositories for Working
Users that are interested in Working are comparing it to the libraries listed below
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge ☆11 Aug 10, 2024 Updated last year
- ☆11 Aug 13, 2024 Updated last year
- ☆15 Oct 19, 2024 Updated last year
- ☆17 Sep 17, 2023 Updated 2 years ago
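- Final-exam review materials for the first semester of the first graduate year at Tianjin University's College of Intelligence and Computing, covering engineering mathematics, Theory of Socialism with Chinese Characteristics, and Dialectics of Nature ☆17 Jun 15, 2019 Updated 6 years ago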
- Unofficial Implementation of Selective Attention Transformer ☆20 Oct 31, 2024 Updated last year
- ☆19 Aug 9, 2024 Updated last year
- Code for ECML-PKDD 2022 paper "GraphMixup: Improving Class-Imbalanced Node Classification by Reinforcement Mixup and Self-supervised Cont… ☆25 Jun 7, 2023 Updated 2 years ago
- LLaMA: Open and Efficient Foundation Language Models ☆19 Apr 21, 2023 Updated 2 years ago
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation ☆28 Apr 18, 2024 Updated last year
- Chinese text error correction based on the T5 model ☆34 Nov 3, 2024 Updated last year
- Self-contained PyTorch implementation of a Sinkhorn-based router, for mixture of experts or otherwise ☆40 Aug 29, 2024 Updated last year
- ☆42 Aug 13, 2025 Updated 6 months ago
- Basic architecture of graphrag ☆46 Oct 17, 2024 Updated last year
- SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022) ☆41 May 22, 2022 Updated 3 years ago
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation" ☆52 Dec 30, 2024 Updated last year
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey" ☆59 Aug 24, 2023 Updated 2 years ago
- ☆77 Nov 13, 2023 Updated 2 years ago
- Light local website for displaying performances from different chat models. ☆87 Nov 13, 2023 Updated 2 years ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking ☆113 Jun 13, 2025 Updated 8 months ago
- LAiW: A Chinese Legal Large Language Models Benchmark ☆89 Jul 3, 2024 Updated last year
- [NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks ☆134 Nov 23, 2024 Updated last year
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models ☆125 Oct 14, 2025 Updated 4 months ago
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching ☆105 May 23, 2024 Updated last year
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention ☆115 Jun 17, 2024 Updated last year
- Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficien… ☆136 Jan 17, 2026 Updated 3 weeks ago
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models. ☆179 Jul 7, 2025 Updated 7 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆134 Mar 21, 2025 Updated 10 months ago
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original … ☆136 Nov 15, 2024 Updated last year
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning. ☆171 Jan 29, 2026 Updated 2 weeks ago
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding ☆170 Jul 7, 2025 Updated 7 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆317 Jan 3, 2026 Updated last month
- [CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis ☆181 Oct 11, 2024 Updated last year
- Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch ☆231 Sep 6, 2024 Updated last year
- [NeurIPS 2024] Official code of "LION: Linear Group RNN for 3D Object Detection in Point Clouds" ☆226 Dec 16, 2025 Updated last month
- [NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training ☆227 Mar 20, 2025 Updated 10 months ago
- [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design ☆226 Nov 14, 2023 Updated 2 years ago
- Line-by-line walkthrough of the Baichuan2 code, suitable for beginners ☆213 Sep 20, 2023 Updated 2 years ago
- [SCIS] SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model ☆226 Jan 28, 2024 Updated 2 years ago