yuanzhoulvpi2017 / vscode_debug_transformersView external linksLinks
☆414Feb 10, 2025Updated last year
Alternatives and similar repositories for vscode_debug_transformers
Users that are interested in vscode_debug_transformers are comparing it to the libraries listed below
Sorting:
- ☆120Jun 30, 2024Updated last year
- 基于DPO算法微调语言大模型,简单好上手。☆50Jul 3, 2024Updated last year
- 复现大模型相关算法及一些学习记录☆2,975Updated this week
- Official release for SplArt: Articulation Estimation and Part-level Reconstruction with 3D Gaussian Splatting.☆29Jun 5, 2025Updated 8 months ago
- LEO: A powerful Hybrid Multimodal LLM☆19Jan 18, 2025Updated last year
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)☆8,989Feb 6, 2026Updated last week
- ☆186Jan 20, 2026Updated 3 weeks ago
- Latest Advances on Multimodal Large Language Models☆17,337Updated this week
- 本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)☆23,111Dec 30, 2025Updated last month
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆4,572Jan 29, 2026Updated 2 weeks ago
- Repo for Anonymous purpose, pls don't distribute☆10Oct 2, 2024Updated last year
- Solve Visual Understanding with Reinforced VLMs☆5,833Oct 21, 2025Updated 3 months ago
- ☆58Jun 7, 2025Updated 8 months ago
- 目前各大高校领域将各种信息分布在不同的部门信息门户下,存在典型的信息孤岛问题,各个部门信息没有形成互通。当前,老师和学生存在很多有关本校相关文件、政策和活动等众多方面智能问答的统一入口的需求,例如财务处、人事处、学工处、教务处、图书馆等存在各种政策和文件规定,目前在校师生都…☆35Aug 5, 2024Updated last year
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆21Jan 24, 2026Updated 2 weeks ago
- The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction"☆12Nov 25, 2023Updated 2 years ago
- 手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube☆3,771Jul 15, 2024Updated last year
- Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue☆14Oct 12, 2021Updated 4 years ago
- Huggingface PPO Demo☆19Sep 7, 2025Updated 5 months ago
- ☆42Jan 24, 2026Updated 3 weeks ago
- 基于LLM实现CHIP2021-Task3中文临床术语标准化任务,准确率约70%。☆15Dec 16, 2024Updated last year
- A Triton-only attention backend for vLLM☆23Updated this week
- ☆14Aug 26, 2024Updated last year
- AFAC2024金融智能创新大赛☆65Nov 27, 2024Updated last year
- 主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题☆12,379Apr 30, 2025Updated 9 months ago
- Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settings☆18Sep 1, 2025Updated 5 months ago
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (…☆12,594Updated this week
- Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks☆3,799Feb 3, 2026Updated last week
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆396Aug 24, 2024Updated last year
- [CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…☆44Oct 29, 2025Updated 3 months ago
- 《自然语言理解与行业知识图谱-概念、方法与工程落地》 一书中介绍的各个章节的算法展示代码☆13Jun 24, 2024Updated last year
- ☆60Apr 13, 2025Updated 10 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆17Jan 31, 2026Updated last week
- survery of small language models☆18Jul 23, 2024Updated last year
- [S&P 2026] SoK: Evaluating Jailbreak Guardrails for Large Language Models☆35Dec 17, 2025Updated last month
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆354Jan 12, 2026Updated last month
- LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案☆1,300Dec 14, 2023Updated 2 years ago
- ☆4,552Sep 14, 2025Updated 5 months ago
- Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs (ACL 2024)☆295Dec 24, 2024Updated last year