☆62Mar 8, 2025Updated last year
Alternatives and similar repositories for DeepSeek-Distill-Qwen-For-Child
Users that are interested in DeepSeek-Distill-Qwen-For-Child are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch DDP Traning Demo☆31Oct 20, 2024Updated last year
- ☆23Aug 20, 2025Updated 9 months ago
- About Official implementation of "KARMA: A Multilevel Decomposition Hybrid Mamba Framework for Multivariate Long-Term Time Series Forecas…☆23Jul 15, 2025Updated 10 months ago
- 通义千问的DPO训练☆65Sep 21, 2024Updated last year
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)☆19Jan 9, 2025Updated last year
- 基于CLIP实现以文精准 搜图☆16Sep 20, 2023Updated 2 years ago
- 本项目由三个模块构成。意图识别:判断用户的意图是业务型还是闲聊型;模型检索:该部分构建一个语料库,当用户 发起新的query(通过意图识别判断为业务型对话)时,为用户匹配query检索的最佳response,使用HSWN进行召回(粗排), 然后构建句子的相似度,并利用Lig…☆12Feb 18, 2021Updated 5 years ago
- FastEM 是一个广告管理系统,采用开源许可证和商业许可证发行。☆12Nov 22, 2011Updated 14 years ago
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 3 years ago
- 为视障人群生成电影,输入是电影剧本和mkv格式电影,输出为带有解说的电影☆12Jul 28, 2019Updated 6 years ago
- ☆16Jul 29, 2022Updated 3 years ago
- from MHA, MQA, GQA to MLA by 苏剑林, with code☆49Feb 19, 2025Updated last year
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆32Mar 25, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆32Aug 25, 2023Updated 2 years ago
- A simple ad server in Go☆16Aug 19, 2014Updated 11 years ago
- simple multi-class GBDT☆15Feb 24, 2014Updated 12 years ago
- 一个时间管理类app项目,该app能够直观看到一周内自己把时间都花在了什么地方上。同时也可以很方便的记录时间。 不仅可以管理时间,还可以记录经验,记录灵感,添加倒数日,添加周常事件。☆15Jul 29, 2022Updated 3 years ago
- ☆27Feb 18, 2025Updated last year
- ☆15Feb 18, 2024Updated 2 years ago
- Source code for AAAI 2021 paper "A Supervised Multi-Head Self-Attention Network for Nested Named Entity Recognition""☆16Jun 16, 2021Updated 4 years ago
- A simple, easy-to-hack GraphRAG implementation☆15Sep 21, 2024Updated last year
- ☆11Aug 10, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 南京大学《计算传播与广告》课程☆17Jan 16, 2016Updated 10 years ago
- Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..☆20Dec 3, 2023Updated 2 years ago
- ☆58Jan 5, 2026Updated 5 months ago
- ☆16Oct 24, 2023Updated 2 years ago
- Python wrapper for fast inference with GPT-SoVITS☆14Apr 20, 2024Updated 2 years ago
- CenterPoint model trained with MMDetection3d on custom dataset, and deployed with TensorRT☆35Mar 15, 2023Updated 3 years ago
- ☆16Dec 22, 2021Updated 4 years ago
- 跟着Tensorrt_pro学习各种知识☆39Nov 25, 2022Updated 3 years ago
- [ACL 2024] DiFiNet: Boundary-Aware Semantic Differentiation and Filtration Network for Nested Named Entity Recognition☆17Oct 2, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- SFT+RL boosts multimodal reasoning☆50Jun 27, 2025Updated 11 months ago
- ☆14Dec 20, 2022Updated 3 years ago
- leetcode-hot100的题目,和 Interview-code-practice-python(https://github.com/leeguandong/Interview-code-practice-python)互为一体,找工作的好帮手。☆44Oct 25, 2024Updated last year
- Official implementation of the paper: "Deep learning for ECG classification: A comparative study of 1D and 2D representations and multimo…☆37Apr 12, 2024Updated 2 years ago
- Implementation of HGCN for AQA☆17Jun 24, 2023Updated 2 years ago
- ☆20May 26, 2026Updated 2 weeks ago
- ☆45Apr 9, 2024Updated 2 years ago