☆59Mar 8, 2025Updated last year
Alternatives and similar repositories for DeepSeek-Distill-Qwen-For-Child
Users that are interested in DeepSeek-Distill-Qwen-For-Child are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Aug 20, 2025Updated 7 months ago
- 通义千问的DPO训练☆64Sep 21, 2024Updated last year
- ☆45Jul 1, 2024Updated last year
- This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)☆18Jan 9, 2025Updated last year
- LLM Tokenizer with BPE algorithm☆48May 7, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- xgboost复现☆15Oct 6, 2024Updated last year
- AI CUP 2024 RAG☆13Nov 19, 2024Updated last year
- 本项目由三个模块构成。意图识别:判断用户的意图是业务型还是闲聊型;模型检索:该部分构建一个语料库,当用户 发起新的query(通过意图识别判断为业务型对话)时,为用户匹配query检索的最佳response,使用HSWN进行召回(粗排), 然后构建句子的相似度,并利用Lig…☆12Feb 18, 2021Updated 5 years ago
- FastEM 是一个广告管理系统,采用开源许可证和商业许可证发行。☆12Nov 22, 2011Updated 14 years ago
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st…☆12Aug 27, 2024Updated last year
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 3 years ago
- from MHA, MQA, GQA to MLA by 苏剑林, with code☆46Feb 19, 2025Updated last year
- ☆31Aug 25, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 本项目用于Embedding模型的相关实验,包括Embedding模型评估、Embedding模型微调、Embedding模型量化等。☆73Jul 16, 2024Updated last year
- Code for "A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction"☆15Mar 15, 2024Updated 2 years ago
- ☆17Mar 8, 2024Updated 2 years ago
- ☆15Feb 18, 2024Updated 2 years ago
- Lisflood OS (LISVAP)☆12Jan 26, 2026Updated 2 months ago
- 基于200万条医疗数据对DeepSeek-R1-Distill-Qwen-32B进行fine tune且部署☆161Feb 25, 2025Updated last year
- EPG广告管理系统☆10Dec 12, 2013Updated 12 years ago
- Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..☆20Dec 3, 2023Updated 2 years ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆24Jan 4, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- a SAM based geographic information extraction tool just by interactive click on remote sensing image, as well as an efficient geospatial …☆18Dec 18, 2023Updated 2 years ago
- R Package to simulate plausible versions of SRTM and MERIT DEMs☆11Oct 22, 2019Updated 6 years ago
- recommand system ----Item Based CF (Collaborative filtering)☆18Apr 29, 2018Updated 7 years ago
- ☆15Mar 24, 2024Updated 2 years ago
- ☆16Oct 24, 2023Updated 2 years ago
- Python wrapper for fast inference with GPT-SoVITS☆14Apr 20, 2024Updated last year
- Implementation of D-SRGAN (Digital elevation map specific SRGAN for super resolution) in Pytorch.☆11Sep 13, 2022Updated 3 years ago
- ☆16Dec 22, 2021Updated 4 years ago
- ICCV'23 Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval☆19Aug 22, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 跟着Tensorrt_pro学习各种知识☆39Nov 25, 2022Updated 3 years ago
- [IJCAI'23] Semantic-aware Generation of Multi-view Portrait Drawings (SAGE)☆10Feb 25, 2024Updated 2 years ago
- ☆14Dec 20, 2022Updated 3 years ago
- Official implementation of the paper: "Deep learning for ECG classification: A comparative study of 1D and 2D representations and multimo…☆34Apr 12, 2024Updated last year
- update can run under py3, YEDDA☆14Dec 20, 2018Updated 7 years ago
- ☆19Mar 19, 2026Updated last week
- 离线部署大模型,构建一个可以上传本地知识库进行RAG问答且可以自行调用工具的Agent。☆39Apr 23, 2024Updated last year