☆59Mar 8, 2025Updated last year
Alternatives and similar repositories for DeepSeek-Distill-Qwen-For-Child
Users that are interested in DeepSeek-Distill-Qwen-For-Child are comparing it to the libraries listed below.
- PyTorch DDP Training Demo☆31Oct 20, 2024Updated last year
- ☆23Aug 20, 2025Updated 8 months ago
- Official implementation of "KARMA: A Multilevel Decomposition Hybrid Mamba Framework for Multivariate Long-Term Time Series Forecas…☆21Jul 15, 2025Updated 9 months ago
- ☆13Oct 24, 2023Updated 2 years ago
- This project leverages autogen multi agent framework along with Azure OpenAI Assistants API to automate data analysis and report generati…☆13Feb 5, 2025Updated last year
- ☆46Jul 1, 2024Updated last year
- DPO training for Tongyi Qianwen (Qwen)☆65Sep 21, 2024Updated last year
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated last year
- This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)☆19Jan 9, 2025Updated last year
- ☆19Apr 25, 2023Updated 3 years ago
- A simple deep learning framework inspired by Dezero and PyTorch☆31Jan 27, 2025Updated last year
- This is a named entity recognition (NER) dataset for OSINT towards the national defense domain.☆10Apr 21, 2023Updated 3 years ago
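- This project consists of three modules. Intent recognition: determines whether the user's intent is business-related or chit-chat. Model retrieval: builds a corpus and, when the user issues a new query (classified as a business dialogue by intent recognition), matches the best response for that query, using HSWN for recall (coarse ranking), then computes sentence similarity and uses Lig…☆12Feb 18, 2021Updated 5 years ago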
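- An automated agent for mathematical modeling, built on the Spring AI Alibaba Graph framework: starting from a problem statement, it automatically builds and solves mathematical models, performs data validation and visualization, and outputs a complete, submission-ready paper containing methods, results, code, data, and references.☆36Nov 21, 2025Updated 5 months ago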
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆30Mar 25, 2026Updated last month
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 3 years ago
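- Generates movies for visually impaired audiences: the input is a movie script and an MKV-format movie, and the output is the movie with narration☆12Jul 28, 2019Updated 6 years ago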
- from MHA, MQA, GQA to MLA by 苏剑林, with code☆47Feb 19, 2025Updated last year
- ☆16Jul 29, 2022Updated 3 years ago
- ☆45Dec 4, 2023Updated 2 years ago
- ☆31Aug 25, 2023Updated 2 years ago
- This project is for experiments with embedding models, including embedding model evaluation, fine-tuning, and quantization.☆74Jul 16, 2024Updated last year
- Official code implementation of "Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP" (AAA…☆21Dec 17, 2024Updated last year
- ☆27Feb 18, 2025Updated last year
- Code for "A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction"☆16Mar 15, 2024Updated 2 years ago
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- ☆11Aug 10, 2022Updated 3 years ago
- Introduction page of a challenging text-to-SQL dataset: KaggleDBQA☆44Sep 20, 2023Updated 2 years ago
- ☆17Dec 4, 2024Updated last year
- ☆19Jul 22, 2025Updated 9 months ago
- CEll spatial Organization-based graph convolutional network☆27Feb 3, 2024Updated 2 years ago
- Image captioning using CNN and RNN☆11Mar 24, 2025Updated last year
- CenterPoint model trained with MMDetection3d on custom dataset, and deployed with TensorRT☆35Mar 15, 2023Updated 3 years ago
- Python wrapper for fast inference with GPT-SoVITS☆14Apr 20, 2024Updated 2 years ago
- ☆36Aug 25, 2023Updated 2 years ago
- ICCV'23 Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval☆19Aug 22, 2025Updated 8 months ago
- Learning a range of topics by following Tensorrt_pro☆39Nov 25, 2022Updated 3 years ago
- SFT+RL boosts multimodal reasoning☆48Jun 27, 2025Updated 10 months ago
- Official implementation of the paper: "Deep learning for ECG classification: A comparative study of 1D and 2D representations and multimo…☆35Apr 12, 2024Updated 2 years ago