A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
☆17Mar 11, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-distill-qwen-32b
Users that are interested in deepseek-r1-distill-qwen-32b are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Browser extension for LOBSTR wallet.☆10Oct 10, 2025Updated 8 months ago
- Rucio K8s tutorial☆11May 7, 2026Updated last month
- Video Summarization Transformer: Implementation in PyTorch of the Transformer model for video summarisation☆10Oct 27, 2020Updated 5 years ago
- PyTorch implementation of GRPO.☆16Apr 21, 2025Updated last year
- ☆16Mar 18, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆22Jun 16, 2026Updated last week
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- RuCLIP-SB (Russian Contrastive Language–Image Pretraining SWIN-BERT) is a multimodal model for obtaining images and text similarities and…☆15Jan 25, 2022Updated 4 years ago
- Geographical Graph Attention Networks: Spatial Deep Learning Models for Spatial Prediction and Exploratory Spatial Data Analysis☆19Jul 28, 2025Updated 11 months ago
- Automates NFT minting on Zora Network using multiple accounts with unique delays and target trx. Supports proxies for bypassing RPC restr…☆15Sep 11, 2023Updated 2 years ago
- Classify documents using Python based on SVM and TF-IDF.☆15Nov 19, 2019Updated 6 years ago
- Twitter Bots List 🤖 a collective list of bots on Twitter☆20Jan 19, 2023Updated 3 years ago
- This is an introduction to Retrieval-Augmented Generation (RAG) for beginners . It uses Llama 2 LLM, FAISS vector store, and LangChain as…☆17Jul 8, 2025Updated 11 months ago
- Our solution to ML Talent Match hackathon☆11Mar 22, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official repository for Senpi Agent Skills. Contain starters, registry, etc.☆31Jan 20, 2026Updated 5 months ago
- A powerful, innovative, and real-time crypto trading bot that triggers trades based on keywords from Twitter tweets.📱 Real-time Twitter …☆26May 8, 2024Updated 2 years ago
- Image captioning using CNN and RNN☆11Mar 24, 2025Updated last year
- This script allows you to withdraw coins from the binance balance to many wallets. It will be useful for participating in retrodrops and …☆17Apr 16, 2023Updated 3 years ago
- Moment Detection in Long Tutorial Videos☆20May 8, 2024Updated 2 years ago
- NYCU Intro2AI Final Project☆21Jun 5, 2023Updated 3 years ago
- Automatic provisioning system written in Laravel 12 / PHP 8.5 using Filament 5☆39Jun 2, 2026Updated 3 weeks ago
- ☆93Updated this week
- A bot that fetches tweets from crypto-related Twitter accounts as well as info on CryptoPanic website. Then, English news is translated i…☆30Apr 11, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Video Summarization With Spatiotemporal Vision Transformer☆23Jul 5, 2023Updated 2 years ago
- Implementation of X/Twitter v1, v2, and GraphQL APIs☆22Jul 15, 2024Updated last year
- Tests the Black-Scholes model's performance on forecasting option call prices of a selected option chain dataset. Discusses factors such …☆22Feb 19, 2024Updated 2 years ago
- Solving Problems with Applied Deep Learning (ITS-530)☆28Apr 18, 2026Updated 2 months ago
- Didactic Web crawler for Web Search Engines (CS 6913) course at NYU☆10Dec 8, 2022Updated 3 years ago
- RL significantly the reasoning capability of Qwen2.5-1.5B-Instruct☆31Feb 23, 2025Updated last year
- ☆21Mar 26, 2023Updated 3 years ago
- ☆12Sep 9, 2022Updated 3 years ago
- Collection of tensorflow notebooks tutorials for implementing some basic Deep Learning architectures.