A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
☆17Mar 11, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-distill-qwen-32b
Users that are interested in deepseek-r1-distill-qwen-32b are comparing it to the libraries listed below.
- Browser extension for LOBSTR wallet.☆10Oct 10, 2025Updated 7 months ago
- Rucio K8s tutorial☆11May 7, 2026Updated last week
- Video Summarization Transformer: Implementation in PyTorch of the Transformer model for video summarisation☆10Oct 27, 2020Updated 5 years ago
- PyTorch implementation of GRPO.☆16Apr 21, 2025Updated last year
- ☆16Mar 18, 2025Updated last year
- ☆22Jan 12, 2026Updated 4 months ago
- Geographical Graph Attention Networks: Spatial Deep Learning Models for Spatial Prediction and Exploratory Spatial Data Analysis☆18Jul 28, 2025Updated 9 months ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- RuCLIP-SB (Russian Contrastive Language–Image Pretraining SWIN-BERT) is a multimodal model for obtaining images and text similarities and…☆15Jan 25, 2022Updated 4 years ago
- Automates NFT minting on Zora Network using multiple accounts with unique delays and target trx. Supports proxies for bypassing RPC restr…☆15Sep 11, 2023Updated 2 years ago
- Classify documents using Python based on SVM and TF-IDF.☆15Nov 19, 2019Updated 6 years ago
- Twitter Bots List 🤖 a collective list of bots on Twitter☆20Jan 19, 2023Updated 3 years ago
- This is an introduction to Retrieval-Augmented Generation (RAG) for beginners. It uses Llama 2 LLM, FAISS vector store, and LangChain as…☆17Jul 8, 2025Updated 10 months ago
- Our solution to ML Talent Match hackathon☆11Mar 22, 2024Updated 2 years ago
- Official repository for Senpi Agent Skills. Contain starters, registry, etc.☆29Jan 20, 2026Updated 4 months ago
- A powerful, innovative, and real-time crypto trading bot that triggers trades based on keywords from Twitter tweets.📱 Real-time Twitter …☆27May 8, 2024Updated 2 years ago
- Image captioning using CNN and RNN☆11Mar 24, 2025Updated last year
- This script allows you to withdraw coins from the binance balance to many wallets. It will be useful for participating in retrodrops and …☆18Apr 16, 2023Updated 3 years ago
- Moment Detection in Long Tutorial Videos☆20May 8, 2024Updated 2 years ago
- NYCU Intro2AI Final Project☆21Jun 5, 2023Updated 2 years ago
- Automatic provisioning system written in Laravel 12 / PHP 8.5 using Filament 5☆37Mar 20, 2026Updated 2 months ago
- ☆86Updated this week
- A bot that fetches tweets from crypto-related Twitter accounts as well as info on CryptoPanic website. Then, English news is translated i…☆30Apr 11, 2023Updated 3 years ago
- Video Summarization With Spatiotemporal Vision Transformer☆23Jul 5, 2023Updated 2 years ago
- Implementation of X/Twitter v1, v2, and GraphQL APIs☆22Jul 15, 2024Updated last year
- Tests the Black-Scholes model's performance on forecasting option call prices of a selected option chain dataset. Discusses factors such …☆22Feb 19, 2024Updated 2 years ago
- Solving Problems with Applied Deep Learning (ITS-530)☆28Apr 18, 2026Updated last month
- Didactic Web crawler for Web Search Engines (CS 6913) course at NYU☆10Dec 8, 2022Updated 3 years ago
- RL significantly improves the reasoning capability of Qwen2.5-1.5B-Instruct☆31Feb 23, 2025Updated last year
- ☆21Mar 26, 2023Updated 3 years ago
- ☆12Sep 9, 2022Updated 3 years ago
- Collection of TensorFlow notebook tutorials for implementing some basic Deep Learning architectures.☆28Aug 15, 2018Updated 7 years ago
- Class schedule generation for students of ITMO's Artificial Intelligence program.☆20Sep 17, 2024Updated last year
- E-Commerce Website A/B testing: Recommend which of two landing pages to keep based on A/B testing☆24Dec 21, 2017Updated 8 years ago
- OpenAI Gym - Pendulum-v1 reinforcement learning (DQN, SAC)☆21Jan 26, 2024Updated 2 years ago
- ☆27Dec 29, 2024Updated last year
- 🎭 Projects I carry out on my own. Datasets are taken from open sources.☆25Nov 29, 2022Updated 3 years ago
- ☆12Oct 23, 2020Updated 5 years ago
- Enable a Jupyter Notebook user to invoke Node.js commands.☆13Apr 21, 2022Updated 4 years ago