inferless / deepseek-r1-distill-qwen-32bLinks
A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
☆15Updated 8 months ago
Alternatives and similar repositories for deepseek-r1-distill-qwen-32b
Users that are interested in deepseek-r1-distill-qwen-32b are comparing it to the libraries listed below
Sorting:
- ☆10Updated last week
- ☆21Updated 9 months ago
- 基于大模型生成内容的智能语音对讲☆10Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 6 months ago
- Work with your business data using natural language☆18Updated last year
- Dify Streamlit Chat App☆14Updated last year
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform☆22Updated 9 months ago
- A Python implementation of an agent swarm system that works with local LLM servers. The system allows you to create multiple agents that …☆11Updated last year
- ⚡ Official Java SDK for Dify AI platform - Complete Chat, Workflow & Knowledge Base APIs with real-time streaming☆20Updated 5 months ago
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆26Updated 5 months ago
- ☆51Updated last year
- Synthetic Data Generation using LLM via Argilla, Distilabel, ChatGPT, etc.☆30Updated last year
- ☆28Updated 4 months ago
- Open-source examples and guides for building with the Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.☆30Updated last year
- AI Multi-agent system for real-time, adaptive supply chain coordination and optimization leveraging responsive AI clusters.☆33Updated last year
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆22Updated 6 months ago
- A simple WeChat Official Account layout tool based on Dify☆16Updated 4 months ago
- 🤗 HF Downloader (Hugging Face Downloader) 📦 A user-friendly GUI tool for downloading Hugging Face resources with enhanced connectivity…☆11Updated 10 months ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆21Updated 11 months ago
- ☆17Updated 4 months ago
- Automated and fast parsing of local project directories and GitHub directories, one-click deployment of local parsing with AutoGPT(自动化快速解…☆26Updated last year
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆33Updated last year
- ☆22Updated last year
- AI_Powered_Dev_Search_Engine☆12Updated last year
- Prompt Injection & Prevention techniques. Secure your AI Chatbots built using LLMs.☆13Updated last year
- ☆13Updated 10 months ago
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Updated 9 months ago
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT-MoE (~85M params). Fast, creative text generation …☆16Updated 2 months ago
- [ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆51Updated last year
- ☆40Updated 11 months ago