Convert LLaMA3.1-8B to DeepSeek R1 MLA & MoE (raw)
☆24Mar 10, 2025Updated last year
Alternatives and similar repositories for LLaMA3.1-8B-DeepSeekR1-MLA-MoE
Users who are interested in LLaMA3.1-8B-DeepSeekR1-MLA-MoE are comparing it to the repositories listed below
- EraX-VL-7B-V1 is the multimodal large language model developed by the EraX team, based on Qwen2-VL.☆12Dec 31, 2024Updated last year
- An Enhanced Version of Piper especially for Vietnamese :)☆28Apr 24, 2025Updated 10 months ago
- ☆25Mar 29, 2024Updated last year
- JARVIS Chatbot: a local simple RAG assistant with PDF files☆28Sep 12, 2025Updated 5 months ago
- Software Engineering Back End Microservices Project☆15Nov 20, 2024Updated last year
- ☆11Apr 25, 2025Updated 10 months ago
- Support for training SSD on TF2☆12Mar 29, 2023Updated 2 years ago
- Use MobileNet SSD and OpenCV to detect and count cars on the road☆12Jan 13, 2020Updated 6 years ago
- LLM, Fine Tuning, Llama 2, Gemma, Mixtral, vLLM, LangChain, RAG, ChromaDB, FAISS☆13Mar 5, 2024Updated 2 years ago
- TabMini: A Benchmark Suite for Evaluating and Analyzing the Data Efficiency of Tabular Classifiers☆10Mar 31, 2025Updated 11 months ago
- ☆36Apr 25, 2021Updated 4 years ago
- A comprehensive ELT pipeline for analyzing passenger satisfaction data. Features a modern data architecture with Apache Airflow for extra…☆12Oct 5, 2025Updated 5 months ago
- Vietnamese GPT-J API service deployed with Docker & Helm chart☆10Dec 11, 2022Updated 3 years ago
- ☆49Aug 14, 2024Updated last year
- [ICLR26] AI-based scaling law discovery☆26Jan 30, 2026Updated last month
- ☆17Feb 25, 2026Updated last week
- Demo of using Airflow☆11Jun 24, 2022Updated 3 years ago
- Top 9 private leaderboard & Top 17 public leaderboard☆10Dec 1, 2022Updated 3 years ago
- An automatic question generation system using rule-based NLP processing techniques.☆10Feb 9, 2020Updated 6 years ago
- ☆12May 20, 2025Updated 9 months ago
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.☆23Nov 26, 2025Updated 3 months ago
- ☆13Sep 28, 2021Updated 4 years ago
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- ☆10Nov 13, 2022Updated 3 years ago
- ☆15Aug 19, 2025Updated 6 months ago
- Demo of crawling 20 years of lottery data and performing EDA☆11May 6, 2021Updated 4 years ago
- Demo of training and running prediction with YOLOv8 on custom data☆16Feb 1, 2023Updated 3 years ago
- A lightweight CLI coding agent focused on speed, determinism, and developer control☆48Jan 13, 2026Updated last month
- Demo of deploying a YOLOv6 model as an API☆10Aug 6, 2022Updated 3 years ago
- A FastAPI application that integrates with Telegram using webhooks and OpenAI Agents SDK for AI-powered stock trading assistance, utilizi…☆16May 11, 2025Updated 9 months ago
- Pioneering in Vietnamese Multimodal Large Language Model☆52Jan 23, 2025Updated last year
- A static deobfuscator for JavaScript Malware☆13May 6, 2020Updated 5 years ago
- Ollama Mistral with Langchain RAG Agent and Custom tools☆11Jul 6, 2024Updated last year
- Vast.ai python sdk☆21Updated this week
- A simple sample of OpenCV with Python☆12Oct 6, 2020Updated 5 years ago
- Streamline data pipelines for AI. Process datasets across 1000s of machines, and optimize data for blazing fast model training.☆16Sep 18, 2024Updated last year
- Hybrid-Anchor Rotation Detector for Oriented Object Detection (ICCV'25-SEA)☆16Aug 11, 2025Updated 6 months ago
- ☆12Oct 6, 2024Updated last year
- Demo of Intrusion Warning using YOLO and OpenCV☆11Jul 20, 2022Updated 3 years ago