EraX-JS-Company / LLaMA3.1-8B-DeepSeekR1-MLA-MoELinks
Convert LLaMA3.1-8B to DeepSeek R1 MLA & MoE (raw)
☆24Updated 10 months ago
Alternatives and similar repositories for LLaMA3.1-8B-DeepSeekR1-MLA-MoE
Users that are interested in LLaMA3.1-8B-DeepSeekR1-MLA-MoE are comparing it to the libraries listed below
Sorting:
- ☆68Updated last year
- ☆25Updated last year
- To simplify and streamline LLM operations, empowering developers and organizations to harness the full potential of large language models…☆131Updated last year
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆68Updated 2 years ago
- Bud500: A Comprehensive Vietnamese ASR Dataset☆69Updated 3 months ago
- Pioneering in Vietnamese Multimodal Large Language Model☆51Updated last year
- ☆78Updated last year
- ☆76Updated 8 months ago
- A collection of Vietnamese Natural Language Processing resources.☆306Updated 3 months ago
- EraX-VL-7B-V1 is the multimodal large language model developed by EraX team, base on Qwen2-VL.☆12Updated last year
- RAG for Vietnamese Wikipedia corpus.☆35Updated 2 years ago
- Dự án bao gồm: 1. Xây dựng bộ dữ Instructions Vietnamese (chất lượng, nhiều, và đa dạng). 2.LLM Training, Finetuning, Evaluating & Testin…☆275Updated 5 months ago
- Ai c ũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Updated 2 years ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26Updated last year
- This project demonstrates a production-grade MLOps pipeline that deploys a YOLOv11-based face detection service on Google Kubernetes Engi…☆38Updated 7 months ago
- Comprehensive tools for building (Retrieval Augmented Generation) RAG chatbots.☆82Updated 11 months ago
- ☆67Updated last year
- wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech☆95Updated 6 months ago
- ☆107Updated 2 years ago
- A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)☆136Updated last year
- RAG Best Practice on Vietnamese☆256Updated 3 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆23Updated last year
- ☆55Updated 10 months ago
- ☆11Updated 2 years ago
- NTTU Chatbot - A student support chatbot using LLM + Document Retriever (RAG) in Vietnamese☆109Updated 8 months ago
- A project improves the quality and accuracy of the Vietnamese language.☆52Updated 7 months ago
- [LREC-COLING 2024 (Oral), Interspeech 2024 (Oral), NAACL 2025, ACL 2025, EMNLP 2025] A Series of Multilingual Multitask Medical Speech Pr…☆372Updated last month
- ☆32Updated 2 years ago
- Built and deployed scalable LLM retrieval APIs on a hybrid GCP architecture with full CI/CD, IaC, and monitoring☆70Updated 5 months ago