linhduongtuan / doctorwithbloom
We fine-tune Bloomz-7b1-mt using LoRA on the chatdoctor-200k dataset. The resulting models are available at https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt and https://huggingface.co/LinhDuong/doctorwithbloomz-7b1.
☆30 · Updated last year
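To make the appeal of LoRA for a 7B-parameter model concrete, here is a minimal sketch of the parameter arithmetic: instead of updating a full weight matrix, LoRA trains two low-rank factors. The layer dimensions and the rank below are illustrative assumptions, not values taken from this repository, and the helper names are hypothetical.

```python
def full_params(d_in: int, d_out: int) -> int:
    """Number of weights in a dense d_out x d_in projection."""
    return d_in * d_out


def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """Weights in the two LoRA factors: A (rank x d_in) and B (d_out x rank)."""
    return rank * (d_in + d_out)


# Illustrative values only (assumed, not from the repository).
d_in, d_out, rank = 4096, 12288, 8

full = full_params(d_in, d_out)
lora = lora_trainable_params(d_in, d_out, rank)

print(f"full projection: {full} weights")
print(f"LoRA factors:    {lora} weights ({lora / full:.4%} of full)")
```

Under these assumed sizes, the adapter trains well under one percent of the weights in the projection it wraps, which is why this kind of fine-tuning fits on modest hardware.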
Related projects
Alternatives and complementary repositories for doctorwithbloom
- Machine reading comprehension specifically for the Vietnamese language ☆38 · Updated 2 years ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆91 · Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models ☆25 · Updated last year
- ICONIP2021 - A Vietnamese Medical Dataset for IC and NER ☆15 · Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs… ☆183 · Updated last year
- ☆21 · Updated 8 months ago
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia ☆149 · Updated 3 months ago
- Building a 500GB dataset (20% done) of Vietnamese text for training large language models ☆25 · Updated last year
- ☆17 · Updated 2 years ago
- ToRoLaMa: The Vietnamese Instruction-Following and Chat Model ☆23 · Updated 10 months ago
- Sentence Embeddings with BERT & XLNet ☆27 · Updated 4 years ago
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022] ☆17 · Updated 3 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model. ☆19 · Updated 4 months ago
- ☆59 · Updated 6 months ago
- Vietnamese long-form question answering system with document retrieval. ☆19 · Updated 7 months ago
- ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings) ☆28 · Updated 4 months ago
- A dataset for Vietnamese Spelling Correction ☆15 · Updated 3 years ago
- BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021) ☆84 · Updated 4 months ago
- Pre-training script for BART in JAX/Flax ☆37 · Updated 2 years ago
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024) ☆30 · Updated 2 months ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023 ☆54 · Updated last year
- Fast whitespace correction with Transformers ☆14 · Updated 6 months ago
- MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with Intent-Slot Co-Attention (EMNLP 2023 - Findings) ☆23 · Updated 4 months ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas… ☆74 · Updated last year
- Anyone can build their own chatbot with instruction tuning, using a 12 GB GPU (RTX 3060) and a few dozen MB of data ☆111 · Updated last year
- Use the LoRA technique to improve training of large language models ☆12 · Updated last year
- The search for the best Conversational AI pipeline ☆14 · Updated 4 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ☆86 · Updated last year
- Translate large datasets into any language with the Google translation API and multithreaded processing, no key required! ☆56 · Updated last month
- A Robustly Optimized BERT Pretraining Approach for Vietnamese ☆31 · Updated 3 months ago