linhduongtuan / doctorwithbloom
We fine-tune Bloomz-7b1-mt using LoRA on the chatdoctor-200k dataset. See https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt and https://huggingface.co/LinhDuong/doctorwithbloomz-7b1.
☆30Updated 2 years ago
Alternatives and similar repositories for doctorwithbloom
Users that are interested in doctorwithbloom are comparing it to the libraries listed below
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆184Updated 2 years ago
- Machine Reading Comprehension specifically for the Vietnamese language☆42Updated 3 years ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆56Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆97Updated last year
- BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)☆87Updated last year
- 👨🏻‍💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated last year
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆56Updated last year
- ☆71Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 months ago
- Pre-training script for BART in JAX/Flax☆38Updated 3 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated 9 months ago
- Anyone can build their own chatbot with instruction tuning, using a 12 GB GPU (RTX 3060) and a few dozen MB of data☆113Updated 2 years ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆27Updated 2 years ago
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia☆170Updated last year
- ToRoLaMa: The Vietnamese Instruction-Following and Chat Model☆23Updated last year
- A Streamlit app running GPT-2 language model for text classification, built with Pytorch, Transformers and AWS SageMaker.☆39Updated 3 years ago
- Building a 500GB dataset (20% done) of Vietnamese text for training large language models☆28Updated 2 years ago
- ☆30Updated 2 years ago
- ICONIP2021 - A Vietnamese Medical Dataset for IC and NER☆20Updated 2 years ago
- ☆16Updated 2 years ago
- Defines Transformer, T5, and RoBERTa encoder-decoder models for product name generation☆48Updated 3 years ago
- Showcases training of various downstream NLP tasks with pre-trained language models using PyTorch Lightning☆13Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆80Updated 2 years ago
- 📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi…☆52Updated last year
- A dataset for Vietnamese Spelling Correction☆15Updated 3 years ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 9 months ago
- Tools for managing datasets for governance and training.☆85Updated 2 months ago
- LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries.☆69Updated last year
- An extension of the Transformers library to include a T5ForSequenceClassification class.☆39Updated 2 years ago