symato / physics_of_llms

Experiments related to LLMs for Vietnamese (inspired by the Physics of LLMs series)

☆11

Alternatives and similar repositories for physics_of_llms

Users that are interested in physics_of_llms are comparing it to the libraries listed below

ZaloAI-Jaist / VMLU
☆73 Updated last year
qhnhynmm / ViOCRVQA-Dataset
The largest VQA dataset for Vietnamese. Related to the text content in the image.
☆21 Updated 8 months ago
baochi0212 / LaVy
Pioneering in Vietnamese Multimodal Large Language Model
☆51 Updated 10 months ago
Xdao85 / VNHSGE
VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models
☆28 Updated 2 years ago
bino282 / ViNLP
☆16 Updated 3 years ago
HySonLab / ViDeBERTa
ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023
☆58 Updated 2 years ago
nguyenvulebinh / extractive-qa-mrc
Machine Reading Comprehension specifically for the Vietnamese language
☆41 Updated 3 years ago
telexyz / GPT4VN
Anyone can build their own chatbot with instruction tuning, using a 12 GB GPU (RTX 3060) and a few dozen MB of data
☆114 Updated 2 years ago
VinAIResearch / ViText2SQL
ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings)
☆36 Updated last year
hllj / Vistral-V
Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.
☆23 Updated last year
taindp98 / Vietnamese-Chatbot
👨🏻‍💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]
☆17 Updated last year
nlp-uoregon / Okapi
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
☆97 Updated 2 years ago
telexyz / vi
Building a 500 GB dataset (20% done) of Vietnamese text for training large language models
☆29 Updated 2 years ago
duongna21 / bartflax
Pre-training script for BART in JAX/Flax
☆38 Updated 3 years ago
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59 Updated last year
heraclex12 / Viwiki-spelling
A dataset for Vietnamese Spelling Correction
☆15 Updated 4 years ago
VinAIResearch / PhoMT
PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)
☆47 Updated 6 months ago
linhduongtuan / doctorwithbloom
We finetune Bloomz-7b1-mt using LoRA with the chatdoctor-200k dataset; see https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt an…
☆30 Updated 2 years ago
VinAIResearch / JointIDSF
BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)
☆87 Updated last year
VinAIResearch / BARTpho
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)
☆104 Updated last year
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆70 Updated last year
ds4v / absa-vlsp-2018
End-to-end Multi-task Solutions for Aspect Category Sentiment Analysis (ACSA) on Vietnamese reviews, using PhoBERT as pretrained model
☆29 Updated last year
Reasoning-Lab / Elementary-Math-Solving-Zalo-AI-2023
Baseline for ZaloAI Challenge 2023 Elementary Math Solving
☆69 Updated last year
kh4nh12 / ViVQA
☆14 Updated 3 years ago
Oztobuzz / Vista
This is the official repository for the Vista dataset, a Vietnamese multimodal dataset containing more than 700,000 samples of conversations a…
☆26 Updated last year
vietai / ViT5
☆72 Updated 2 years ago
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆78 Updated last year
trapoom555 / Language-Model-STS-CFT
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆66 Updated last year
leduckhai / wav2graph
wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech
☆94 Updated 5 months ago
allbyai / ToRoLaMa
ToRoLaMa: The Vietnamese Instruction-Following and Chat Model
☆24 Updated last year