symato / physics_of_llmsLinks
Các thí nghiệm liên quan tới LLMs cho tiếng Việt (insprised by Physics of LLMs Series)
☆11Updated last year
Alternatives and similar repositories for physics_of_llms
Users that are interested in physics_of_llms are comparing it to the libraries listed below
Sorting:
- ☆73Updated last year
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆21Updated 8 months ago
- Pioneering in Vietnamese Multimodal Large Language Model☆51Updated 10 months ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆28Updated 2 years ago
- ☆16Updated 3 years ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆58Updated 2 years ago
- Machine Reading Comprehension special for the Vietnamese language☆41Updated 3 years ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Updated 2 years ago
- ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings)☆36Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆23Updated last year
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆97Updated 2 years ago
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆29Updated 2 years ago
- Pre-training script for BART in JAX/Flax☆38Updated 3 years ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- A dataset for Vietnamese Spelling Correction☆15Updated 4 years ago
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)☆47Updated 6 months ago
- We finetune Bloomz-7b1-mt using LoRA with the chatdoctor-200k dataset at here https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt an…☆30Updated 2 years ago
- BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)☆87Updated last year
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆104Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆70Updated last year
- End-to-end Multi-task Solutions for Aspect Category Sentiment Analysis (ACSA) on Vietnamese reviews, using PhoBERT as pretrained model☆29Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆69Updated last year
- ☆14Updated 3 years ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26Updated last year
- ☆72Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆66Updated last year
- wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech☆94Updated 5 months ago
- ToRoLaMa: The Vietnamese Instruction-Following and Chat Model☆24Updated last year