mettamind-ai / physics_of_llmsLinks
Các thí nghiệm liên quan tới LLMs cho tiếng Việt (insprised by Physics of LLMs Series)
☆11Updated last year
Alternatives and similar repositories for physics_of_llms
Users that are interested in physics_of_llms are comparing it to the libraries listed below
Sorting:
- ☆78Updated last year
- ☆16Updated 3 years ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆58Updated 2 years ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆19Updated 9 months ago
- Machine Reading Comprehension special for the Vietnamese language☆41Updated 3 years ago
- Pioneering in Vietnamese Multimodal Large Language Model☆51Updated last year
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆29Updated 2 years ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆29Updated 2 years ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Updated 2 years ago
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆96Updated 2 years ago
- BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)☆87Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆23Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Updated last year
- Fast whitespace correction with Transformers☆17Updated 5 months ago
- Code for NeurIPS LLM Efficiency Challenge☆60Updated last year
- A dataset for Vietnamese Spelling Correction☆15Updated 4 years ago
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆104Updated last year
- ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings)☆36Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆68Updated 2 years ago
- Pre-training script for BART in JAX/Flax☆38Updated 3 years ago
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)☆48Updated 7 months ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26Updated last year
- ChatGPT solutions for the MLE interview☆14Updated 3 years ago
- A fine-tuned Large Language Model (LLM) for the Vietnamese language based on the Llama 2 model.☆17Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Updated 2 years ago
- ☆75Updated 2 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆34Updated 3 months ago