DAMO-NLP-SG / DAMO-SeaLLMs
[ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia
☆163Updated 7 months ago
Alternatives and similar repositories for DAMO-SeaLLMs:
Users that are interested in DAMO-SeaLLMs are comparing it to the libraries listed below
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆94Updated last year
- [EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia☆127Updated 3 months ago
- ☆66Updated 10 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆89Updated last year
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆30Updated 6 months ago
- 🚢 Data Toolkit for Sailor Language Models☆87Updated last month
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆25Updated last year
- South-East Asia Large Language Models☆297Updated this week
- Multilingual Large Language Models Evaluation Benchmark☆118Updated 7 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆207Updated 4 months ago
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation☆34Updated last year
- Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"☆98Updated last year
- Who's Who: Large Language Models Meet Knowledge Conflicts in Practice (EMNLP 2024 Findings)☆10Updated last month
- The official evaluation suite and dynamic data release for MixEval.☆233Updated 4 months ago
- Unofficial implementation of AlpaGasus☆90Updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆158Updated 9 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆131Updated 4 months ago
- ☆142Updated 11 months ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆108Updated 8 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆454Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆147Updated 6 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆129Updated 8 months ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆184Updated last year
- ☆99Updated 3 months ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆78Updated 2 months ago
- 🔱 Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs☆54Updated this week
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.☆51Updated 7 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆148Updated last year
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆349Updated 6 months ago