DAMO-NLP-SG / SeaLLMs
[ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia
☆140Updated last month
Related projects: ⓘ
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation☆29Updated 6 months ago
- ⚓️ Sailor: Open Language Models for South-East Asia☆98Updated 2 weeks ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆89Updated last year
- South-East Asia Large Language Models☆242Updated last week
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆118Updated last week
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆80Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆91Updated 3 weeks ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆22Updated last year
- Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"☆91Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆183Updated last year
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆416Updated 6 months ago
- Reformatted Alignment☆111Updated 4 months ago
- ☆57Updated 4 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆143Updated 5 months ago
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"☆333Updated last month
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆119Updated 2 months ago
- ☆185Updated last month
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".☆104Updated 4 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆101Updated 2 weeks ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆102Updated 2 months ago
- Official repo for "Make Your LLM Fully Utilize the Context"☆239Updated 4 months ago
- Official implementation for the paper "LongEmbed: Extending Embedding Models for Long Context Retrieval"☆108Updated 4 months ago
- We finetune Bloomz-7b1-mt using LoRA with the chatdoctor-200k dataset at here https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt an…☆30Updated last year
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆25Updated last month
- The official evaluation suite and dynamic data release for MixEval.☆200Updated last week
- Expert Specialized Fine-Tuning☆129Updated last month
- [arxiv:2406.17419]Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆62Updated last month
- 🚢 Data Toolkit for Sailor Language Models☆74Updated 2 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆134Updated 3 months ago
- ☆12Updated 9 months ago