aisingapore / sealion
South-East Asia Large Language Models
☆300 Updated last week
Alternatives and similar repositories for sealion:
Users who are interested in sealion are comparing it to the libraries listed below.
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia ☆167 Updated 8 months ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures. ☆81 Updated 3 months ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models ☆25 Updated last year
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset. ☆327 Updated 4 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆94 Updated last year
- ☆634 Updated 4 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆275 Updated last month
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆213 Updated 5 months ago
- Solutions provided to Chip Huyen's Machine Learning Interview Book with GPT ☆40 Updated last year
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation ☆35 Updated last year
- ☆515 Updated 5 months ago
- Best practices for distilling large language models. ☆526 Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆231 Updated 5 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ☆90 Updated last year
- ☆40 Updated 11 months ago
- Sample notebooks and prompts for LLM evaluation ☆124 Updated last week
- Let's build better datasets, together! ☆259 Updated 4 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆124 Updated last year
- Translate large dataset to any language with google translation api and multithreads processing, no key required! ☆67 Updated 6 months ago
- Official repository for ORPO ☆448 Updated 10 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆255 Updated last year
- [EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia ☆129 Updated 4 months ago
- Multilingual Large Language Models Evaluation Benchmark ☆123 Updated 8 months ago
- Automatically evaluate your LLMs in Google Colab ☆616 Updated 11 months ago
- Evaluate your LLM's response with Prometheus and GPT4 💯 ☆911 Updated last month
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆273 Updated 9 months ago
- Stanford NLP Python library for Representation Finetuning (ReFT) ☆1,462 Updated 2 months ago
- Implementation of ConGen: Unsupervised Control and Generalization Distillation For Sentence Representation (Findings of EMNLP 2022). ☆22 Updated last year
- An open collection of methodologies to help with successful training of large language models. ☆486 Updated last year
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆106 Updated 7 months ago