aqweteddy / ChatVector
Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages.
☆50Updated 11 months ago
Alternatives and similar repositories for ChatVector
Users that are interested in ChatVector are comparing it to the libraries listed below
Sorting:
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆87Updated 6 months ago
- ☆75Updated 4 months ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆30Updated 5 months ago
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism?☆34Updated 6 months ago
- ☆17Updated last year
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆11Updated 6 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆23Updated 3 weeks ago
- Unofficial implementation of AlpaGasus☆91Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆109Updated last year
- ☆12Updated 9 months ago
- Multilingual Large Language Models Evaluation Benchmark☆123Updated 8 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆89Updated last year
- [𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…☆50Updated last year
- evolve llm training instruction, from english instruction to any language.☆117Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆46Updated 5 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆12Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆151Updated 8 months ago
- Code for Zero-Shot Tokenizer Transfer☆127Updated 4 months ago
- Evaluation of the Cross-Lingual Knowledge Alignment in LLMs☆9Updated last year
- Model merging is a highly efficient approach for long-to-short reasoning.☆46Updated last month
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆214Updated 2 months ago
- Awesome LLM for NLG Evaluation Papers☆24Updated last year
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆135Updated 3 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆74Updated 11 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆47Updated 2 weeks ago
- Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents"☆19Updated 8 months ago
- ☆50Updated last year
- OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-1…☆25Updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆111Updated 10 months ago
- The attention map viewer for LLaMA models.☆34Updated last year