zake7749 / Kyara
A lightweight yet efficient Chinese LLM.
☆11 · Updated last week
Related projects
Alternatives and complementary repositories for Kyara
- ROUGE score calculator with Traditional Chinese word segmentation ☆9 · Updated 3 years ago
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing. ☆17 · Updated 3 years ago
- ☆14 · Updated 2 years ago
- ☆17 · Updated 2 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆29 · Updated last year
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval ☆24 · Updated last year
- This repository contains the joint use of the CPO and SimPO methods for better reference-free preference learning. ☆35 · Updated 3 months ago
- ☆10 · Updated 2 years ago
- Transitioning from Open-Domain Chit-Chat to Task-Oriented Dialogues ☆41 · Updated 2 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie… ☆26 · Updated 2 years ago
- ☆17 · Updated 6 months ago
- Code for "A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies." ☆28 · Updated 2 years ago
- Calculate the probability of a paper being accepted by EMNLP2023 based on score distribution of ACL2023. ☆14 · Updated last year
- Taiwanese translation with a BERT-based model and an RNN. Collection of Taiwanese text corpora ☆11 · Updated 2 years ago
- ☆17 · Updated last month
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆73 · Updated 9 months ago
- ☆29 · Updated last year
- The code of paper "Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation" published at NeurIPS 202… ☆40 · Updated 2 years ago
- ☆10 · Updated 10 months ago
- Towards Systematic Measurement for Long Text Quality ☆28 · Updated 2 months ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees ☆23 · Updated last year
- 🍳 NLPrep - dataset tool for many natural language processing tasks ☆28 · Updated 3 years ago
- mSimCSE: Multilingual SimCSE ☆33 · Updated last year
- ☆14 · Updated 8 months ago
- ☆14 · Updated 5 months ago
- Long Context Extension and Generalization in LLMs ☆39 · Updated last month
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆56 · Updated last year
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments". ☆19 · Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language ☆70 · Updated 8 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆44 · Updated 11 months ago