An experimental implementation of the retrieval-enhanced language model
☆75Dec 29, 2022Updated 3 years ago
Alternatives and similar repositories for mengzi-retrieval-lm
Users that are interested in mengzi-retrieval-lm are comparing it to the libraries listed below
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆286Oct 20, 2022Updated 3 years ago
- SIGIR 2021: Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals☆11Jul 30, 2021Updated 4 years ago
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD☆23Dec 13, 2022Updated 3 years ago
- Korean Benchmark for Korean Legal Language Understanding☆17Nov 16, 2024Updated last year
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆12Nov 23, 2023Updated 2 years ago
- final-project-level3-nlp-02 created by GitHub Classroom☆11Dec 31, 2021Updated 4 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆195Jun 14, 2023Updated 2 years ago
- Serving large language model with transformers☆13Oct 18, 2022Updated 3 years ago
- ☆32Nov 18, 2025Updated 3 months ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆164Oct 4, 2023Updated 2 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 3 years ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- Russian dialog datasets parsers and crawlers.☆15Sep 6, 2021Updated 4 years ago
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆76Jul 16, 2022Updated 3 years ago
- Weekly meetup, every Thursday at 20:00☆16Jul 24, 2020Updated 5 years ago
- Embedding-based evaluation metrics for dialogue generation.☆15Jan 8, 2023Updated 3 years ago
- [EMNLP 2021] Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning☆17Jun 28, 2025Updated 8 months ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20May 14, 2022Updated 3 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- [ACL2023] Source code for Decouple knowledge from parameters for plug-and-play language modeling☆20Sep 18, 2023Updated 2 years ago
- Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022☆16Jun 22, 2022Updated 3 years ago
- ☆17Dec 16, 2022Updated 3 years ago
- The official code of our paper at EMNLP 2022: Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Mo…☆16Feb 17, 2023Updated 3 years ago
- Seq2BF: based on the paper "Sequence to Backward and Forward Sequences: A Content-Introducing Approach to Generative Short-Text Conversation", C…☆17Nov 18, 2018Updated 7 years ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- The implementation of the paper "Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters".☆17May 24, 2022Updated 3 years ago
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- Generative Retrieval Transformer☆29Jul 23, 2023Updated 2 years ago
- ☆19Sep 20, 2022Updated 3 years ago
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆24Oct 8, 2024Updated last year
- Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021☆95Jul 8, 2021Updated 4 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!)☆331Jan 10, 2024Updated 2 years ago
- ☆19Apr 1, 2022Updated 3 years ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Jan 27, 2023Updated 3 years ago
- "Zero-Training Sentence Embedding via Orthogonal Basis" paper implementation☆19Dec 23, 2018Updated 7 years ago
- ☆22Apr 12, 2022Updated 3 years ago
- This repo investigates LLMs' tendency to exhibit acquiescence bias in sequential QA interactions. Includes evaluation methods, datasets, …☆49Sep 23, 2025Updated 5 months ago