Improving Neural Text Generation with Reinforcement Learning
☆23Jan 13, 2021Updated 5 years ago
Alternatives and similar repositories for implicit-unlikelihood-training
Users that are interested in implicit-unlikelihood-training are comparing it to the libraries listed below.
- ☆50Nov 10, 2021Updated 4 years ago
- Code for SIGIR-2021 full paper: Initiative-Aware Self-Supervised Learning for Knowledge-Grounded Conversations☆11Aug 3, 2021Updated 4 years ago
- Example proposals for submitting an application to Open.TLab☆27Dec 15, 2022Updated 3 years ago
- ☆42Mar 8, 2021Updated 5 years ago
- Russian dialog datasets parsers and crawlers.☆15Sep 6, 2021Updated 4 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Distributed & asynchronous DQN implementation using gRPC and PyTorch.☆10Feb 15, 2021Updated 5 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Oct 27, 2022Updated 3 years ago
- A small library with distillation, quantization and pruning pipelines☆26Apr 20, 2021Updated 5 years ago
- ☆10Jan 5, 2018Updated 8 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"☆32Dec 16, 2020Updated 5 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- Code accompanying our papers on the "Generative Distributional Control" framework☆118Dec 7, 2022Updated 3 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Feb 27, 2023Updated 3 years ago
- Re-implementation of Progressive Neural Networks with PyTorch☆15Jul 25, 2024Updated last year
- The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an…☆14May 6, 2023Updated 3 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DQN)☆13Nov 14, 2019Updated 6 years ago
- ☆13Jul 5, 2021Updated 4 years ago
- [ICLR 2026] "VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?", Yuanxin Liu, Kun Ouyang, Haoning Wu, Yi Liu, L…☆39Jan 30, 2026Updated 4 months ago
- Implementation of "Learning Deep Generative Models"☆12Jun 4, 2019Updated 7 years ago
- PyTorch codebase for Capturing label characteristics in VAEs☆13May 1, 2021Updated 5 years ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"☆14Mar 1, 2023Updated 3 years ago
- This is the code repo of our Pattern Recognition journal on IPR protection of Image Captioning Models☆11Aug 29, 2023Updated 2 years ago
- My PhD thesis, titled "Reasonably Programmable Syntax"☆15Aug 28, 2018Updated 7 years ago
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 3 years ago
- ☆10Feb 12, 2020Updated 6 years ago
- Korean Parallel Corpus☆11Nov 27, 2014Updated 11 years ago
- Implementation/experiments for L4DC 2020 submission "Optimal Cost Design for Model Predictive Control"☆12Apr 23, 2021Updated 5 years ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Jun 14, 2017Updated 9 years ago
- An implementation of the Hopfield Network using PyTorch, leveraging CUDA for linear algebra speedup☆15Nov 19, 2025Updated 6 months ago
- [ICLR 2025] Implementation of "Node Identifiers: Compact, Discrete Representations for Efficient Graph Learning"☆17Jun 6, 2025Updated last year
- ☆11Dec 9, 2020Updated 5 years ago
- Deep Reinforcement Learning for Dialogue Generation using SEQ2SEQ model☆11Feb 23, 2021Updated 5 years ago
- A question generator described in paper "Exploring Model and Data for Image Question Answering"☆23Nov 21, 2015Updated 10 years ago
- Learning to play Atari games with reinforcement learning☆10Jan 4, 2016Updated 10 years ago
- [WIP] A tool for C++ code modification to augment data for clone detection tools☆10Jan 13, 2026Updated 5 months ago
- ☆11Jul 5, 2020Updated 5 years ago
- Code for KDD feasibility☆12Jul 17, 2023Updated 2 years ago