Improving Neural Text Generation with Reinforcement Learning
☆23Jan 13, 2021Updated 5 years ago
Alternatives and similar repositories for implicit-unlikelihood-training
Users that are interested in implicit-unlikelihood-training are comparing it to the libraries listed below.
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Apr 8, 2023Updated 3 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- 👀 VITRina: VIsual Token Representations☆11Jun 15, 2023Updated 2 years ago
- Code for SIGIR-2021 full paper: Initiative-Aware Self-Supervised Learning for Knowledge-Grounded Conversations☆11Aug 3, 2021Updated 4 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- Joint Extraction & Compression text Summarization☆40Nov 1, 2019Updated 6 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Oct 27, 2022Updated 3 years ago
- ☆10Jan 5, 2018Updated 8 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"☆32Dec 16, 2020Updated 5 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- Code accompanying our papers on the "Generative Distributional Control" framework☆118Dec 7, 2022Updated 3 years ago
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆26May 29, 2025Updated 10 months ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Feb 27, 2023Updated 3 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- ☆23Mar 31, 2023Updated 3 years ago
- [ICLR 2026] "VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?", Yuanxin Liu, Kun Ouyang, Haoning Wu, Yi Liu, L…☆37Jan 30, 2026Updated 2 months ago
- Implementation of "Learning Deep Generative Models"☆12Jun 4, 2019Updated 6 years ago
- Official implementation for LaCo (EMNLP 2024 Findings)☆21Oct 3, 2024Updated last year
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆32Oct 20, 2025Updated 5 months ago
- My PhD thesis, titled "Reasonably Programmable Syntax"☆15Aug 28, 2018Updated 7 years ago
- Visualize neural networks using TikZ in Julia☆15Jan 29, 2025Updated last year
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- ☆10Feb 12, 2020Updated 6 years ago
- ☆27Oct 26, 2024Updated last year
- Yet Another PyTorch Tutorial☆12Jan 18, 2021Updated 5 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec…☆30Nov 14, 2023Updated 2 years ago
- Implementation/experiments for L4DC 2020 submission "Optimal Cost Design for Model Predictive Control"☆12Apr 23, 2021Updated 4 years ago
- Repository for storing issues and information for the ODQA Baseline team project.☆12May 22, 2021Updated 4 years ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Jun 14, 2017Updated 8 years ago
- RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).☆34Jul 16, 2022Updated 3 years ago
- An implementation of the Hopfield Network using PyTorch, leveraging CUDA for linear algebra speedup☆14Nov 19, 2025Updated 4 months ago
- ☆11Dec 9, 2020Updated 5 years ago
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.☆93May 27, 2023Updated 2 years ago
- Deep Reinforcement Learning for Dialogue Generation using SEQ2SEQ model☆12Feb 23, 2021Updated 5 years ago
- Learning to play Atari games with reinforcement learning☆10Jan 4, 2016Updated 10 years ago
- ☆11Jul 5, 2020Updated 5 years ago