reinforcement learning on a encoder-decoder GRU for chatbot dialogue generation
☆20Jun 1, 2018Updated 8 years ago
Alternatives and similar repositories for RL-Chat-pytorch
Users that are interested in RL-Chat-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repo for SPOLIN corpus and paper "Grounding Conversations with Improvised Dialogues" (ACL2020)☆14Feb 20, 2026Updated 3 months ago
- Notes on Deep Reinforcement Learning for Natural Language Processing papers☆30Jul 17, 2017Updated 8 years ago
- Training chatbot models with reinforcement learning in ParlAI.☆17Dec 8, 2022Updated 3 years ago
- ☆24Nov 7, 2024Updated last year
- Sequential planner for large text based environments☆12Dec 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Jan 5, 2018Updated 8 years ago
- Implementing Reinforcement Learning to find the best dialogue strategy for a conversation agent (chatbot) by searching for maximum award.☆13Jun 2, 2017Updated 9 years ago
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆37Jun 8, 2023Updated 3 years ago
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP☆13Aug 17, 2023Updated 2 years ago
- Deep Learning Experiments Motivated from Fastai Course☆14Jan 2, 2019Updated 7 years ago
- Pytorch codebase for Capturing label characteristics in VAEs☆13May 1, 2021Updated 5 years ago
- This repository contains personal study notes on mathematical statistics. 수리 통계학에 대한 개인적인 공부 기록 용도의 레포지토리입니다.☆12May 2, 2023Updated 3 years ago
- (Silver medal - 60th place - Top 3%) Repository for the "Tweet Sentiment Extraction" Kaggle competition.☆11Jun 18, 2020Updated 5 years ago
- Simulating Realistic Human Scanpaths in Dynamic Real-World Scenes☆15Mar 3, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Oct 11, 2024Updated last year
- In-BoXBART: Get Instructions into Biomedical Multi-task Learning☆15Aug 23, 2022Updated 3 years ago
- Yet Another PyTorch Tutorial☆12Jan 18, 2021Updated 5 years ago
- RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback☆14May 19, 2026Updated 3 weeks ago
- ODQA Baseline 팀프로젝트 이슈/정보 저장용 레포입니다.☆12May 22, 2021Updated 5 years ago
- Lightweight ngram random text generator☆12Jul 11, 2014Updated 11 years ago
- An implementation of the Hopfield Network using PyTorch, leveraging CUDA for linear algebra speedup☆15Nov 19, 2025Updated 6 months ago
- Hierarchical Hidden Markov Model☆12Dec 11, 2022Updated 3 years ago
- Hierarchical Framework for Interpretable Deep Reinforcement Learning Based- Predictive Maintenance (Applied to NASA Turbofan engine datas…☆14Feb 9, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Similarity Encoder (SimEc) Neural Network Framework for learning low dimensional similarity preserving representations☆17Jun 28, 2020Updated 5 years ago
- Energy Based Models are a quite novel technique for density estimation. In this university project I explore this new research topic and …☆16Jul 6, 2021Updated 4 years ago
- ☆16Apr 28, 2023Updated 3 years ago
- qgpt-issue-31☆11Oct 31, 2024Updated last year
- training BART from scratch☆12Dec 31, 2021Updated 4 years ago
- Data and models for Misinfo Reaction Frames paper.☆14Jun 9, 2024Updated 2 years ago
- A Python tool for parsing and analyzing electron density maps data available from the worldwide Protein Data Bank☆12Sep 28, 2023Updated 2 years ago
- Manim scripts for YT videos☆12Nov 17, 2023Updated 2 years ago
- ☆15Feb 24, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- mGPfusion is a Gaussian process based method for predicting stability changes upon single and multiple mutations of proteins that comple…☆15May 24, 2018Updated 8 years ago
- "One general law, leading to the advancement of all organic beings, namely, multiply, vary, let the strongest live and the weakest die." …☆17Nov 18, 2020Updated 5 years ago
- ☆16Jul 20, 2017Updated 8 years ago
- Implementation of Learning without Prejudices: Continual Unbiased Learning via Benign and Malignant Forgetting (ICLR 2023)☆13Apr 14, 2023Updated 3 years ago
- ☆10Jun 28, 2023Updated 2 years ago
- Source Code for Paper "Evolutionary Community Detection in Dynamic Social Networks" IJCNN 2019☆16Jul 20, 2020Updated 5 years ago
- The repository of the quantum natural language processing WP6 within NEASQC. Development and releases are stored in this repository.☆12Jan 27, 2025Updated last year