Implementation of Reinforce for educational purposes.
☆12Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for reinforce
Users that are interested in reinforce are comparing it to the libraries listed below
Sorting:
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆10Dec 24, 2023Updated 2 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- A fast and accurate index for distribution-aware dataset search.☆10Feb 3, 2026Updated last month
- Code repository supporting the paper "Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segment…☆11Apr 29, 2024Updated last year
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- ☆10Dec 18, 2023Updated 2 years ago
- Learning in Noisy MDP (which is governed by stochastic, exogenous input processes) with input-dependent baseline☆11Aug 7, 2020Updated 5 years ago
- Generating Human Skeletons with Mutual Actions☆11Oct 22, 2021Updated 4 years ago
- Cpp Course☆12Dec 5, 2025Updated 2 months ago
- ☆16Feb 2, 2026Updated last month
- LLM from scratch: Basic AI, Pytorch, Neural Network, Sequence Model, and LLMs☆13May 26, 2024Updated last year
- ☆11Jul 7, 2023Updated 2 years ago
- ☆10May 21, 2023Updated 2 years ago
- A list where most values will be None (or default)☆11Jul 19, 2023Updated 2 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- ☆14Jun 24, 2024Updated last year
- A conda-smithy repository for ollama.☆10Updated this week
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- Annotated implementation of vanilla Transformers to guide through all the ambiguities.☆10Jun 20, 2025Updated 8 months ago
- ☆11Dec 10, 2020Updated 5 years ago
- PyTorch implementation of the "Learning an Adaptive Learning Rate Schedule" paper found here: https://arxiv.org/abs/1909.09712.☆12Jan 15, 2020Updated 6 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- Full List of Bad Words and Top Swear Words Banned by Google. As they closed the api☆12Sep 26, 2018Updated 7 years ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- ☆12Apr 24, 2024Updated last year
- Deep Learning with Multiple Objectives: 2021 edition☆10May 27, 2021Updated 4 years ago
- Script for using Bing chat like a meal delivery service.☆12Mar 15, 2023Updated 2 years ago
- ☆13Oct 28, 2024Updated last year
- ☆51Jan 28, 2024Updated 2 years ago
- Shared code for the AI Racing League☆13Jan 11, 2025Updated last year
- Scalable Computation of Hessian Diagonals☆14Jun 2, 2024Updated last year
- TensorFlow implementation of the "Prompt-to-Prompt Image Editing with Cross Attention Control" for Stable Diffusion☆16Mar 25, 2023Updated 2 years ago
- ☆13May 21, 2024Updated last year
- ☆11Apr 22, 2022Updated 3 years ago
- This repository contains the annotations and download scripts for the audio files of the GiantSteps Key data set. This data set was publi…☆22Mar 19, 2025Updated 11 months ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…☆13Sep 8, 2022Updated 3 years ago