ManifoldRG / MultiNet
☆10Updated this week
Related projects: ⓘ
- In Progress Implementation of GATO style Generalist Multimodal model capable of image, text, RL and Robotics tasks☆43Updated 3 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆17Updated last month
- Shakespeare transformer fine-tuned to generate positive sentiment samples using RLHF☆10Updated last year
- ☆40Updated 4 months ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆41Updated 3 months ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆12Updated last year
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆30Updated this week
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆52Updated last month
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆31Updated 3 months ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated last year
- Supercharge huggingface transformers with model parallelism.☆72Updated 6 months ago
- Critique-out-Loud Reward Models☆17Updated 2 weeks ago
- This repo contains a set of notebooks to reproduce reinforcement learning algorithms.☆13Updated last year
- Clean RL implementation using MLX☆26Updated 6 months ago
- Exercises of the reinforcement learning course from Hugging Face☆9Updated last year
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆77Updated 9 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆24Updated last week
- My explorations into editing the knowledge and memories of an attention network☆34Updated last year
- ☆27Updated this week
- Implementation of Stable Diffusion from scratch [WORK IN PROGRESS]☆21Updated last year
- A library to create and manage configuration files, especially for machine learning projects.☆77Updated 2 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Updated 3 years ago
- Machine Learning eXperiment Utilities☆42Updated 3 months ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆197Updated last year
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆14Updated 2 years ago
- A Toolkit for Distributional Control of Generative Models☆69Updated last year
- ☆44Updated 2 months ago
- JAX/Flax implementation of the Hyena Hierarchy☆29Updated last year
- Language models scale reliably with over-training and on downstream tasks☆91Updated 5 months ago