A minimal TF2 re-implementation of the OpenAI GPT training
☆58Sep 1, 2021Updated 4 years ago
Alternatives and similar repositories for minGPT-TF
Users that are interested in minGPT-TF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Oct 22, 2022Updated 3 years ago
- GPT-3 attempts to predict & balance chemical reactions☆13Aug 2, 2020Updated 5 years ago
- Fastai implementation of @karpathy's miniGPT library☆15Aug 24, 2020Updated 5 years ago
- Adds timm pretrained backbone to pytorch's FasterRcnn model☆12Jan 25, 2024Updated 2 years ago
- This repository demonstrate training T5 transformers using tensorflow 2☆14Oct 1, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆16Dec 14, 2022Updated 3 years ago
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering☆36Apr 20, 2021Updated 4 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Mar 10, 2021Updated 5 years ago
- GPT-jax based on the official huggingface library☆13Jun 22, 2021Updated 4 years ago
- chinese wwm masking and ngram masking based on jieba☆11Jul 25, 2019Updated 6 years ago
- This is the idea from scikit-learn to implement the task of multi-label for Chinese text.☆13Apr 17, 2017Updated 8 years ago
- ☆15Apr 22, 2023Updated 2 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆25Aug 31, 2024Updated last year
- Procedural object generation for robotic manipulation☆11Oct 6, 2018Updated 7 years ago
- 京东“如期而至”比赛第8名日期模型单模型方案☆17Aug 17, 2018Updated 7 years ago
- ☆11Aug 12, 2020Updated 5 years ago
- 统计微信朋友圈送出的赞票与得到的赞票人员比例☆11May 3, 2016Updated 9 years ago
- 문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.☆24Sep 6, 2023Updated 2 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- GRPO Training Script for Qwen Model on GSM8K Dataset. This script trains a Qwen model using the GRPO (Generalized Reinforcement Policy Op…☆29Dec 11, 2025Updated 3 months ago
- ☆10Mar 28, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…☆434Jun 14, 2023Updated 2 years ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 3 months ago
- gen0 gazebo simulation based on ROS2☆12Nov 13, 2025Updated 4 months ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- Solving MuJoCo environments with Deep Deterministic Policy Gradients☆14Sep 17, 2018Updated 7 years ago
- Search kaggle competitions and solutions based on data and predict type, evaluation metric, etc.☆13Apr 29, 2023Updated 2 years ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Aug 22, 2022Updated 3 years ago
- Opensource chatbot framework☆16Aug 1, 2021Updated 4 years ago
- ICP implementation in Rust☆15Jun 27, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Lidar Obstacle Detection using RANSAC and DBSCAN☆16Jun 11, 2024Updated last year
- Try Metapost quickly and easily with our online sandbox application!☆11Feb 14, 2026Updated last month
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 3 years ago
- A tf.estimator version of GPT2☆27Jan 29, 2022Updated 4 years ago
- ↔️ Utilizing RBERT model structure for KLUE Relation Extraction task☆15Nov 15, 2022Updated 3 years ago
- ☆10Apr 8, 2020Updated 5 years ago
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago