Chinese version of GPT2 training code, using BERT or BPE tokenizer.
☆18Oct 30, 2019Updated 6 years ago
Alternatives and similar repositories for GPT2-Chinese
Users that are interested in GPT2-Chinese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codes for "EDG-based Question Decomposition for Complex Question Answering over Knowledge Bases"☆13Nov 12, 2021Updated 4 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 7 years ago
- GeoViz: A Multi-View Visual Platform for Spatio-temporal Knowledge Graph☆13May 13, 2024Updated 2 years ago
- gensim-fast2vec改造、灵活使用大规模外部词向量(具备OOV查询能力)☆23Jun 3, 2019Updated 7 years ago
- 抽取式摘要抽取算法(1、抽取式 2、生成式)☆16Oct 13, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for paper "Improving Generalizability of Graph Anomaly Detection Models via Data Augmentation" (TKDE 2023)☆16Dec 4, 2025Updated 6 months ago
- Finetune CPM-1 For Text Generation☆18Jul 9, 2021Updated 4 years ago
- The official Implementation for TKDE paper "Individual and Structural Graph Information Bottlenecks for Out-of-Distribution Generalizatio…☆14Aug 6, 2023Updated 2 years ago
- Mutual Information Neural Estimation (Pytorch)☆13Jun 29, 2020Updated 6 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆15Sep 4, 2019Updated 6 years ago
- This repository is the official implementation of "Dynamic Graph Information Bottleneck (DGIB)" accepted by the research tracks of The We…☆16Jan 27, 2024Updated 2 years ago
- A tool to help understand how GNN/BERT works by attention☆15Jul 5, 2020Updated 5 years ago
- New structural distributional shifts for evaluating graph models☆16Oct 25, 2023Updated 2 years ago
- 基于知识图谱的问答系统☆13Jun 30, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [NeurIPS 2023] "Unleashing the Power of Graph Data Augmentation on Covariate Distribution Shift" by Yongduo Sui, Qitian Wu, Jiancan Wu, Q…☆16Nov 6, 2023Updated 2 years ago
- The official code for the "System Combination via Quality Estimation for Grammatical Error Correction" paper, published in EMNLP 2023.☆16Jan 24, 2026Updated 5 months ago
- 这是一个专门教你使用xgb,lgb的github-【baseline构建】☆13Aug 7, 2019Updated 6 years ago
- Unsupervised Key-phrase Extraction and Clustering for Classification Scheme in Scientific Publications.☆19May 24, 2021Updated 5 years ago
- A simple Keras implementation of Paper "Text Matching as Image Recognition"☆27Jun 28, 2023Updated 3 years ago
- Our code for ICLR'24 paper "Energy-based Automated Model Evaluation".☆24Feb 13, 2025Updated last year
- Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation☆15Aug 28, 2020Updated 5 years ago
- The code of CIKM 2023 short paper : Bridging the KB-Text Gap: Leveraging Structured Knowledge-aware Pre-training for KBQA☆20Jul 19, 2024Updated last year
- Code for synchronising all CHiME-5 audio signals for use in CHiME-6☆18Dec 2, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆33Nov 27, 2021Updated 4 years ago
- TIANCHI天池-OGeek算法挑战赛(亚军)☆28Aug 20, 2019Updated 6 years ago
- The implementation of "Joint Learning of Label and Environment Causal Independence for Graph Out-of-Distribution Generalization" (NeurIPS…☆22Nov 4, 2024Updated last year
- Posterior Control of Blackbox Generation☆23May 2, 2020Updated 6 years ago
- ☆22Oct 10, 2022Updated 3 years ago
- LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct微调(transformers)/LORA(peft)/推理, 支持中文(chinese, zh)☆34May 17, 2024Updated 2 years ago
- Repository for CPU Kernel Generation for LLM Inference☆28Jul 13, 2023Updated 2 years ago
- ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Template☆23Jul 10, 2023Updated 2 years ago
- dan povey's local copy of kadi-asr/kaldi☆19Nov 10, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆24Jun 17, 2020Updated 6 years ago
- [NeurIPS 2023] Does Invariant Graph Learning via Environment Augmentation Learn Invariance?☆23Aug 25, 2024Updated last year
- ☆32Oct 13, 2021Updated 4 years ago
- graph neural networks, information theory, AI for Sciences☆23Apr 6, 2022Updated 4 years ago
- Code for EMNLP 20: Coarse-to-Fine Query Focused Multi-Document Summarization.☆24Feb 9, 2021Updated 5 years ago
- Code for the SIGIR 2020 paper "A Unified Dual-view Model for Review Summarization and Sentiment Classification with Inconsistency Loss"☆21Feb 3, 2021Updated 5 years ago
- The code repo of Jingwei, reconstructing global ocean oxygen changes with hybrid graph learning.☆21Apr 7, 2025Updated last year