Finetune GPT2 for text summarization
☆17Aug 16, 2021Updated 4 years ago
Alternatives and similar repositories for GPT2_Summarization
Users that are interested in GPT2_Summarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Flan T5 LLM fine-tuning, by attaching a regression model last hidden layers activations. Runs on colab with A100 40gb☆12Mar 24, 2023Updated 3 years ago
- Benchmarking various Deep Learning models such as BERT, ALBERT, BiLSTMs on the task of sentence entailment using two datasets - MultiNLI …☆28Dec 31, 2020Updated 5 years ago
- [ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)☆46Apr 7, 2026Updated last week
- A side project that follows all the acceleration tricks in tinyllama, with the minimal modification to the huggingface transformers code.☆13Sep 2, 2024Updated last year
- This is the implementation of paper "Learning to Ask Conversational Questions by Optimizing Levenshtein Distance".☆10Jul 5, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆22Dec 1, 2021Updated 4 years ago
- ☆10Mar 29, 2022Updated 4 years ago
- Unsupervised Key-phrase Extraction and Clustering for Classification Scheme in Scientific Publications.☆19May 24, 2021Updated 4 years ago
- Code for Pushdown Layers from our EMNLP 2023 paper☆29Dec 3, 2023Updated 2 years ago
- A Bert2Bert model which able to generate headlines!☆12Nov 16, 2020Updated 5 years ago
- Code for Repl4NLP paper "A Cross-Task Analysis of Text Span Representations"☆21Nov 4, 2022Updated 3 years ago
- simple telegram bot to extract text from images. ocr scanner bot based on ocr-space api.☆10Nov 4, 2022Updated 3 years ago
- Individual Coefficient Approximation for Risk Estimation (ICARE) model☆18Sep 9, 2023Updated 2 years ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆17May 19, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- ☆14Jul 5, 2023Updated 2 years ago
- The official implantation of SGPT (CVPR2024)☆17Jul 15, 2024Updated last year
- ☆17Jul 24, 2025Updated 8 months ago
- ☆22Jun 14, 2024Updated last year
- Check storage of products in Chiikawa market.☆12Jan 10, 2025Updated last year
- [ACL 2025 Main] Code and data for paper "Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?"☆22Jun 18, 2025Updated 9 months ago
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆28Mar 26, 2024Updated 2 years ago
- Initializing neural networks for hierarchical multi-label text classification☆11Mar 1, 2019Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering (Kim et al., ACL 2021)☆32Jan 2, 2023Updated 3 years ago
- Radiation Oncology NLP Database☆26Oct 9, 2024Updated last year
- My vocoder experiments☆31Jul 26, 2025Updated 8 months ago
- FlowDelta: Modeling Flow Information Gain in Reasoning for Conversational Machine Comprehension☆35Oct 4, 2022Updated 3 years ago
- Code for ACL2020 paper "Heterogeneous Graph Neural Networks for Extractive Document Summarization"☆248Apr 4, 2024Updated 2 years ago
- A repository of example implementations for interesting ml concepts☆28Jan 9, 2023Updated 3 years ago
- This repository lists papers, codes, and datasets in Biomedical Text Summarisation based on PLM☆23Oct 4, 2022Updated 3 years ago
- PyTorch implementation of a multi-task, weak supervision framework for abnormality localization in large, volumetric images.☆23Nov 22, 2022Updated 3 years ago
- microsoft unilm-v1 compatible with huggingface transformers☆22Apr 5, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Executive Memory for Coherent Long-Horizon Reasoning!☆82Jan 14, 2026Updated 3 months ago
- The code of the AAAI-20 paper "Attractive or Faithful? Popularity-Reinforced Learning for Inspired Headline Generation".☆13Aug 28, 2022Updated 3 years ago
- A mirror of the Open Risk white paper collection☆10Nov 11, 2025Updated 5 months ago
- Matlab toolbox for polyenergetic quantitative (polyquant) X-ray CT reconstruction with demos.☆29Jun 3, 2019Updated 6 years ago
- ☆24Mar 24, 2022Updated 4 years ago
- 使用LSTM进行端到端的语义角色标注(theano)☆55Dec 9, 2019Updated 6 years ago
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…☆43Apr 17, 2024Updated last year