Pretraining scripts for BART transformer model
☆12May 15, 2023Updated 2 years ago
Alternatives and similar repositories for kb_bart
Users that are interested in kb_bart are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- Yet Another Neural Machine Translation Toolkit☆179Mar 7, 2025Updated last year
- Personal information identification standard☆21Jan 24, 2024Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105May 20, 2022Updated 3 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Nov 10, 2020Updated 5 years ago
- Pre-training BART in Flax on The Pile dataset☆22Jul 24, 2021Updated 4 years ago
- zero-vocab or low-vocab embeddings☆18Jul 17, 2022Updated 3 years ago
- TNT-KID: Transformer-based Neural Tagger for Keyword Identification☆11Jul 25, 2024Updated last year
- ☆15Sep 20, 2018Updated 7 years ago
- COMET for African languages☆11Jan 24, 2025Updated last year
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- BSRGAN-Pip: Packaged version of the BSRGAN repository☆14Jan 6, 2023Updated 3 years ago
- Named Entity Recognition in Nepali Language☆10Jan 12, 2023Updated 3 years ago
- The code repository for the paper "Dimsum @LaySumm 20: BART-based Approach for Scientific Document Summarization".☆24Nov 12, 2020Updated 5 years ago
- ML Reproducibility Challenge 2020: Electra reimplementation using PyTorch and Transformers☆12Apr 16, 2021Updated 4 years ago
- Code and data for "Heterogeneous Supervised Topic Models"☆10Jun 27, 2022Updated 3 years ago
- ☆13Jul 10, 2020Updated 5 years ago
- Gradient accumulation on tf.estimator☆12Dec 15, 2020Updated 5 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- Neural Paraphrase Generation based on OpenNMT-py☆12Jan 2, 2018Updated 8 years ago
- The pre-assignment for data science internship applicants☆18Feb 1, 2024Updated 2 years ago
- List of corpora annotated for coreference for different languages☆18Aug 8, 2024Updated last year
- Calculates bounds on the sofa moving problem☆14Sep 12, 2019Updated 6 years ago
- Code for Neural Coreference Resolution for Arabic☆12May 12, 2022Updated 3 years ago
- Recognize text using Calamari OCR and the OCR-D framework☆16May 13, 2025Updated 10 months ago
- Experiments for XLM-V Transformers Integeration☆13Feb 8, 2023Updated 3 years ago
- Medium article "API Gateway for your Microservices"☆19Sep 12, 2024Updated last year
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆48Dec 28, 2022Updated 3 years ago
- Repo & Project for the Imminent Research Grant code & tasks☆12May 20, 2024Updated last year
- Hanja Understanding Evaluation Dataset☆15May 2, 2022Updated 3 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- ☆11Aug 29, 2022Updated 3 years ago
- ☆11Apr 2, 2024Updated last year
- Keras implementation of YOLOv2 refer to Andrew Ng☆11Feb 14, 2018Updated 8 years ago
- ☆15Jun 10, 2018Updated 7 years ago
- Reactive Multi-language Gradio App with minimal effort☆21Oct 12, 2025Updated 5 months ago
- Code for Auditing Data Provenance in Text-Generation Models (in KDD 2019)☆10Jun 18, 2019Updated 6 years ago
- Thai word segmentation using deep learning☆14Jul 1, 2019Updated 6 years ago
- The repo for ACL2021 findings paper - Don't Miss the Labels: Label-semantic Argumented Meta-Learner for Few-Shot Text Classification☆15Mar 24, 2022Updated 4 years ago