Efficient Attention for Long Sequence Processing
☆98Dec 17, 2023Updated 2 years ago
Alternatives and similar repositories for convert_checkpoint_to_lsg
Users that are interested in convert_checkpoint_to_lsg are comparing it to the libraries listed below
- 31st place silver medal solution to USPPPM Kaggle competition☆20Jun 23, 2022Updated 3 years ago
- ai4code competition source code☆19Aug 12, 2022Updated 3 years ago
- ☆160Jan 15, 2022Updated 4 years ago
- ☆19Sep 19, 2022Updated 3 years ago
- 🎖️ 4th place solution in the Feedback Prize Competition🎖️☆74Mar 19, 2022Updated 4 years ago
- ☆40Mar 30, 2022Updated 3 years ago
- 1st solution☆39Oct 4, 2022Updated 3 years ago
- Using short models to classify long texts☆21Mar 8, 2023Updated 3 years ago
- 2023 Kaggle LECR gold medal (Top 3) training code☆28Mar 16, 2023Updated 3 years ago
- Early solution for Google AI4Code competition☆76May 26, 2022Updated 3 years ago
- 3rd Place solution for Feedback Prize - Predicting Effective Arguments Kaggle competition☆16Sep 6, 2022Updated 3 years ago
- A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple desig…☆21Jan 27, 2024Updated 2 years ago
- This is the solution for 2nd rank in Kaggle competition: Feedback Prize - Evaluating Student Writing.☆49Mar 31, 2022Updated 3 years ago
- Japanese NER with Transformers + PyTorch-Lightning + MLflow Tracking☆15Nov 20, 2022Updated 3 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 8 months ago
- Presentation documents related to OpenNMT talks or events☆14Mar 13, 2018Updated 8 years ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Aug 6, 2023Updated 2 years ago
- ☆18Apr 25, 2021Updated 4 years ago
- Rank 5 solution for the user-profile-based product recommendation challenge☆25Sep 22, 2021Updated 4 years ago
- Neural information retrieval / Semantic search / Bi-encoders☆174Aug 5, 2023Updated 2 years ago
- PyTorch reimplementation of the paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization"☆16Oct 17, 2021Updated 4 years ago
- Code for our Paper, 'Summaformers @ LaySumm 20, LongSumm 20' at EMNLP 2020, Scholarly Document Processing Workshop☆12Feb 10, 2021Updated 5 years ago
- Topic Inference with Zeroshot models☆61Jun 12, 2023Updated 2 years ago
- A classification model☆21Apr 24, 2022Updated 3 years ago
- ☆23Oct 21, 2022Updated 3 years ago
- Chinese text readability grading model based on multi-level linguistic feature fusion☆12Feb 27, 2024Updated 2 years ago
- Pile Deduplication Code☆18May 15, 2023Updated 2 years ago
- ☆21Oct 13, 2021Updated 4 years ago
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning☆61Jun 12, 2023Updated 2 years ago
- ⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy☆32Nov 23, 2021Updated 4 years ago
- This repository contains code that was used to generate the first place solution in the CommonLit Readability Prize☆69Aug 17, 2021Updated 4 years ago
- Long-Span Summarization (ACL2021)☆23Jan 19, 2023Updated 3 years ago
- Part of 3rd place solution for Kaggle's Tweet Sentiment Extraction☆38Jun 25, 2020Updated 5 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Apr 15, 2024Updated last year
- This repo contains my hackathon solutions☆39Jun 21, 2022Updated 3 years ago
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Jun 17, 2024Updated last year
- 🧮 Algebraic Positional Encodings.☆18Aug 20, 2025Updated 7 months ago
- Applying progressive resizing to building models in Keras.☆18Apr 28, 2019Updated 6 years ago