Efficient Attention for Long Sequence Processing
☆98Dec 17, 2023Updated 2 years ago
Alternatives and similar repositories for convert_checkpoint_to_lsg
Users that are interested in convert_checkpoint_to_lsg are comparing it to the libraries listed below.
- 31st place silver medal solution to USPPPM Kaggle competition☆20Jun 23, 2022Updated 3 years ago
- ☆160Jan 15, 2022Updated 4 years ago
- ☆19Sep 19, 2022Updated 3 years ago
- 🎖️ 4th place solution in the Feedback Prize Competition🎖️☆74Mar 19, 2022Updated 4 years ago
- ☆10May 1, 2025Updated last year
- Accompanying code for our EMNLP 2017 publication "Bringing Structure into Summaries: Crowdsourcing a Benchmark Corpus of Concept Maps"☆13Dec 5, 2017Updated 8 years ago
- ☆40Mar 30, 2022Updated 4 years ago
- 1st solution☆39Oct 4, 2022Updated 3 years ago
- Early solution for Google AI4Code competition☆76May 26, 2022Updated 4 years ago
- https://www.nlp.ecei.tohoku.ac.jp/projects/aio/☆16Aug 4, 2022Updated 3 years ago
- Japanese NER with Transformers + PyTorch-Lightning + MLflow Tracking☆15Nov 20, 2022Updated 3 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 11 months ago
- Presentation documents related to OpenNMT talks or events☆14Mar 13, 2018Updated 8 years ago
- ☆11May 24, 2024Updated 2 years ago
- Rank 5 solution to the User-Profile-Based Product Recommendation Challenge☆25Sep 22, 2021Updated 4 years ago
- Neural information retrieval / semantic search / bi-encoders☆176Aug 5, 2023Updated 2 years ago
- ☆26Aug 16, 2021Updated 4 years ago
- Code for our Paper, 'Summaformers @ LaySumm 20, LongSumm 20' at EMNLP 2020, Scholarly Document Processing Workshop☆12Feb 10, 2021Updated 5 years ago
- NativeScript plugin for Android & iOS that creates beautiful navigation tabs☆11Apr 23, 2019Updated 7 years ago
- Topic Inference with Zeroshot models☆61Jun 12, 2023Updated 3 years ago
- Datasets related to law and court precedents☆52Jan 8, 2025Updated last year
- A classification model☆21Apr 24, 2022Updated 4 years ago
- Pre-training Language Models for Japanese☆50Jul 2, 2023Updated 2 years ago
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning☆61Jun 12, 2023Updated 3 years ago
- This repository contains code that was used to generate the first place solution in the CommonLit Readability Prize☆69Aug 17, 2021Updated 4 years ago
- Long-Span Summarization (ACL2021)☆23Jan 19, 2023Updated 3 years ago
- ⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy☆33Nov 23, 2021Updated 4 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Apr 15, 2024Updated 2 years ago
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆31Jun 17, 2024Updated 2 years ago
- Applying progressive resizing to building models in Keras.☆18Apr 28, 2019Updated 7 years ago
- EDGAR10-Q Dataset and implementation of the paper Context NER☆17Sep 29, 2023Updated 2 years ago
- CLIP (Contrastive Language–Image Pre-training) trained on Indonesian data☆19Dec 4, 2021Updated 4 years ago
- Replication code for "The donut effect: How Covid-19 shapes real estate"☆12Dec 14, 2022Updated 3 years ago
- Solution for the Foursquare - Location Matching competition☆14Jul 8, 2022Updated 3 years ago
- ☆34May 1, 2025Updated last year
- 🧮 Algebraic Positional Encodings.☆21Jun 5, 2026Updated 2 weeks ago
- Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning (CVPR 2025, pytorch co…☆13Sep 29, 2025Updated 8 months ago
- This is the implementation of the 4th place solution (yu4u's part) for RSNA 2024 Lumbar Spine Degenerative Classification at Kaggle.☆10Oct 11, 2024Updated last year