Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre-train from scratch. We investigated if multilingual models could inherit these properties by making it an Efficient Transformer (s.a. the Longformer architecture).
☆35Aug 19, 2021Updated 4 years ago
Alternatives and similar repositories for Master-Thesis-Multilingual-Longformer
Users that are interested in Master-Thesis-Multilingual-Longformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 9 months ago
- This repository contains the code used for distillation and fine-tuning of compact biomedical transformers that have been introduced in t…☆19Mar 26, 2024Updated 2 years ago
- ☆14Aug 26, 2024Updated last year
- An NLP-suite powered by deep learning☆19Mar 24, 2023Updated 3 years ago
- [Nature Scientific Reports] Translating synthetic natural language to database queries: a polyglot deep learning framework☆26Jun 10, 2023Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14May 19, 2020Updated 5 years ago
- Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher m…☆26Feb 13, 2021Updated 5 years ago
- ☆22Sep 15, 2018Updated 7 years ago
- Code for ECIR 2022 paper Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking☆26Jul 30, 2024Updated last year
- AutoML library for Accurat, based on AutoKeras and Scikit-Learn.☆14Jun 21, 2022Updated 3 years ago
- Sythetic data generation and normalization functions powered by LLMs☆59Sep 19, 2024Updated last year
- A range of tools related to one-endpoint crossing graphs - parsing, format conversion, and evaluation☆11Nov 8, 2022Updated 3 years ago
- Cross language information retrieval pipeline☆19Jan 12, 2026Updated 2 months ago
- Clinical Text Summarization with Syntax-Based Negation and Semantic Concept Identification☆21Mar 3, 2020Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Fincausal 2020 shared task☆20Nov 1, 2020Updated 5 years ago
- Deep Multi-Speech model☆11Jul 25, 2018Updated 7 years ago
- ☆38Mar 27, 2022Updated 4 years ago
- Notebooks for docarray, Jina, Finetuner, and other products from Jina AI☆12Mar 31, 2022Updated 3 years ago
- Tutorials for the julia language☆12Feb 4, 2023Updated 3 years ago
- Repository for paper CELLS: A Parallel Corpus for Biomedical Lay Language Generation☆19Apr 2, 2024Updated last year
- zig build add-on (add more toolchains [LLVM-based] support)☆15May 5, 2025Updated 10 months ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆13Apr 27, 2022Updated 3 years ago
- Shell script to manage multiple Microsoft Teams profiles on Linux.☆12Mar 3, 2021Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆10Nov 2, 2023Updated 2 years ago
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- Longformer: The Long-Document Transformer☆2,188Feb 8, 2023Updated 3 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- PyTAIL - Interactive and Incremental Learning of NLP Models with Human in the Loop for Online Data☆13Dec 3, 2022Updated 3 years ago
- Code and data associated with our LREC 2018 and COLING 2018 papers on converting between emotion formats☆10Dec 15, 2022Updated 3 years ago
- ☆13Aug 2, 2023Updated 2 years ago
- Dutch abusive language data☆11Sep 23, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆11Jun 4, 2021Updated 4 years ago
- 基于BERT和MRC框架实现的嵌套命名实体识别☆19Mar 13, 2022Updated 4 years ago
- ☆53May 2, 2021Updated 4 years ago
- Just messing around with implementing data structures in Rust.☆14May 23, 2025Updated 10 months ago
- Collection of NLP model explanations and accompanying analysis tools☆143Jun 26, 2023Updated 2 years ago
- A repo for shared notebooks☆23Jan 20, 2023Updated 3 years ago
- Use the famous language model, xlnet, to do sequence tagging/ sequence labelling/ named entity recognition(NER) / noun extraction;☆18Sep 30, 2019Updated 6 years ago