boun-tabi / SQuAD-TR
☆10Updated 10 months ago
Alternatives and similar repositories for SQuAD-TR:
Users that are interested in SQuAD-TR are comparing it to the libraries listed below
- Text Classification Dataset for Turkish Language☆10Updated 3 years ago
- Using short models to classify long texts☆21Updated 2 years ago
- Repo for Turkish Wiki NER dataset.☆11Updated last year
- Pre-train Static Word Embeddings☆55Updated this week
- spaCyTurk - trained models & pipelines for Turkish☆19Updated 2 years ago
- ☆44Updated last month
- The official implementation for the paper GECTurk: Grammatical Error Correction and Detection Dataset for Turkish☆29Updated last year
- Official implementation of "GPT or BERT: why not both?"☆52Updated last month
- mT5 model for question answering and question generation☆27Updated 4 years ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆64Updated 2 months ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- ☆22Updated 3 years ago
- State-of-the-art NLP tools for Turkish☆70Updated last year
- Generalist and Lightweight Model for Text Classification☆119Updated last week
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆34Updated this week
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆127Updated 4 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆57Updated 8 months ago
- Question and answer retrieval in Turkish with BERT☆14Updated 3 years ago
- Automated question generation and question answering from Turkish texts using text-to-text transformers☆47Updated 2 years ago
- ☆11Updated 4 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 4 months ago
- triple-encoders is a library for contextualizing distributed Sentence Transformers representations.☆14Updated 7 months ago
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆32Updated 11 months ago
- LTG-Bert☆32Updated last year
- ☆44Updated 2 months ago
- ☆59Updated last year
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆18Updated last week
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆26Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year