TurkuNLP/wikibert

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TurkuNLP/wikibert)

TurkuNLP / wikibert

BERT models for many languages created from Wikipedia texts

☆33

Alternatives and similar repositories for wikibert

Users that are interested in wikibert are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Data-Intelligence-Lab / DEFT-korean-alpaca
View on GitHub
☆23Oct 30, 2023Updated 2 years ago
jiacheng-xu / sum-interpret
View on GitHub
Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)
☆13Jun 2, 2021Updated 5 years ago
Knight-H / thai-lm
View on GitHub
☆14Dec 23, 2024Updated last year
m3hrdadfi / albert-persian-lab
View on GitHub
ALBERT Persian Playground
☆13Jun 12, 2023Updated 3 years ago
MrBananaHuman / KoreanCharacterBert
View on GitHub
Korean BERT model using character tokenizer
☆27Apr 8, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jeongukjae / namuwiki-corpus
View on GitHub
문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.
☆19Jun 16, 2021Updated 5 years ago
dice-group / hypertrie
View on GitHub
A monolithic index that supports worst-case optimal joins (WCOJ) by providing all collation orders in a single redundancy eliminating dat…
☆18Updated this week
enlipleai / korquad-challenge
View on GitHub
https://challenge.enliple.com/
☆16Jun 10, 2020Updated 6 years ago
MrBananaHuman / open-korean-instructions
View on GitHub
언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.
☆19Jul 16, 2023Updated 3 years ago
j-min / WikiExtractor_To_the_one_text
View on GitHub
Simple extension of WikiExtractor(https://github.com/attardi/wikiextractor)
☆16Dec 23, 2016Updated 9 years ago
fujimotos / TinyFastSS
View on GitHub
An index data structure for approximate string search.
☆23May 6, 2019Updated 7 years ago
Beomi / transformers-language-modeling
View on GitHub
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
☆23May 20, 2021Updated 5 years ago
yuanbit / jina-financial-qa-search
View on GitHub
☆69Feb 4, 2021Updated 5 years ago
nlpai-lab / Korean-CommonGen
View on GitHub
[Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation
☆11May 27, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
AranKomat / adapinp
View on GitHub
Unofficial implementation of Adaptive Input in PyTorch
☆12Feb 22, 2019Updated 7 years ago
veer66 / Yaitron
View on GitHub
Yaitron English-Thai and Thai-English dictionary
☆34Oct 13, 2020Updated 5 years ago
seujung / t5-summarization
View on GitHub
☆25Oct 28, 2020Updated 5 years ago
monologg / KoCharELECTRA
View on GitHub
Character-level Korean ELECTRA Model (음절 단위 한국어 ELECTRA)
☆55Jun 12, 2023Updated 3 years ago
ModuNLP / hacking_transformers
View on GitHub
☆11Aug 12, 2020Updated 5 years ago
zzaebok / ksenticnet
View on GitHub
KSenticNet: 한국어 감성 사전
☆33May 20, 2019Updated 7 years ago
yseokchoi / SejongTree2Dependency
View on GitHub
세종 구문 분석 말뭉치의 의존 구문 구조로의 변환 도구
☆10Sep 7, 2018Updated 7 years ago
PyThaiNLP / nlpforthai.com
View on GitHub
NLP For Thai
☆26Oct 18, 2024Updated last year
snunlp / KR-BERT-MEDIUM
View on GitHub
Expanded KR-BERT by adding more training data
☆13Apr 23, 2021Updated 5 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ehsanasgari / 1000Langs
View on GitHub
Creating super-parallel corpora of more than 1500+ unique languages for NLP research
☆33Dec 8, 2022Updated 3 years ago
SOMJANG / Youtube_Comment_Crawler
View on GitHub
유튜브 댓글 크롤러 ( Python, BeautifulSoup, Selenium )
☆35Sep 13, 2022Updated 3 years ago
monologg / korean-hate-speech-koelectra
View on GitHub
Bias, Hate classification with KoELECTRA 👿
☆27Jun 12, 2023Updated 3 years ago
snunlp / KR-ELECTRA
View on GitHub
KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch
☆15Feb 13, 2022Updated 4 years ago
tmu-nlp / ThaiToxicityTweetCorpus
View on GitHub
☆12Dec 14, 2020Updated 5 years ago
uma-pi1 / OPIEC-pipeline
View on GitHub
☆14Feb 26, 2022Updated 4 years ago
snunlp / KR-BERT-KOSAC
View on GitHub
Expanded KR-BERT for Sentiment Analysis
☆13Apr 23, 2021Updated 5 years ago
ModuNLP / weekly-meeting
View on GitHub
매주 목요일, 20:00 모임
☆16Jul 24, 2020Updated 5 years ago
aaronmueller / clams
View on GitHub
Syntactic evaluation sets, attribute-varying grammars, and code for replicating the CLAMS paper. ACL 2020.
☆17Nov 26, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
songys / Question_pair
View on GitHub
#Paired Question
☆24Jun 16, 2020Updated 6 years ago
xlhex / dpe
View on GitHub
☆22Oct 26, 2020Updated 5 years ago
jmyrberg / finnlem
View on GitHub
Neural network based lemmatizer for Finnish language
☆11Sep 10, 2020Updated 5 years ago
PyThaiNLP / thai-synonym
View on GitHub
The synonym for thai (open source & open data)
☆18Dec 6, 2023Updated 2 years ago
YongWookHa / kor-text-preprocess
View on GitHub
Korean text data preprocess toolkit for NLP
☆18Jun 11, 2019Updated 7 years ago
hyunwoongko / beyond-lm
View on GitHub
Beyond LM: How can language model go forward in the future?
☆15Apr 30, 2023Updated 3 years ago
lovit / KoBERTScore
View on GitHub
BERTScore for Korean
☆81Feb 22, 2024Updated 2 years ago