projectbenyehuda / public_domain_dumpView external linksLinks
Dump of Project Ben-Yehuda's public domain texts
☆31Oct 26, 2025Updated 3 months ago
Alternatives and similar repositories for public_domain_dump
Users that are interested in public_domain_dump are comparing it to the libraries listed below
Sorting:
- AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.☆26Dec 1, 2022Updated 3 years ago
- Hebrew oriented NER spaCy pipeline☆21Aug 8, 2024Updated last year
- Neural Sentiment Analyzer for Modern Hebrew☆43Aug 5, 2020Updated 5 years ago
- Hebrew word lists☆49Oct 27, 2024Updated last year
- ☆12Feb 11, 2019Updated 7 years ago
- A comprehensive list of Hebrew NLP resources.☆286May 11, 2025Updated 9 months ago
- HeBERT: Pre-training BERT for modern Hebrew☆81Jun 15, 2023Updated 2 years ago
- A question answering dataset in Modern Hebrew, containing 30,147 questions.☆25Dec 5, 2024Updated last year
- Repository for the paper titled: "When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer"☆13Nov 10, 2021Updated 4 years ago
- This is an open-source effort for making Hebrew properly searchable by various IR software libraries, while maintaining decent recall, pr…☆105Jan 4, 2023Updated 3 years ago
- the library for otzaria app, with the scripts that created it☆24Feb 1, 2025Updated last year
- ☆57Mar 18, 2022Updated 3 years ago
- Tool for disambiguating acronyms and abbreviations in text for NLP applications☆22Dec 18, 2025Updated last month
- A field-tested Hebrew tokenizer for dirty texts (ben-yehuda project, bible, cc100, mc4, opensubs, oscar, twitter) focused on multi-word e…☆23Aug 13, 2022Updated 3 years ago
- Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 made avilable via TPU Research Cloud Program.☆22Jul 6, 2022Updated 3 years ago
- A curated list of resources for NLP (Natural Language Processing) for Hebrew☆109Jan 13, 2023Updated 3 years ago
- The Vision and goals of the Open Natural Language Processing in Hebrew Project☆110Oct 12, 2018Updated 7 years ago
- A modern app that brings the jewish library to every device☆24Updated this week
- ☆15Updated this week
- protein embedding project☆12May 3, 2018Updated 7 years ago
- NOT READY FOR PRODUCTION.☆13Jan 7, 2024Updated 2 years ago
- Hebrew-translated Disassembly of Pokémon Red/Blue☆11Sep 23, 2021Updated 4 years ago
- An NLP pipeline for Hebrew☆41Jun 16, 2025Updated 7 months ago
- Anki addon for reviewing with mouse☆11Dec 7, 2025Updated 2 months ago
- ☆37Jun 12, 2023Updated 2 years ago
- python camouflager, rename all your project's names (variables, function, modules, files, etc)☆10Apr 16, 2017Updated 8 years ago
- Official implementation of a temporal pupil light response model proposed in the Scientific Reports article: "Deep learning-based pupil m…☆11Jan 6, 2023Updated 3 years ago
- Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested …☆10Dec 27, 2021Updated 4 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- Analysis on stop reasons☆10Jun 17, 2024Updated last year
- This repository contains material of a teaching innovation project in Universitat de Barcelona: "Intelligent Support System for Tutor of …☆10Jun 30, 2020Updated 5 years ago
- Conversation view user interface for Yamot's SMS system.☆17Oct 31, 2025Updated 3 months ago
- Analysis of gutenberg dataset☆44Dec 22, 2018Updated 7 years ago
- New Interfaces for Jewish Texts☆730Feb 8, 2026Updated last week
- Repository for Booking.com Data Challenge 6th Place Solution☆10Feb 17, 2021Updated 4 years ago
- ☆12Dec 4, 2020Updated 5 years ago
- decentralized, tag-based textboard - 2020.10.31 . production paused 2021.07.01 due to criminal harassment☆10Jul 19, 2022Updated 3 years ago
- This gender detection model recognizes man and women in real time with a good accuracy rate especially in traffics and roads, using deep …☆13Jun 29, 2022Updated 3 years ago
- Code for "Proposition-Level Clustering for Multi-Document Summarization" paper☆10Apr 5, 2024Updated last year