AzBuki-ML/public-data

AzBuki-ML / public-data

Custom-built Bulgarian language data sets, used by АзБуки.ML for sentiment analysis, text classification, summarisation and generation. Open-source & free to use in any ML project.

☆19

Alternatives and similar repositories for public-data

Users who are interested in public-data are comparing it to the repositories listed below.

delyan-boychev / imaginet
☆11 · Apr 25, 2026 · Updated 2 months ago
nilesc / Long-Structured-Debate-Generation-and-Evaluation
☆13 · Dec 8, 2022 · Updated 3 years ago
changwoolee / BLAST
[NeurIPS 2024] BLAST: Block Level Adaptive Structured Matrix for Efficient Deep Neural Network Inference
☆18 · Nov 6, 2024 · Updated last year
zaemyung / wikiextractor
A tool for extracting plain text from Wikipedia dumps
☆15 · Oct 3, 2019 · Updated 6 years ago
slone-nlp / myv-nmt
☆29 · Jan 13, 2026 · Updated 6 months ago
ozekri / SEPO
Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"
☆32 · May 19, 2025 · Updated last year
joyheyueya / giants
☆28 · Jun 1, 2026 · Updated last month
RMLio / yarrrml-parser
A YARRRML parser library and CLI in Javascript
☆56 · Jun 12, 2026 · Updated last month
nalgeon / iuliia
Transliterate Cyrillic → Latin in every possible way
☆73 · Jan 4, 2025 · Updated last year
OP-TED / ePO
The eProcurement Ontology provides the formal, semantic foundation for the creation and reuse of linked open data in the domain of public…
☆77 · Oct 21, 2025 · Updated 9 months ago
oxigraph / rio
RDF parsers library
☆87 · Apr 11, 2026 · Updated 3 months ago
psb1558 / Joscelyn-font
An authentic secretary hand font
☆95 · May 16, 2025 · Updated last year
kuleshov-group / remdm
Remasking Discrete Diffusion Models with Inference-Time Scaling
☆77 · Feb 7, 2026 · Updated 5 months ago
machine-intelligence-laboratory / TopicNet
Interface for easier topic modelling.
☆143 · Jul 29, 2024 · Updated last year
AlexanderMandera / arduino-wch32v003
Arduino Core for CH32V003 RISC-V microcontroller
☆194 · Nov 27, 2024 · Updated last year
FasterDecoding / TEAL
☆167 · Feb 15, 2025 · Updated last year
jhlau / topic_interpretability
Computation of the semantic interpretability of topics produced by topic models.
☆180 · Apr 19, 2017 · Updated 9 years ago
dice-group / Palmetto
Palmetto is a quality measuring tool for topics
☆226 · Mar 20, 2026 · Updated 4 months ago
natasha / navec
Compact high quality word embeddings for Russian language
☆218 · Apr 13, 2026 · Updated 3 months ago
natasha / slovnet
Deep Learning based NLP modeling for Russian language
☆248 · Jul 24, 2023 · Updated 3 years ago
OpenGVLab / OmniQuant
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
☆903 · Nov 26, 2025 · Updated 7 months ago
RMLio / rmlmapper-java
The RMLMapper executes RML rules to generate high quality Linked Data from multiple originally (semi-)structured data sources
☆201 · Feb 17, 2026 · Updated 5 months ago
pchampin / sophia_rs
Sophia: a Rust toolkit for RDF and Linked Data
☆325 · May 20, 2026 · Updated 2 months ago
natasha / yargy
Rule-based facts extraction for Russian language
☆334 · Apr 13, 2026 · Updated 3 months ago
hermes-webui / hermes-swift-mac
The best way to run Hermes on your Mac!
☆407 · Updated this week
denull / Az.js
A NLP library for Russian language
☆366 · Apr 2, 2024 · Updated 2 years ago
tc39 / proposal-error-cause
TC39 proposal for accumulating errors
☆376 · Oct 26, 2021 · Updated 4 years ago
tracel-ai / models
Models and examples built with Burn
☆375 · Apr 28, 2026 · Updated 2 months ago
Wikidata-Toolkit / Wikidata-Toolkit
Java library to interact with Wikibase
☆411 · Jul 6, 2026 · Updated 2 weeks ago
KinglittleQ / GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
☆374 · Dec 8, 2022 · Updated 3 years ago
syang1993 / gst-tacotron
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
☆367 · Dec 6, 2018 · Updated 7 years ago
eclipse-rdf4j / rdf4j
Eclipse RDF4J: scalable RDF for Java
☆408 · Updated this week
dllm-reasoning / d1
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆453 · Jan 26, 2026 · Updated 5 months ago
bigartm / bigartm
Fast topic modeling platform
☆674 · Feb 5, 2026 · Updated 5 months ago
awizemann / scarf
Native macOS and iOS App for the Hermes AI agent — multi-window, multi-server (local + remote over SSH). Chat, dashboard, sessions, memor…
☆754 · Updated this week
nmntz / bloomz.cpp
C++ implementation for BLOOM
☆811 · May 13, 2023 · Updated 3 years ago
MCUdude / MiniCore
Arduino hardware package for ATmega8, ATmega48, ATmega88, ATmega168, ATmega328 and ATmega328PB
☆1,149 · Apr 19, 2026 · Updated 3 months ago
MIND-Lab / OCTIS
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
☆803 · Jun 21, 2026 · Updated last month
kuleshov-group / mdlm
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
☆702 · Sep 29, 2025 · Updated 9 months ago