Custom-built Bulgarian language data sets, used by АзБуки.ML for sentiment analysis, text classification, summarisation and generation. Open-source & free to use in any ML project.
☆19Jan 12, 2024Updated 2 years ago
Alternatives and similar repositories for public-data
Users that are interested in public-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM play 20questions with itself☆13Mar 31, 2023Updated 3 years ago
- ☆15Mar 11, 2021Updated 5 years ago
- ☆12Apr 25, 2026Updated 3 weeks ago
- ☆13Dec 8, 2022Updated 3 years ago
- Easily change the background of your Hyper terminal!☆11Nov 29, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Auxiliary tasks for task-oriented dialogue systems. Published in ICNLSP'22 and indexed in the ACL Anthology.☆17Feb 27, 2023Updated 3 years ago
- The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits"☆21Dec 14, 2022Updated 3 years ago
- A tool for extracting plain text from Wikipedia dumps☆15Oct 3, 2019Updated 6 years ago
- ☆16May 5, 2022Updated 4 years ago
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)☆21Jun 22, 2023Updated 2 years ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆25Aug 23, 2019Updated 6 years ago
- Unofficial Implementation of Consistency Models in Pytorch☆15Mar 18, 2023Updated 3 years ago
- Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding…☆25Jun 6, 2020Updated 5 years ago
- ✨ A simple application that helps me manage my passwords easier.☆20Jan 13, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆25Aug 4, 2022Updated 3 years ago
- Official Code for SIGIR 2022 "A Multi-task Based Neural Model to Simulate Users in Goal Oriented Dialogue Systems". User Simulator genera…☆37Jul 14, 2022Updated 3 years ago
- Morphological Analyzer for Russian 💬☆41Jul 14, 2021Updated 4 years ago
- Autonomous cloud for the autonomous era☆44Jan 13, 2026Updated 4 months ago
- ☆68Aug 16, 2024Updated last year
- Transliterate Cyrillic → Latin in every possible way☆72Jan 4, 2025Updated last year
- Language Models for Zalando's flair library☆61Jan 20, 2020Updated 6 years ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆111Mar 7, 2025Updated last year
- ☆129Jan 22, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Fine-tune a large language model on your own iMessages☆122Apr 23, 2023Updated 3 years ago
- A deep learning tabular classification architecture inspired by TabTransformer with integrated gated multilayer perceptron.☆101Feb 4, 2023Updated 3 years ago
- Interface for easier topic modelling.☆143Jul 29, 2024Updated last year
- Lingtrain Aligner — ML powered library for the accurate texts alignment.☆152May 14, 2026Updated last week
- Code Release for “Balanced Contrastive Learning for Long-Tailed Visual Recognition”☆112Oct 31, 2022Updated 3 years ago
- ☆165Feb 15, 2025Updated last year
- Universal I/O bridge for the ESP8266, including GPIO, I2C and UART (serial bridge)☆139May 8, 2024Updated 2 years ago
- Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT☆227Mar 25, 2026Updated last month
- ⭐ An awesome list that curates the best Flet libraries, tools, tutorials, articles and more.☆262Mar 18, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Compact high quality word embeddings for Russian language☆219Apr 13, 2026Updated last month
- Deep Learning based NLP modeling for Russian language☆246Jul 24, 2023Updated 2 years ago
- [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.☆897Nov 26, 2025Updated 5 months ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆245May 31, 2023Updated 2 years ago
- Llama2 LLM ported to Rust burn☆280Apr 16, 2024Updated 2 years ago
- Models and examples built with Burn☆364Apr 28, 2026Updated 3 weeks ago
- A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick☆294Nov 25, 2023Updated 2 years ago