slovak-nlp/resources

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/slovak-nlp/resources)

slovak-nlp / resources

A curated list of resources such as tools and datasets useful for the processing of Slovak language

☆24

Alternatives and similar repositories for resources

Users that are interested in resources are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uds-lsv / llmft
View on GitHub
Fine-tuning large language models with huggingface transformers and deepspeed
☆31Dec 11, 2023Updated 2 years ago
abhimishra91 / pytorch-tutorials
View on GitHub
Set of notebooks to practice Pytorch from basics to RNN
☆27May 2, 2020Updated 6 years ago
Yale-LILY / LoFT
View on GitHub
Code for EACL 2023 paper "LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control"
☆20Feb 7, 2023Updated 3 years ago
crate-ci / imperative
View on GitHub
Check the mood of a word
☆18Jul 9, 2026Updated 2 weeks ago
official-elinas / zeus-llm-trainer
View on GitHub
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Aug 27, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
x-tabdeveloping / turftopic
View on GitHub
Robust and fast topic models with sentence-transformers.
☆118Updated this week
explosion / wikid
View on GitHub
Generate a SQLite database from Wikipedia & Wikidata dumps.
☆39Mar 27, 2024Updated 2 years ago
jackv24 / Unity-Nested-Fade-Group
View on GitHub
A generic fade group system to mimic the standard Canvas Group alpha control functionality for other things, such as SpriteRenderer.
☆11Jan 21, 2020Updated 6 years ago
michahu / pre-pretraining
View on GitHub
Accelerate pretraining by pre-pretraining on formal languages!
☆20Feb 13, 2026Updated 5 months ago
MaLA-LM / GlotEval
View on GitHub
GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way
☆18Nov 4, 2025Updated 8 months ago
tgjones / DotNetro
View on GitHub
.NET AOT compiler for retro computers
☆15Jul 11, 2026Updated last week
shubham0204 / tfidf-summarizer.rs
View on GitHub
Simple, efficient and cross-platform TFIDF-based text summarizer in Rust
☆13Apr 12, 2024Updated 2 years ago
wmt-conference / wmt22-news-systems
View on GitHub
☆21Feb 13, 2023Updated 3 years ago
stephen-huan / plaid
View on GitHub
Resources for building a Plaid keyboard
☆14May 25, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
CharlesStover / reactn-devtools
View on GitHub
ReactN DevTools allow you to view your ReactN state with the Redux DevTools browser extension
☆11Apr 10, 2022Updated 4 years ago
jordanbrauer / citylights.nvim
View on GitHub
A Neovim port of the beautiful City Lights syntax theme by Yummygum
☆12Apr 30, 2024Updated 2 years ago
B-M-dev / Bilingual_Manga_databases
View on GitHub
Databases of Bilingual Manga
☆15Mar 28, 2026Updated 3 months ago
UnityTechnologies / Unity-Entities-Dodge-the-Bullets
View on GitHub
Unity Korea Entities(ECS) tutorial project
☆18May 9, 2024Updated 2 years ago
jongwooko / NASH-Pruning-Official
View on GitHub
Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP …
☆17Oct 17, 2023Updated 2 years ago
DimABSA / DimABSA2026
View on GitHub
SemEval2026 Task 3 DimABSA
☆34Jun 8, 2026Updated last month
explosion / spacy-huggingface-hub
View on GitHub
🤗 Push your spaCy pipelines to the Hugging Face Hub
☆45Jun 2, 2024Updated 2 years ago
GermanT5 / wikipedia2corpus
View on GitHub
Wikipedia text corpus for self-supervised NLP model training
☆47Jul 17, 2022Updated 4 years ago
JDEA-NLP / Vega-MT
View on GitHub
[WMT 2022 champion system] Vega-MT model and inference scripts
☆41Feb 10, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
thooton / muse
View on GitHub
Let's create synthetic textbooks together :)
☆74Jan 29, 2024Updated 2 years ago
dropbox / low-rank-llama2
View on GitHub
Low-Rank Llama Custom Training
☆23Mar 27, 2024Updated 2 years ago
nicola-decao / efficient-autoregressive-EL
View on GitHub
Pytorch implementation of Highly Parallel Autoregressive Entity Linking with Discriminative Correction
☆66May 4, 2022Updated 4 years ago
stressosaurus / raw-data-google-ngram
View on GitHub
This will download and process the Google Ngram data.
☆25Nov 29, 2022Updated 3 years ago
tchewik / isanlp_rst
View on GitHub
RST Discourse Parsers
☆59May 4, 2026Updated 2 months ago
JHU-CLSP / ettin-encoder-vs-decoder
View on GitHub
State-of-the-art paired encoder and decoder models (17M-1B params)
☆75Aug 6, 2025Updated 11 months ago
kenjones007 / ESP32-e-Paper-Weather-Display
View on GitHub
An ESP32 and 7.5" ePaper Display reads Weather Underground data via their API and then displays the weather
☆29Apr 2, 2026Updated 3 months ago
malteos / llm-datasets
View on GitHub
A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.
☆66Jul 29, 2024Updated last year
huggingface / olm-training
View on GitHub
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆98Feb 9, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
OpenNLG / OpenBA-v2
View on GitHub
OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-1…
☆25May 10, 2024Updated 2 years ago
oobabooga / GPTQ-for-LLaMa
View on GitHub
4 bits quantization of LLaMa using GPTQ
☆130Jun 3, 2023Updated 3 years ago
jakubtomsu / vecc
View on GitHub
An experimental compiler for a vector programming languge inspired by ISPC and shader programming
☆45Feb 24, 2025Updated last year
zeromake / learnopengl-examples
View on GitHub
Examples from learnopengl.com, implemented using Sokol libraries.
☆48Mar 10, 2025Updated last year
MinishLab / tokenlearn
View on GitHub
Pre-train Static Word Embeddings
☆109Jun 9, 2026Updated last month
Helsinki-NLP / OPUS-MT-train
View on GitHub
Training open neural machine translation models
☆404Jan 17, 2026Updated 6 months ago
naver / nllb-pruning
View on GitHub
Library for pruning experts per language pair in NLLB-200
☆35Jul 7, 2023Updated 3 years ago