Tools for training causal language models for Finnish
☆27 · Jan 14, 2026 · Updated 5 months ago
Alternatives and similar repositories for finngen-tools
Users that are interested in finngen-tools are comparing it to the libraries listed below.
- ☆15 · Aug 30, 2021 · Updated 4 years ago
- Portable Qt-based console for SWI-Prolog by Carlo Capelli ☆11 · Sep 9, 2025 · Updated 9 months ago
- The graphics toolkit for SWI-Prolog ☆22 · Updated this week
- FlexiTokens ☆23 · Dec 27, 2025 · Updated 6 months ago
- Measures integrative complexity in English texts. ☆12 · Apr 29, 2017 · Updated 9 years ago
- Fork of Flame repo for training of some new stuff in development ☆19 · Jun 19, 2026 · Updated last week
- Comparison of existing spell checking tools ☆11 · Mar 28, 2023 · Updated 3 years ago
- Code for "Fast Sparse ConvNets" CVPR2020 submissions ☆12 · Nov 20, 2019 · Updated 6 years ago
- A Swedish Natural Language Understanding Benchmark ☆11 · Apr 23, 2026 · Updated 2 months ago
- LTG-Bert ☆34 · Jan 8, 2024 · Updated 2 years ago
- Interface from C# to SWI-Prolog ☆27 · Dec 7, 2025 · Updated 6 months ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training. ☆27 · Nov 25, 2024 · Updated last year
- [ICLR 2023] "Revisiting Pruning At Initialization Through The Lens of Ramanujan Graph" by Duc Hoang, Shiwei Liu, Radu Marculescu, Atlas W… ☆14 · Aug 4, 2023 · Updated 2 years ago
- A Signal Propagation Perspective for Pruning Neural Networks at Initialization ☆14 · Jun 23, 2020 · Updated 6 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 · Aug 12, 2023 · Updated 2 years ago
- data related codebase for polyglot project ☆19 · Mar 30, 2023 · Updated 3 years ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So… ☆16 · Apr 21, 2025 · Updated last year
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… ☆30 · Jul 24, 2025 · Updated 11 months ago
- ☆28 · Mar 29, 2025 · Updated last year
- Python barebones for uProbe-1 ultrasound probe acquisitions ☆17 · Nov 11, 2017 · Updated 8 years ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following ☆16 · Oct 31, 2024 · Updated last year
- Pytorch implementation of Generative Adversarial Networks (GAN) for ULTRASOUND image. ☆13 · Sep 12, 2018 · Updated 7 years ago
- A High-level Library for Named Entity Recognition in Python. ☆25 · Dec 7, 2023 · Updated 2 years ago
- RosenPy is a complex-valued neural network library, written in Python; Incorporates CVNNs such as CV-FFNN (complex-valued feedforward neu… ☆14 · Sep 17, 2024 · Updated last year
- ☆35 · Jun 17, 2026 · Updated last week
- Push.Foo - Web Push API Playground ☆54 · May 27, 2023 · Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆98 · Feb 9, 2023 · Updated 3 years ago
- ☆19 · Aug 10, 2024 · Updated last year
- Official repo for the NCR Crypto Meetup ☆17 · Jun 1, 2022 · Updated 4 years ago
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp… ☆153 · Jun 13, 2026 · Updated 2 weeks ago
- ☆11 · Nov 16, 2021 · Updated 4 years ago
- Low-cost, portable, super resolution ultrasound imaging system. It’s an 8-channel transceiver with 16 analog multiplexes to time-multiple… ☆13 · Oct 6, 2021 · Updated 4 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 · Sep 17, 2025 · Updated 9 months ago
- Pure C implementation of Voxtral-4B-TTS-2603 ☆106 · Mar 27, 2026 · Updated 3 months ago
- A parser for arxiv.org astro-ph that will find a list of papers matching keywords and either e-mail it or post it to Slack ☆29 · Feb 7, 2024 · Updated 2 years ago
- Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23) ☆28 · Aug 19, 2023 · Updated 2 years ago
- Code for "A holistic representation guided attention network for scene text recognition" (Neurocomputing 2020) ☆17 · Dec 1, 2020 · Updated 5 years ago
- ☆13 · Feb 28, 2024 · Updated 2 years ago
- Official Pytorch Implementation of "Outlier-weighed Layerwise Sampling for LLM Fine-tuning" by Pengxiang Li, Lu Yin, Xiaowei Gao, Shiwei … ☆35 · Jun 3, 2025 · Updated last year