Code for fine-tuning LMs (rugpt, LLaMA, FRED T5) using transformers + deepspeed + LoRA
☆14May 22, 2023Updated 2 years ago
Alternatives and similar repositories for LM-finetune
Users that are interested in LM-finetune are comparing it to the libraries listed below.
- Dialogue system based on FRED-T5☆38Jul 10, 2023Updated 2 years ago
- ☆17Oct 9, 2023Updated 2 years ago
- Russian text segmenter and tokenizer☆18Mar 2, 2021Updated 5 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- This is a smart chunker for efficiently preparing long documents for RAG☆13Mar 24, 2026Updated last month
- ☆21Apr 30, 2026Updated last week
- Data and Code for COLM 2025 paper "Retrieval-Augmented Generation with Conflicting Evidence"☆23Apr 18, 2025Updated last year
- Gazeta: Dataset for automatic summarization of Russian news☆37Oct 6, 2021Updated 4 years ago
- This is a prototype Agentic RAG (Retrieval-Augmented Generation) solution with data from Jira, Confluence, and Git.☆11Dec 4, 2024Updated last year
- ☆19Dec 14, 2024Updated last year
- Multilingual RAG benchmark.☆10Nov 22, 2024Updated last year
- 📚 A small collection of Russian literature 📚☆15Dec 9, 2022Updated 3 years ago
- Materials for the open course "Advanced Python" from VK Education, fall 2024☆18Dec 2, 2024Updated last year
- Deep Learning School in MIPT. ML, NN, etc. (basic framework - PyTorch)☆14Aug 24, 2022Updated 3 years ago
- ChatGPT Jailbreak prompts☆15Mar 22, 2023Updated 3 years ago
- Easiest way to get started with Rasa stack. A full stack web based chatbot on top of rasa stack.☆21Jul 20, 2018Updated 7 years ago
- Code for blog post on r-squared☆13Jul 25, 2016Updated 9 years ago
- Large silver-standard Russian corpus with NER, morphology and syntax markup☆74Apr 13, 2026Updated 3 weeks ago
- Russian-language generative chatbot with a profile and facts☆259Jan 20, 2023Updated 3 years ago
- ☆16Oct 29, 2023Updated 2 years ago
- Implementation of a sklearn-based Transformer for the Weight of Evidence transformation☆10May 6, 2020Updated 6 years ago
- Contains TeXLive distributions with additional python-pygments library for source code highlighting and pandoc.☆13May 6, 2024Updated 2 years ago
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆166Dec 8, 2025Updated 5 months ago
- A self-hosted data validation platform, for labor intensive fact checking.☆28Jul 3, 2025Updated 10 months ago
- IEEE Investment Ranking Challenge solution (4th place)☆10Jun 1, 2018Updated 7 years ago
- A docker container for Gitlab CI to build papers with Latex☆10Aug 30, 2022Updated 3 years ago
- Reinforcement Learning Agent for Binance Futures — realistic backtesting, CNN + D3QN + PER, and reproducible training pipeline.☆62Aug 12, 2025Updated 8 months ago
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- ☆90Aug 30, 2019Updated 6 years ago
- EasyPortrait - Face Parsing and Portrait Segmentation Dataset☆28Sep 2, 2024Updated last year
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆35Mar 2, 2025Updated last year
- ML Course created for Bauman Moscow State Technical University☆65Aug 31, 2022Updated 3 years ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing☆13Dec 6, 2022Updated 3 years ago
- A library for extracting statistics from Russian-language texts.☆126Jan 21, 2023Updated 3 years ago
- Latest version of Docear re-packaged with old java library to work on Linux without additional configurations.☆10Dec 13, 2020Updated 5 years ago
- Planet: Understanding the Amazon from Space☆12Jul 23, 2017Updated 8 years ago
- Code for "Five Hundred Deep Learning Papers, Graphviz and Python" (http://goo.gl/l1PIoi)☆14Dec 9, 2015Updated 10 years ago
- Brush up your tests!☆20Mar 4, 2021Updated 5 years ago