Code for fine-tuning LMs (rugpt, LLaMa, FRED T5) using transformers + deepspeed + LoRa
☆14May 22, 2023Updated 2 years ago
Alternatives and similar repositories for LM-finetune
Users that are interested in LM-finetune are comparing it to the libraries listed below.
- ☆17Oct 9, 2023Updated 2 years ago
- Russian text segmenter and tokenizer☆18Mar 2, 2021Updated 5 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- ☆21Apr 2, 2025Updated last year
- Data and Code for COLM 2025 paper "Retrieval-Augmented Generation with Conflicting Evidence"☆21Apr 18, 2025Updated last year
- Gazeta: Dataset for automatic summarization of Russian news☆37Oct 6, 2021Updated 4 years ago
- Multilingual RAG benchmark.☆10Nov 22, 2024Updated last year
- Implementation of transformer for optical character recognition of russian words☆14Nov 25, 2023Updated 2 years ago
- ChatGPT Jailbreak prompts☆16Mar 22, 2023Updated 3 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- Code for blog post on r-squared☆13Jul 25, 2016Updated 9 years ago
- Russian DialoGPT☆26May 14, 2021Updated 4 years ago
- Large silver standard Russian corpus with NER, morphology and syntax markup☆74Updated this week
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- A language model project for morphemic analysis, segmentation, and tokenization of Russian words.☆17Jan 10, 2025Updated last year
- Implementation of an sklearn-based Transformer for the Weight of Evidence transformation☆10May 6, 2020Updated 5 years ago
- Contains TeXLive distributions with additional python-pygments library for source code highlighting and pandoc.☆13May 6, 2024Updated last year
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆166Dec 8, 2025Updated 4 months ago
- A self-hosted data validation platform, for labor intensive fact checking.☆28Jul 3, 2025Updated 9 months ago
- Bachelor thesis☆10Sep 11, 2015Updated 10 years ago
- IEEE Investment Ranking Challenge solution (4th place)☆10Jun 1, 2018Updated 7 years ago
- Python package to compute metrics on an NLU intent parsing pipeline☆13Mar 10, 2020Updated 6 years ago
- Text reading pipeline that combines segmentation and OCR-models.☆26Feb 6, 2023Updated 3 years ago
- Building a model for predicting whether a student will be admitted to college. Done as a part of Project of the Week at DataTalks.Club☆11Aug 15, 2022Updated 3 years ago
- Reinforcement Learning Agent for Binance Futures — realistic backtesting, CNN + D3QN + PER, and reproducible training pipeline.☆60Aug 12, 2025Updated 8 months ago
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- ☆90Aug 30, 2019Updated 6 years ago
- CraftML is a RESTful web service for easy pipeline creation without code.☆13Apr 18, 2021Updated 5 years ago
- Typographic rules of the Imprimerie Nationale☆11May 24, 2020Updated 5 years ago
- Stickers just for fun☆12Dec 17, 2022Updated 3 years ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆36Mar 2, 2025Updated last year
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing☆13Dec 6, 2022Updated 3 years ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆25Feb 16, 2025Updated last year
- Implementation of https://arxiv.org/abs/1904.00962☆15Aug 30, 2019Updated 6 years ago
- Planet: Understanding the Amazon from Space☆12Jul 23, 2017Updated 8 years ago
- Meetup every Thursday at 20:00☆16Jul 24, 2020Updated 5 years ago
- Code for "Five Hundred Deep Learning Papers, Graphviz and Python" (http://goo.gl/l1PIoi)☆14Dec 9, 2015Updated 10 years ago
- ☆10Oct 4, 2024Updated last year
- ☆19Mar 18, 2021Updated 5 years ago