Код для файнтюна LM (rugpt, LLaMa, FRED T5) средствами transformers + deepspeed + LoRa
☆14May 22, 2023Updated 3 years ago
Alternatives and similar repositories for LM-finetune
Users that are interested in LM-finetune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- ☆21May 5, 2026Updated 3 weeks ago
- Data and Code for COLM 2025 paper "Retrieval-Augmented Generation with Conflicting Evidence"☆23Apr 18, 2025Updated last year
- Это прототип решения типа Agentic RAG (Retrieval-Augmented Generation) с данными из Jira, Confluence и Git.☆11Dec 4, 2024Updated last year
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆37Oct 6, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Multilingual RAG benchmark.☆10Nov 22, 2024Updated last year
- 📚 A small collection of Russian literature 📚☆15Dec 9, 2022Updated 3 years ago
- Implementation of transformer for optical character recognition of russian words☆14Nov 25, 2023Updated 2 years ago
- ChatGPT Jailbreak promts☆15Mar 22, 2023Updated 3 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- Code for blog post on r-squared☆13Jul 25, 2016Updated 9 years ago
- Emacs minor mode for entering unicode math symbols☆11Dec 10, 2023Updated 2 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆74Apr 13, 2026Updated last month
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Oct 29, 2023Updated 2 years ago
- Проект языковой модели для проведения морфемного анализа, сегментации и токенизации слов русского языка.☆17Jan 10, 2025Updated last year
- Реализация sklearn-based Transformer-а для Weight of Evidence преобразования☆10May 6, 2020Updated 6 years ago
- Contains TeXLive distributions with additional python-pygments library for source code highlighting and pandoc.☆13May 6, 2024Updated 2 years ago
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- A self-hosted data validation platform, for labor intensive fact checking.☆28Jul 3, 2025Updated 10 months ago
- Дипломная работа бакалавра / Bachelor thesis☆10Sep 11, 2015Updated 10 years ago
- Python package to compute metrics on an NLU intent parsing pipeline☆13Mar 10, 2020Updated 6 years ago
- Text reading pipeline that combines segmentation and OCR-models.☆26Feb 6, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Building a model for predicting whether a student will be admitted to college. Done as a part of Project of the Week at DataTalks.Club☆11Aug 15, 2022Updated 3 years ago
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- CraftML is a restful web service for easy pipeline creation without code.☆13Apr 18, 2021Updated 5 years ago
- Règles typographiques de l'Imprimerie Nationale☆11May 24, 2020Updated 6 years ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆35Mar 2, 2025Updated last year
- Библиотека для извлечения статистик из текстов на русском языке.☆126Jan 21, 2023Updated 3 years ago
- Latest version of Docear re-packaged with old java library to work on Linux without additional configurations.☆10Dec 13, 2020Updated 5 years ago
- Implementation of https://arxiv.org/abs/1904.00962☆15Aug 30, 2019Updated 6 years ago
- Planet: Understanding the Amazon from Space☆12Jul 23, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 매주 목요일, 20:00 모임☆16Jul 24, 2020Updated 5 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆19Feb 22, 2021Updated 5 years ago
- Code for "Five Hundred Deep Learning Papers, Graphviz and Python" (http://goo.gl/l1PIoi)☆14Dec 9, 2015Updated 10 years ago
- Brush up your tests!☆20Mar 4, 2021Updated 5 years ago
- ☆10Oct 4, 2024Updated last year
- All presentations from Data Fest Kyiv 2017 http://datafest.in.ua☆13Apr 24, 2017Updated 9 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago