Code for fine-tuning LMs (rugpt, LLaMa, FRED T5) using transformers + deepspeed + LoRa
☆14May 22, 2023Updated 2 years ago
Alternatives and similar repositories for LM-finetune
Users that are interested in LM-finetune are comparing it to the libraries listed below.
- Dialogue system based on FRED-T5☆37Jul 10, 2023Updated 2 years ago
- ☆17Oct 9, 2023Updated 2 years ago
- Russian text segmenter and tokenizer☆18Mar 2, 2021Updated 5 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- A smart chunker for efficiently preparing long documents for RAG☆13Updated this week
- ☆21Apr 2, 2025Updated 11 months ago
- Data and Code for COLM 2025 paper "Retrieval-Augmented Generation with Conflicting Evidence"☆20Apr 18, 2025Updated 11 months ago
- 📚 A small collection of Russian literature 📚☆13Dec 9, 2022Updated 3 years ago
- Implementation of a transformer for optical character recognition of Russian words☆14Nov 25, 2023Updated 2 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- Emacs minor mode for entering unicode math symbols☆11Dec 10, 2023Updated 2 years ago
- Large silver-standard Russian corpus with NER, morphology and syntax markup☆73Jul 24, 2023Updated 2 years ago
- Russian-language generative chatbot with a profile and facts☆260Jan 20, 2023Updated 3 years ago
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- ☆16Oct 29, 2023Updated 2 years ago
- Implementation of a sklearn-based Transformer for the Weight of Evidence transformation☆10May 6, 2020Updated 5 years ago
- Contains TeXLive distributions with additional python-pygments library for source code highlighting and pandoc.☆13May 6, 2024Updated last year
- A self-hosted data validation platform, for labor intensive fact checking.☆27Jul 3, 2025Updated 8 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆164Dec 8, 2025Updated 3 months ago
- Bachelor's thesis☆10Sep 11, 2015Updated 10 years ago
- IEEE Investment Ranking Challenge solution (4th place)☆10Jun 1, 2018Updated 7 years ago
- A docker container for Gitlab CI to build papers with Latex☆10Aug 30, 2022Updated 3 years ago
- Text reading pipeline that combines segmentation and OCR-models.☆26Feb 6, 2023Updated 3 years ago
- Building a model for predicting whether a student will be admitted to college. Done as a part of Project of the Week at DataTalks.Club☆11Aug 15, 2022Updated 3 years ago
- ☆91Aug 30, 2019Updated 6 years ago
- Typographic rules of the Imprimerie Nationale☆11May 24, 2020Updated 5 years ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆37Mar 2, 2025Updated last year
- ML Course created for Bauman Moscow State Technical University☆65Aug 31, 2022Updated 3 years ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing☆13Dec 6, 2022Updated 3 years ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆25Feb 16, 2025Updated last year
- Latest version of Docear re-packaged with old java library to work on Linux without additional configurations.☆10Dec 13, 2020Updated 5 years ago
- Implementation of https://arxiv.org/abs/1904.00962☆15Aug 30, 2019Updated 6 years ago
- Meeting every Thursday at 20:00☆16Jul 24, 2020Updated 5 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆20Feb 22, 2021Updated 5 years ago
- Code for "Five Hundred Deep Learning Papers, Graphviz and Python" (http://goo.gl/l1PIoi)☆14Dec 9, 2015Updated 10 years ago
- All presentations from Data Fest Kyiv 2017 http://datafest.in.ua☆13Apr 24, 2017Updated 8 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- ⚡ A set of solutions for developing LLM applications in Russian with GigaChat support ⚡☆550Mar 16, 2026Updated last week
- This Chrome extension lets you summarize YouTube videos using ChatGPT.☆17Dec 10, 2022Updated 3 years ago