A set of scripts and configurations for pretraining Large Language Models (LLMs)
☆35Mar 2, 2025Updated last year
Alternatives and similar repositories for impruver
Users that are interested in impruver are comparing it to the libraries listed below.
- ☆13Jan 17, 2024Updated 2 years ago
- Training and data processing code for Saiga☆54Jan 2, 2026Updated 4 months ago
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- This repository measures the quality of Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanE…☆24Apr 16, 2025Updated last year
- Effective LLM Alignment Toolkit☆153Jun 25, 2025Updated 10 months ago
- Repository for the course "Practical Aspects of Training Large Language Models", CMC MSU, fall 2024☆20Dec 24, 2024Updated last year
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- Memory-optimized training scripts for video models based on Diffusers☆16Jan 3, 2025Updated last year
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production …☆15Apr 17, 2024Updated 2 years ago
- Data and Code for COLM 2025 paper "Retrieval-Augmented Generation with Conflicting Evidence"☆23Apr 18, 2025Updated last year
- A prototype of an Agentic RAG (Retrieval-Augmented Generation) solution with data from Jira, Confluence, and Git.☆11Dec 4, 2024Updated last year
- A graphing calculator written in C.☆13Oct 17, 2023Updated 2 years ago
- Multilingual RAG benchmark.☆10Nov 22, 2024Updated last year
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.☆13Aug 26, 2020Updated 5 years ago
- Several PHP scripts with code for image animation. The sources were written for the site http://ffmpeg.unixpin.com☆12Nov 26, 2018Updated 7 years ago
- Flutter-based mobile app displaying a list of daily curated content from top engineering blogs and articles. Backed by a GraphQL-based AP…☆24May 16, 2026Updated last week
- ☆13Jun 5, 2024Updated last year
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- ☆11Dec 11, 2024Updated last year
- Russian text segmenter and tokenizer☆18Mar 2, 2021Updated 5 years ago
- Use Machine Learning to predict abandoned carts and take action to prevent them for Magento platform☆12Jul 13, 2014Updated 11 years ago
- ☆22Feb 10, 2026Updated 3 months ago
- ☆19Sep 15, 2022Updated 3 years ago
- Learning vLLM: deploying the Qwen2-0.5B model with vLLM, and deploying it with Docker.☆20Jun 22, 2024Updated last year
- Implementation of a transformer for optical character recognition of Russian words☆14Nov 25, 2023Updated 2 years ago
- Perl implementation of the Naval Research Laboratory text-to-phoneme algorithm, described by Elovitz et al (1976)☆16May 7, 2020Updated 6 years ago
- The tiniest sentence encoder for Russian language☆245Jul 25, 2024Updated last year
- ☆16Oct 29, 2023Updated 2 years ago
- Notebooks for RAG optimization workshop, using HackerNews data☆21Mar 27, 2024Updated 2 years ago
- Discovering YouTube sailing channels by using a network analysis on recursive channel subscriptions☆12Oct 22, 2023Updated 2 years ago
- ☆14Jul 26, 2023Updated 2 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Dec 8, 2022Updated 3 years ago
- Yii2 extension for handling deferred tasks (background cron jobs)☆10Apr 25, 2017Updated 9 years ago
- mysterious ooze☆15Mar 22, 2025Updated last year
- A WordPress trip summary plugin to help travel bloggers manage and display structured information about their train rides and biking or h…☆13May 29, 2025Updated 11 months ago
- Alternative yii\grid\ActionColumn for yii2☆11Aug 7, 2025Updated 9 months ago
- Neural network that can generate meme captions, built upon GPT-2☆13May 21, 2021Updated 5 years ago
- 🗺️ An interactive scratch off map. Keep track of which places you have been, how much of the world you have conquered, and where to go n…☆11Mar 26, 2026Updated last month