Генерация описаний к изображениям с помощью различных архитектур нейронных сетей
☆18May 13, 2023Updated 2 years ago
Alternatives and similar repositories for Image_captioning
Users that are interested in Image_captioning are comparing it to the libraries listed below
Sorting:
- A graphing calculator written in c.☆12Oct 17, 2023Updated 2 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- ☆13Jan 17, 2024Updated 2 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- ☆18Oct 14, 2024Updated last year
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.☆13Aug 26, 2020Updated 5 years ago
- Надеюсь, что тут будет множество веток и записей, но как пойдет.☆19Jul 24, 2024Updated last year
- My DS projects☆15Aug 6, 2025Updated 6 months ago
- Dialogue Act classification☆18Jan 15, 2024Updated 2 years ago
- Notebooks for RAG optimization workshop, using HackerNews data☆21Mar 27, 2024Updated last year
- ☆14Jul 26, 2023Updated 2 years ago
- A Tamagotchi in C!☆21May 22, 2019Updated 6 years ago
- Data from the publication "Multi-Domain Goal-Oriented Dialogues (MultiDoGO): Strategies toward Curating and Annotating Large Scale Dialog…☆25Dec 3, 2020Updated 5 years ago
- Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…☆23Apr 16, 2025Updated 10 months ago
- Zhirinovsky with ruGPT3☆27Dec 11, 2022Updated 3 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs with LoRA support.☆30Feb 19, 2024Updated 2 years ago
- Measuring the Mixing of Contextual Information in the Transformer☆34May 27, 2023Updated 2 years ago
- ☆30Dec 24, 2021Updated 4 years ago
- Collection of things related to UTF-8 and http://bjoern.hoehrmann.de/utf-8/decoder/dfa/☆36Mar 17, 2014Updated 11 years ago
- ☆42Sep 2, 2021Updated 4 years ago
- Training and data processing code for Saiga☆54Jan 2, 2026Updated last month
- Implementation of the Mamba SSM with hf_integration.☆55Aug 31, 2024Updated last year
- Sparse probing paper full code.☆67Dec 17, 2023Updated 2 years ago
- Evaluating tool-augmented LLMs in conversation settings☆89May 31, 2024Updated last year
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆114Oct 30, 2025Updated 4 months ago
- C examples for graduate course on MIPT☆152Jan 12, 2026Updated last month
- Materials of transformers lecture course☆143Jan 26, 2026Updated last month
- Pre-training code for Amber 7B LLM☆172May 10, 2024Updated last year
- Inference of Mamba and Mamba2 models in pure C☆197Jan 22, 2026Updated last month
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆252Oct 30, 2024Updated last year
- ☆316Jun 21, 2024Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG☆333Nov 8, 2024Updated last year
- Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.☆387Dec 7, 2025Updated 2 months ago
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆522Dec 16, 2025Updated 2 months ago
- Code for the ALiBi method for transformer language models (ICLR 2022)☆552Oct 30, 2023Updated 2 years ago
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆581Jun 11, 2024Updated last year
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆576Jun 28, 2024Updated last year
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆858Feb 20, 2026Updated last week
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆952Nov 16, 2025Updated 3 months ago