Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense LM. Primarily used by KoboldAI and mtj-softtuner.
☆22Nov 14, 2022Updated 3 years ago
Alternatives and similar repositories for mesh-transformer-jax
Users that are interested in mesh-transformer-jax are comparing it to the libraries listed below
Sorting:
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- Expanded KR-BERT for Sentiment Analysis☆13Apr 23, 2021Updated 4 years ago
- Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts☆18Mar 15, 2021Updated 5 years ago
- ☆23Oct 30, 2023Updated 2 years ago
- ☆21May 24, 2023Updated 2 years ago
- Easy installer of kocohub dataset☆24May 31, 2020Updated 5 years ago
- Korean BERT model using character tokenizer☆27Apr 8, 2021Updated 4 years ago
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"☆10Oct 20, 2022Updated 3 years ago
- Bias, Hate classification with KoELECTRA 👿☆27Jun 12, 2023Updated 2 years ago
- 한국어 언어 모델 학습을 위한 프로젝트(Flax, Pytorch with Huggingface Accelerate)☆32Sep 13, 2023Updated 2 years ago
- Attention-Informed Mixed-Language Training for Zero-shot Cross-lingual Task-oriented Dialogue Systems (AAAI-2020)☆31Jan 8, 2020Updated 6 years ago
- KoGPT2 on Huggingface Transformers☆33May 4, 2021Updated 4 years ago
- KSenticNet: 한국어 감성 사전☆33May 20, 2019Updated 6 years ago
- My OpenCode and Oh-My-OpenCode configuration files with API proxy setup documentation☆33Jan 5, 2026Updated 2 months ago
- Super simple, zero config options, <2kb declarative tooltip library with no dependencies.☆17Jun 2, 2023Updated 2 years ago
- 서울시 민원 데이터 자동 분류 분석가이드(서울디지털재단)☆12Apr 3, 2021Updated 4 years ago
- Platform for creating audio-first AI assistants that can work offline using a flexible plugin architecture☆13Jun 29, 2025Updated 8 months ago
- notebook for running gpt-4chan on colab☆11Aug 11, 2022Updated 3 years ago
- LocEmb: Location Embedding (Currently covering districts, roads, and businesses in Korea)☆11Aug 15, 2022Updated 3 years ago
- La plateforme derrière nous le peuple. Fork de Pligg.☆10Sep 29, 2015Updated 10 years ago
- ☆22Feb 3, 2026Updated last month
- Apply schema.org microdata to your erb files☆13Jun 5, 2011Updated 14 years ago
- ☆10Jun 5, 2025Updated 9 months ago
- Library and examples to interface a HPGL plotter such as HP7550a to processing.☆10Jan 15, 2015Updated 11 years ago
- An OSINT tool to find data leaks on a targeted website☆17Mar 30, 2021Updated 4 years ago
- ☆12Nov 30, 2022Updated 3 years ago
- ☆10Sep 13, 2022Updated 3 years ago
- ☆13Mar 3, 2026Updated 2 weeks ago
- Read, write and manipulate code which reads, writes and manipulates code.☆10Mar 15, 2020Updated 6 years ago
- Benchmarks for Business Document Foundation Models☆10Apr 4, 2024Updated last year
- ☆12Nov 22, 2018Updated 7 years ago
- ☆11Dec 11, 2024Updated last year
- Package to parse and analyze trademark data from the United States Patent and Trademark Office☆14Apr 5, 2017Updated 8 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- Access Control for Hyperdrive☆11Jun 15, 2023Updated 2 years ago
- ☆12May 12, 2025Updated 10 months ago
- Morfessor EM+Prune☆10Jul 22, 2020Updated 5 years ago
- ☆12Jun 7, 2023Updated 2 years ago
- Log queries that are incompatible with a Postgres pooler in transaction mode.☆11Apr 13, 2023Updated 2 years ago