☆16May 8, 2024Updated last year
Alternatives and similar repositories for open-instruct
Users who are interested in open-instruct are comparing it to the libraries listed below.
- [ACL 2024] Progressive LLaMA with Block Expansion.☆514May 20, 2024Updated last year
- Evolve LLM training instructions, from English instructions to any language.☆120Sep 15, 2023Updated 2 years ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆96Oct 30, 2024Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆96Aug 15, 2023Updated 2 years ago
- Self-supervised learning in NLP, read in reverse☆27Oct 30, 2022Updated 3 years ago
- ☆10Oct 15, 2019Updated 6 years ago
- Consistent dialogue generation☆16Oct 26, 2022Updated 3 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆10Jan 11, 2023Updated 3 years ago
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆51Mar 1, 2026Updated 2 weeks ago
- A clean and structured implementation of the RNN family with wandb and pytorch-lightning☆47May 21, 2022Updated 3 years ago
- Wrtn.ai unofficial openai-style api☆12Aug 17, 2023Updated 2 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- [ Text Analytics ] Development of a Korean-language LLM specialized for the legal domain☆15Sep 14, 2025Updated 6 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 6 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Code for automating Korean translation with DeepL☆12Jul 27, 2023Updated 2 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- 🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…☆16Oct 7, 2024Updated last year
- ☆10Mar 24, 2023Updated 2 years ago
- This repository contains the metadata and data of different databases that we use for testing☆14Jan 29, 2025Updated last year
- Official code release for "SuperBPE: Space Travel for Language Models"☆90Jan 9, 2026Updated 2 months ago
- readthedocs.org documentation for Inkplate boards☆10Aug 25, 2025Updated 6 months ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation"☆18Aug 31, 2019Updated 6 years ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Jul 24, 2025Updated 7 months ago
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative AP…☆14Jun 27, 2025Updated 8 months ago
- Unofficial implementation of the CoT-decoding method for extracting CoT paths in an unsupervised way☆20Jan 11, 2026Updated 2 months ago
- Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning☆12Sep 2, 2024Updated last year
- ☆12Apr 28, 2023Updated 2 years ago
- ☆14Dec 27, 2016Updated 9 years ago
- An OpenAI-compatible API that integrates LLM, Embedding, and Reranker.☆18Aug 21, 2025Updated 6 months ago
- ☆16Aug 23, 2023Updated 2 years ago
- ☆33Oct 13, 2025Updated 5 months ago
- ☆12Dec 6, 2024Updated last year