OpenLLMAI / OpenLLMDELinks

OpenLLMDE: An open source data engineering framework for LLMs

☆17

Alternatives and similar repositories for OpenLLMDE

Users that are interested in OpenLLMDE are comparing it to the libraries listed below

Sorting:

Academic-Hammer / HammerLLM
1.4B sLLM for Chinese and English - HammerLLM🔨
☆44Updated last year
Alibaba-NLP / RankingGPT
code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》
☆34Updated last year
RWKV-Wiki / MultilingualShareGPT
MultilingualShareGPT, the free multi-language corpus for LLM training
☆72Updated 2 years ago
seanzhang-zhichen / baichuan-Dynamic-NTK-ALiBi
百川Dynamic NTK-ALiBi的代码实现：无需微调即可推理更长文本
☆47Updated last year
yhao-wang / LLM-Knowledge-Boundary
Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"
☆22Updated 2 years ago
CLUEbenchmark / SuperCLUE-Code3
中文原生等级化代码能力测试基准
☆15Updated last year
Longyichen / Alpaca-family-library
Summarize all open source Large Languages Models and low-cost replication methods for Chatgpt.
☆136Updated 2 years ago
Zheng0428 / COIG-Kun
☆36Updated 11 months ago
MikeGu721 / EasyLLM
make LLM easier to use
☆59Updated 2 years ago
jiahe7ay / infini-mini-transformer
This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…
☆58Updated last year
Magnetic2014 / RoleEval
A Bilingual Role Evaluation Benchmark for Large Language Models
☆42Updated last year
ssbuild / aigc_evals
aigc evals
☆10Updated last year
gydpku / PPTC
PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
☆55Updated last year
FlagOpen / Infinity-Instruct
☆49Updated last year
BAAI-WuDao / P-tuning
Finetune CPM-1
☆24Updated 4 years ago
mrcabbage972 / simple-toolformer
A Python implementation of Toolformer using Huggingface Transformers
☆14Updated 2 years ago
GasolSun36 / Iter-CoT
[NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models
☆85Updated last year
onesuper / HuggingFace-Datasets-Text-Quality-Analysis
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…
☆53Updated 2 years ago
beichao1314 / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆66Updated 2 years ago
Agora-Lab-AI / Orca
An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"
☆43Updated 9 months ago
dqwang122 / MLROUGE
ROUGE for multilingual Summarization
☆25Updated 3 years ago
OpenBMB / DecT
Source code for ACL 2023 paper Decoder Tuning: Efﬁcient Language Understanding as Decoding
☆51Updated 2 years ago
KuaiSearchPERKS / PERKS
KuaiSearch PERKS
☆11Updated 3 years ago
syncdoth / Chain-of-Hindsight-PyTorch
Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.
☆11Updated 2 years ago
llmeval / llmeval-1
中文大语言模型评测第一期
☆109Updated last year
FreedomIntelligence / FastLLM
Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];
☆40Updated last year
OpenLMLab / scaling-rope
code for Scaling Laws of RoPE-based Extrapolation
☆73Updated last year
stanford-oval / dialogues
A unified versatile interface for dialogue datasets
☆18Updated last year
thu-coai / OPD
OPD: Chinese Open-Domain Pre-trained Dialogue Model
☆75Updated 2 years ago
RUCAIBox / BAMBOO
☆35Updated last year