hjandlm / Chunk-Factory

Chunk-Factory is a fast, efficient text chunking library with real-time evaluation.

☆10

Alternatives and similar repositories for Chunk-Factory

Users that are interested in Chunk-Factory are comparing it to the libraries listed below

Sorting:

yhao-wang / LLM-Knowledge-Boundary
Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"
☆22Updated last year
loveisp / KDD_2024_AQA
KDD 2024 AQA competition 2nd place solution
☆11Updated 9 months ago
StibiumT16 / Robust-Fine-tuning
Code for Robust Fine-tuning (RbFT)
☆12Updated 3 months ago
jiahe7ay / infini-mini-transformer
This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…
☆56Updated last year
KomeijiForce / MetaIE
This is a meta-model distilled from LLMs for information extraction. This is an intermediate checkpoint that can be well-transferred to a…
☆27Updated 2 months ago
CLUEbenchmark / SuperCLUE-Industry
中文原生工业测评基准
☆13Updated last year
seanzhang-zhichen / baichuan-Dynamic-NTK-ALiBi
百川Dynamic NTK-ALiBi的代码实现：无需微调即可推理更长文本
☆47Updated last year
Academic-Hammer / HammerLLM
1.4B sLLM for Chinese and English - HammerLLM🔨
☆44Updated last year
cjymz886 / LLM-RAG-QA
LLM+RAG for QA
☆22Updated last year
LC1332 / Luotuo-Silk-Road
Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…
☆39Updated last year
ZBWpro / PretCoTandKE
☆24Updated this week
thunlp / Document-Plugin
Plug-and-Play Document Modules for Pre-trained Models
☆26Updated last year
Zheng0428 / COIG-Kun
☆36Updated 8 months ago
ArtificialZeng / llama3_explained
the newest version of llama3，source code explained line by line using Chinese
☆22Updated last year
ssbuild / aigc_evals
aigc evals
☆10Updated last year
ictnlp / LevelRAG
The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…
☆29Updated last month
WalkerMitty / Fast-Llama2
Fast instruction tuning with Llama2
☆11Updated last year
zhangqi-here / UnifiedEAE
A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck
☆10Updated 2 years ago
LuckyyySTA / Fine-grained-Attribution
[ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models
☆18Updated 6 months ago
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆32Updated 11 months ago
yann168 / boshi-sample-solution
☆15Updated last year
tianchiguaixia / ocr_recognition
微调阿里开源的文字检测模型，利用合合识别返回的OCR结果作为初始训练数据，对模型进行优化训练，使其更加适应1万张图片的具体场景，提高文字识别的精度。
☆9Updated 5 months ago
hahahawu / VCSum
Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"
☆40Updated last year
RUC-GSAI / Llama-3-SynE
Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …
☆32Updated 5 months ago
yongchao98 / PROMST
Automatic prompt optimization framework for multi-step agent tasks.
☆30Updated 6 months ago
yifeiwang77 / Self-Correction
☆20Updated 6 months ago
dqwang122 / MLROUGE
ROUGE for multilingual Summarization
☆24Updated 3 years ago
CLUEbenchmark / SuperCLUE-Code3
中文原生等级化代码能力测试基准
☆13Updated last year
OpenBMB / DecT
Source code for ACL 2023 paper Decoder Tuning: Efﬁcient Language Understanding as Decoding
☆49Updated last year
MikeGu721 / EasyLLM
make LLM easier to use
☆59Updated last year