joydeb28 / llm-labLinks
LLM, Fine Tuning, Llama 2, Gemma, Mixtral, vLLM, LangChain, RAG, ChromaDB, FAISS
☆13Updated last year
Alternatives and similar repositories for llm-lab
Users that are interested in llm-lab are comparing it to the libraries listed below
Sorting:
- A collection of large question answering datasets☆429Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆90Updated 7 months ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆184Updated 2 years ago
- simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.☆400Updated 2 years ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆174Updated last month
- A Neural Framework for MT Evaluation☆712Updated this week
- Multilingual/multidomain question generation datasets, models, and python library for question generation.☆373Updated last year
- Summarize existing representative LLMs text datasets.☆1,431Updated 4 months ago
- Expanding natural instructions☆1,030Updated 2 years ago
- ☆254Updated last year
- ☆22Updated last year
- Source code of paper "Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector" (Findings of ACL 2024)☆12Updated 10 months ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆589Updated 2 years ago
- Finetune BLOOM☆40Updated 2 years ago
- The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".☆1,594Updated 8 months ago
- ☆1,345Updated 11 months ago
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆241Updated last year
- ☆127Updated last year
- “Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice.☆254Updated 3 years ago
- A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。☆721Updated last year
- ☆122Updated 2 years ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗☆220Updated 5 years ago
- Crosslingual Generalization through Multitask Finetuning☆537Updated last year
- All-in-one text de-duplication☆741Updated last month
- This repository is dedicated to summarizing papers related to large language models with the field of law☆281Updated 3 weeks ago
- ReadMe++: A Multi-domain Multilingual Dataset for Readability Assessment☆12Updated 9 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆523Updated 2 years ago
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder …☆166Updated 7 months ago
- A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)☆1,143Updated 2 years ago
- LLM Finetuning with peft☆2,767Updated 6 months ago