sanjibnarzary / awesome-llm
Curated list of open source and openly accessible large language models
☆22 Updated last year
Related projects:
- Training and implementation of chatbots that use a GPT-like architecture with the aitextgen package to enable dynamic conversations. ☆46 Updated 2 years ago
- Python tools for processing the Stack Exchange data dumps into a text dataset for language models ☆74 Updated 9 months ago
- Efficiently computing and storing token n-grams from large corpora ☆15 Updated 2 weeks ago
- Documentation effort for the BookCorpus dataset ☆30 Updated 3 years ago
- Ongoing research on training transformer language models at scale, including BERT and GPT-2 ☆18 Updated last year
- Code for the paper "Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966). ☆56 Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- Command-line script for running inference with models such as LLaMA in a chat scenario, with LoRA adaptations ☆33 Updated last year
- A file utility for accessing both local and remote files through a unified interface. ☆36 Updated last month
- The Next Generation Multi-Modality Superintelligence ☆69 Updated 2 weeks ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4" ☆43 Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large-scale image-text dataset. ☆31 Updated last year
- Open-source library for few-shot NLP ☆78 Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace models ☆44 Updated 10 months ago
- **ARCHIVED** Filesystem interface to 🤗 Hub ☆56 Updated last year
- Adversarial training and SFT for bot safety models ☆38 Updated last year
- Developing tools to automatically analyze datasets ☆68 Updated 10 months ago
- Supervised instruction fine-tuning for LLMs with the HF Trainer and DeepSpeed ☆32 Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆22 Updated 6 months ago
- Blenderbot ☆9 Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆204 Updated 8 months ago
- Experiments with generating open-source language model assistants ☆97 Updated last year
- Tutorial to pretrain and fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 2 years ago
- HuggingChat-like UI in Gradio ☆63 Updated last year
- YT_subtitles - extracts subtitles from YouTube videos to raw text for language model training ☆38 Updated 3 years ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆60 Updated last year
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection. ☆38 Updated last year