sanjibnarzary / awesome-llm
Curated list of open source and openly accessible large language models
☆24Updated last year
Related projects ⓘ
Alternatives and complementary repositories for awesome-llm
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- Efficient few-shot learning with cross-encoders.☆40Updated 9 months ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Updated last year
- aiXplain enables python programmers to add AI functions to their software.☆27Updated this week
- Lightweight tools for quick and easy LLM demo's☆26Updated last month
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆42Updated last year
- Experiments w/ ChatGPT, LangChain, local LLMs☆24Updated last year
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆23Updated last year
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆33Updated last year
- Efficiently computing & storing token n-grams from large corpora☆15Updated last month
- LLM finetuning☆42Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆50Updated this week
- Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.☆46Updated 2 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆174Updated last year
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub☆56Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Updated 8 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆40Updated 3 weeks ago
- ☆32Updated last year
- ☆56Updated 2 years ago
- ☆54Updated this week
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆206Updated 10 months ago
- The pipeline for the OSCAR corpus☆162Updated 11 months ago
- Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆90Updated 3 weeks ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- Experiments with Hugging Face 🔬 🤗☆45Updated 3 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆26Updated last year
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)☆60Updated last year
- Experimental sampler to make LLMs more creative☆30Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆120Updated 2 weeks ago