sanjibnarzary / awesome-llm
Curated list of open source and openly accessible large language models
☆25Updated last year
Alternatives and similar repositories for awesome-llm:
Users that are interested in awesome-llm are comparing it to the libraries listed below
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated last year
- ☆31Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆54Updated last year
- Hosting the JSON for the GPT4 Tokenizer☆64Updated last year
- A visual tool to interpret and understand PyTorch machine learning models☆16Updated last year
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project☆52Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆114Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆81Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- Efficient few-shot learning with cross-encoders.☆49Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Open source library for few shot NLP☆77Updated last year
- LLM finetuning☆42Updated last year
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)☆60Updated last year
- ☆37Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated last year
- Efficiently computing & storing token n-grams from large corpora☆18Updated 4 months ago
- Embeddings focused small version of Llama NLP model☆103Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆29Updated last month
- The Next Generation Multi-Modality Superintelligence☆71Updated 5 months ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆44Updated 4 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆32Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Updated 2 years ago
- ☆57Updated 5 months ago
- Lightweight tools for quick and easy LLM demo's☆26Updated 5 months ago
- Pipeline for pulling and processing online language model pretraining data from the web☆175Updated last year