michaelthwan / llm_family_chart
LLM family chart
☆50Updated last year
Alternatives and similar repositories for llm_family_chart:
Users that are interested in llm_family_chart are comparing it to the libraries listed below
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆169Updated 9 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 6 months ago
- experiments with inference on llama☆104Updated 8 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆101Updated 2 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆67Updated 4 months ago
- Instruct-tuning LLaMA on consumer hardware☆66Updated last year
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆215Updated 10 months ago
- ☆65Updated 8 months ago
- ☆38Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆100Updated 10 months ago
- Data preparation code for Amber 7B LLM☆85Updated 9 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆64Updated 3 months ago
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 6 months ago
- Evaluating LLMs with CommonGen-Lite☆88Updated 11 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆82Updated last week
- ☆199Updated last year
- Hallucinations (Confabulations) Document-Based Benchmark for RAG☆90Updated last week
- Drop in replacement for OpenAI, but with Open models.☆153Updated last year
- ☆60Updated last year
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ☆64Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆130Updated this week
- Code repository for the c-BTM paper☆105Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated 11 months ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆40Updated last year