seonglae / llama2gptq
Chat with LLaMa 2 and get responses accompanied by reference documents retrieved from a vector database. The model runs locally using GPTQ 4-bit quantization.
☆30Updated 11 months ago
Related projects
Alternatives and complementary repositories for llama2gptq
- Manage histories of LLM-based applications☆86Updated 11 months ago
- HuggingChat like UI in Gradio☆64Updated last year
- Fine-tuning Mistral 7B using Hugging Face, Weights & Biases, Choline, and Vast AI☆38Updated last year
- Evaluate your LLM apps, RAG pipeline, any generated text, and more!☆0Updated 6 months ago
- 1-Click is all you need.☆58Updated 6 months ago
- Generate synthetic data for LLM fine-tuning in arbitrary situations in a systematic way☆21Updated 7 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆115Updated 10 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆45Updated 7 months ago
- Weekly visualization report of Open LLM model performance based on 4 metrics.☆88Updated 10 months ago
- An OpenAI Completions API compatible server for NLP transformers models☆55Updated 11 months ago
- ☆64Updated last year
- "Learning-based One-line intelligence Owner Network Connectivity Tool"☆15Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆32Updated 8 months ago
- ☆37Updated 11 months ago
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server☆45Updated last year
- A Gradio web UI for Large Language Models. Supports LoRA/QLoRA fine-tuning, RAG (retrieval-augmented generation), and chat☆32Updated 11 months ago
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Updated 6 months ago
- Fine-tuning llama2 using the chain-of-thought approach☆10Updated 11 months ago
- Here is a Google Colab Notebook for fine-tuning Alpaca Lora (within 3 hours with a 40GB A100 GPU)☆38Updated last year
- Construct a vector database through sentence embeddings and make your LLM respond based on this database.☆8Updated 9 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆42Updated last year
- Overview and tutorials of the LlamaIndex Library☆17Updated last year
- Official implementation of "OffsetBias: Leveraging Debiased Data for Tuning Evaluators"☆14Updated last month
- ☆83Updated last year
- ⚡️ Asynchronous framework for ChatGPT API 🤖☆21Updated last year
- ☆68Updated last year
- ☆37Updated last year
- Awesome series for LLMOps☆34Updated 3 months ago
- ☆35Updated 7 months ago