hkproj / mistral-llm-notes
Notes on the Mistral AI model
☆18Updated last year
Alternatives and similar repositories for mistral-llm-notes:
Users who are interested in mistral-llm-notes are comparing it to the repositories listed below
- Reference implementation of Mistral AI 7B v0.1 model.☆28Updated last year
- End-to-End LLM Guide☆101Updated 7 months ago
- Notes about LLaMA 2 model☆53Updated last year
- Sample notebooks and prompts for LLM evaluation☆120Updated 2 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 8 months ago
- Various installation guides for Large Language Models☆63Updated 3 months ago
- A set of scripts and notebooks on LLM finetuning and dataset creation☆103Updated 4 months ago
- CompanionLLM - A framework to finetune LLMs to be your own sentient conversational companion☆40Updated last year
- Using LlamaIndex with Ray for productionizing LLM applications☆71Updated last year
- Set of scripts to finetune LLMs☆36Updated 10 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆136Updated 6 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆94Updated last year
- Document Q&A on Wikipedia articles using LLMs☆75Updated last year
- Collection of recipes aiding Gen AI model development☆98Updated last week
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆230Updated 3 months ago
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.☆295Updated this week
- 💻 Decoding ML articles hub: Hands-on articles with code on production-grade ML☆122Updated 2 months ago
- Build Enterprise RAG (Retriever Augmented Generation) Pipelines to tackle various Generative AI use cases with LLMs by simply plugging co…☆109Updated 6 months ago
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling☆130Updated 3 months ago
- LoRA and DoRA from Scratch Implementations☆196Updated 11 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆124Updated last year
- An index of all of our weekly concepts + code events for aspiring AI Engineers and Business Leaders!!☆60Updated this week
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆90Updated last month
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆93Updated 2 months ago
- A collection of fine-tuning notebooks!☆26Updated last year
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆141Updated 3 weeks ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆272Updated last week