Reference implementation of Mistral AI 7B v0.1 model.
☆28Dec 25, 2023Updated 2 years ago
Alternatives and similar repositories for mistral-src-commented
Users that are interested in mistral-src-commented are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Notes on the Mistral AI model☆20Dec 27, 2023Updated 2 years ago
- LLaMA 2 implemented from scratch in PyTorch☆369Sep 25, 2023Updated 2 years ago
- DiskRAG is a high-performance vector search system built on top of DiskANN.☆21Nov 21, 2025Updated 4 months ago
- [arXiv 2024] PyTorch implementation of RRD: https://arxiv.org/abs/2407.12073☆15Dec 2, 2025Updated 4 months ago
- Flask-Mail - 使用 Python Flask 完成寄信功能☆12May 3, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆127Jul 24, 2023Updated 2 years ago
- Implementation of Reinforce for educational purposes.☆12Jun 12, 2023Updated 2 years ago
- Context7 Scoring Library☆30Sep 19, 2025Updated 6 months ago
- ☆10May 21, 2023Updated 2 years ago
- ☆10Apr 2, 2023Updated 3 years ago
- Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)☆12Jun 20, 2025Updated 9 months ago
- Whalegrad 🐳 is a lightweight deep learning library written in C.☆10Jan 5, 2025Updated last year
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆23Jul 4, 2025Updated 9 months ago
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT2 (~95M params). Fast, creative text generation tra…☆16Mar 9, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆14Jul 29, 2021Updated 4 years ago
- One File Tensor Libraries☆29Oct 7, 2025Updated 6 months ago
- Add function calling to text-generation-inference☆13Oct 10, 2023Updated 2 years ago
- Get up and running with Llama 2 and other large language models locally☆15Updated this week
- Global Satellite Assessment Tool (GlobalSAT)☆19Feb 1, 2026Updated 2 months ago
- ☆25Dec 12, 2025Updated 4 months ago
- Video+code lecture on building nanoGPT from scratch☆67Jun 14, 2024Updated last year
- Learn CUDA with PyTorch☆268Updated this week
- Mine-tuning is a methodology for synchronizing human and AI attention.☆19Jun 16, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆18Nov 10, 2020Updated 5 years ago
- 榮獲全國前標獎項:Accuracy in the Top 8% Nationwide; AI CUP 2024 E.SUN Artificial Intelligence Open Competition – Application of RAG and LLM in Fi…☆23Sep 2, 2025Updated 7 months ago
- ☆17Feb 12, 2025Updated last year
- ☆22May 1, 2024Updated last year
- Lidar Obstacle Detector☆25Jun 3, 2025Updated 10 months ago
- robotic arm hardware beta release of RX2 humanoid☆21Oct 10, 2024Updated last year
- Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without New Demonstrations", from USC / Amazon Robotics☆35Aug 15, 2025Updated 8 months ago
- Notes on Direct Preference Optimization☆25Apr 14, 2024Updated 2 years ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆26Jun 17, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is the official repository for the spine segmentation course using nnUNet.☆28Sep 16, 2024Updated last year
- LLM training in simple, raw C/CUDA☆18May 6, 2024Updated last year
- ☆23Jan 13, 2026Updated 3 months ago
- An automation tool for deploying 5G Open RAN (ORAN) testbeds.☆55Apr 7, 2026Updated last week
- A trio of Google-Colab notebooks (ipynb) for training a GPT-2 (127M) model from scratch (useful for other / non-English languages) using …☆17Jun 29, 2020Updated 5 years ago
- ☆13Mar 30, 2026Updated 2 weeks ago
- An unsupervised model merging algorithm for Transformers-based language models.☆108Apr 29, 2024Updated last year