Reference implementation of Mistral AI 7B v0.1 model.
☆28 · Dec 25, 2023 · Updated 2 years ago
Alternatives and similar repositories for mistral-src-commented
Users who are interested in mistral-src-commented are comparing it to the libraries listed below.
- Code repository supporting the paper "Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segment… ☆12 · Apr 29, 2024 · Updated 2 years ago
- Notes and commented code for RLHF (PPO) ☆129 · Feb 27, 2024 · Updated 2 years ago
- LLaMA 2 implemented from scratch in PyTorch ☆369 · Sep 25, 2023 · Updated 2 years ago
- Sqlite3-based logging for Python ☆15 · May 27, 2024 · Updated last year
- Notes on quantization in neural networks ☆124 · Dec 14, 2023 · Updated 2 years ago
- DiskRAG is a high-performance vector search system built on top of DiskANN. ☆21 · Nov 21, 2025 · Updated 5 months ago
- PyTorch implementation of RRD: https://arxiv.org/abs/2407.12073 ☆15 · Dec 2, 2025 · Updated 5 months ago
- Implementation of Reinforce for educational purposes. ☆12 · Jun 12, 2023 · Updated 2 years ago
- ☆10 · May 21, 2023 · Updated 2 years ago
- receipt parsing using donut model, next we will add using LLM + OCR or VLM ☆19 · Jun 21, 2024 · Updated last year
- ☆10 · Apr 2, 2023 · Updated 3 years ago
- Whalegrad 🐳 is a lightweight deep learning library written in C. ☆10 · Jan 5, 2025 · Updated last year
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems ☆23 · Jul 4, 2025 · Updated 10 months ago
- Changes in this fork have been merged to upstream. ☆16 · Jun 10, 2025 · Updated 10 months ago
- ☆14 · Jul 29, 2021 · Updated 4 years ago
- [CVPR 2022] OCSampler: Compressing Videos to One Clip with Single-step Sampling ☆17 · Jun 21, 2022 · Updated 3 years ago
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT2 (~95M params). Fast, creative text generation tra… ☆17 · Apr 17, 2026 · Updated 2 weeks ago
- NCCU thesis template for XeLaTeX ☆19 · Jul 4, 2018 · Updated 7 years ago
- Add function calling to text-generation-inference ☆13 · Oct 10, 2023 · Updated 2 years ago
- Get up and running with Llama 2 and other large language models locally ☆15 · Updated this week
- One File Tensor Libraries ☆31 · Oct 7, 2025 · Updated 6 months ago
- This project uses handwriting recognition to recognize the names of medicines from a doctor's prescription. This is done using a Convolut… ☆18 · Nov 11, 2022 · Updated 3 years ago
- Video+code lecture on building nanoGPT from scratch ☆67 · Jun 14, 2024 · Updated last year
- This is a chatbot built using Gradio that can access Google Search and webpages to answer questions. Supports GPT-3.5, GPT-4, Claude 2, … ☆13 · Aug 31, 2023 · Updated 2 years ago
- Learn CUDA with PyTorch ☆284 · Apr 26, 2026 · Updated last week
- Training of a ResNet18 model using PyTorch compared to Torchvision ResNet18 model on the same dataset ☆19 · Nov 20, 2022 · Updated 3 years ago
- ☆18 · Nov 10, 2020 · Updated 5 years ago
- ☆14 · Jun 21, 2024 · Updated last year
- Lidar Obstacle Detector ☆25 · Jun 3, 2025 · Updated 11 months ago
- pytorch from scratch in pure C/CUDA and python ☆41 · Oct 10, 2024 · Updated last year
- Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without New Demonstrations", from USC / Amazon Robotics ☆35 · Aug 15, 2025 · Updated 8 months ago
- Notes on Direct Preference Optimization ☆25 · Apr 14, 2024 · Updated 2 years ago
- ☆30 · Apr 24, 2025 · Updated last year
- Some recipes for data engineering with Python ☆25 · Mar 23, 2021 · Updated 5 years ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms. ☆26 · Jun 17, 2025 · Updated 10 months ago
- Playing with CSM ☆22 · Mar 14, 2025 · Updated last year
- ☆13 · Mar 30, 2026 · Updated last month
- An unsupervised model merging algorithm for Transformers-based language models. ☆108 · Apr 29, 2024 · Updated 2 years ago
- Automatic differentiation for Triton Kernels ☆29 · Aug 12, 2025 · Updated 8 months ago