☆17 · Apr 9, 2025 · Updated 11 months ago
Alternatives and similar repositories for deepseek_from_scratch
Users that are interested in deepseek_from_scratch are comparing it to the libraries listed below.
- python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn ☆14 · Aug 13, 2024 · Updated last year
- nanobody melting temperature prediction using protein embeddings ☆11 · Feb 24, 2025 · Updated last year
- ☆14 · Feb 5, 2025 · Updated last year
- High Performance FP8 GEMM Kernels for SM89 and later GPUs. ☆20 · Jan 24, 2025 · Updated last year
- The MXNet Implementation of ShuffleNet v1, v2 and MobileFaceNet ☆10 · Feb 28, 2019 · Updated 7 years ago
- Elixir: Train a Large Language Model on a Small GPU Cluster ☆15 · Jun 8, 2023 · Updated 2 years ago
- This is a repository of Binary General Matrix Multiply (BGEMM) by customized CUDA kernel. Thank FP6-LLM for the wheels! ☆18 · Aug 30, 2024 · Updated last year
- Comprehensive Implementation of Proximal Policy Optimization ☆12 · Aug 3, 2021 · Updated 4 years ago
- Old book pages (with groundtruth), formerly used for OCR studies. There are several versions of the set (concerning resolution and binari… ☆15 · Aug 25, 2017 · Updated 8 years ago
- ☆15 · Feb 23, 2025 · Updated last year
- This repo can contain all the analysis, machine learning work done using the patient data or the external data. ☆32 · May 14, 2020 · Updated 5 years ago
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections ☆21 · Oct 15, 2024 · Updated last year
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning ☆27 · Mar 23, 2025 · Updated last year
- JAX tutorials for PyTorch users ☆13 · Feb 18, 2023 · Updated 3 years ago
- Natural Language to Code ☆14 · May 2, 2021 · Updated 4 years ago
- PyTorch Implementation of GPT-2 ☆32 · Sep 4, 2024 · Updated last year
- Collection of random notes, mostly transcribed from paper and mostly old. I take no responsibility for content! ☆12 · Mar 27, 2020 · Updated 5 years ago
- This is code depository for my upcoming session. Will update details post the session ☆41 · Jan 29, 2023 · Updated 3 years ago
- A tool to generate image dataset for sequences of handwritten digits using MNIST database ☆12 · Nov 1, 2018 · Updated 7 years ago
- PoS crypto coin over ipfs distributed storage network (with new consensus protocol 🙌) ☆16 · Apr 10, 2024 · Updated last year
- Convert LLaMA3.1-8B to DeepSeek R1 MLA & MoE (raw) ☆24 · Mar 10, 2025 · Updated last year
- A Stress Annotated Dataset for Recognizing Everyday Stressors in SMS-like Conversational Systems ☆14 · Apr 22, 2021 · Updated 4 years ago
- Named Entity Recognition implemented by PyTorch including BiLSTM and BiLSCTM+CRF ☆13 · Apr 20, 2020 · Updated 5 years ago
- Some common CUDA kernel implementations (Not the fastest). ☆29 · Dec 5, 2025 · Updated 3 months ago
- This is just a collection of projects that made during my DEEPLEARNING NANODEGREE by UDACITY ☆15 · May 5, 2018 · Updated 7 years ago
- Resources and Tool for Bangla language computation ☆14 · Feb 20, 2026 · Updated last month
- ☆17 · Jul 12, 2022 · Updated 3 years ago
- Learning Pytorch ☆13 · Oct 31, 2023 · Updated 2 years ago
- ChatGPT as a search engine ☆25 · Sep 26, 2023 · Updated 2 years ago
- Artificial Intelligence Professional Program by Stanford School of Engineering ☆19 · May 9, 2023 · Updated 2 years ago
- Complete Reinforcement Learning Toolkit for Large Language Models! ☆21 · Aug 2, 2025 · Updated 7 months ago
- Reproduction of the paper 《Learning a Deep Convolutional Network for Image Super-Resolution》(ECCV 2014) by Pytorch and Matlab. ☆16 · Sep 4, 2019 · Updated 6 years ago
- MAFIA: Multiple Application Framework for GPU architectures ☆28 · Jan 21, 2022 · Updated 4 years ago
- This is the work from my learnings from the Data Science with Python course offered by DataCamp. This was a really helpful course as it s… ☆10 · Mar 12, 2021 · Updated 5 years ago
- Spelling and grammatical error detection and correction using N-grams Model ☆13 · Nov 12, 2018 · Updated 7 years ago
- This repository contains my solutions and stand-alone Colab-friendly notebooks for the Intro to Deep Learning with PyTorch Course on Udac… ☆19 · Jan 7, 2019 · Updated 7 years ago
- ☆17 · Mar 18, 2019 · Updated 7 years ago
- CSE 473: Introduction to Artificial Intelligence (taught by Rajesh Rao) ☆16 · May 23, 2014 · Updated 11 years ago
- 【NeurIPS 2024】Official implementation of "Visual Fourier Prompt Tuning" ☆40 · Jan 17, 2025 · Updated last year