hkproj / pytorch-transformer-distributedView external linksLinks
Distributed training (multi-node) of a Transformer model
☆94Apr 10, 2024Updated last year
Alternatives and similar repositories for pytorch-transformer-distributed
Users that are interested in pytorch-transformer-distributed are comparing it to the libraries listed below
Sorting:
- Notes on Direct Preference Optimization☆24Apr 14, 2024Updated last year
- ML algorithms implementations that are good for learning the underlying principles☆27Dec 7, 2024Updated last year
- Notes and commented code for RLHF (PPO)☆124Feb 27, 2024Updated last year
- ☆236Jan 2, 2025Updated last year
- BlockchainGPT: An intuitive, chat-based platform to manage your blockchain environments using natural language processing capabilities.☆11Jul 6, 2023Updated 2 years ago
- 🦾Distributed Natural Evolution Strategies Build with PyTorch and Ray☆18Jul 20, 2018Updated 7 years ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆50Dec 15, 2023Updated 2 years ago
- Slides for "Retrieval Augmented Generation" video☆24Nov 27, 2023Updated 2 years ago
- LLaMA 2 implemented from scratch in PyTorch☆366Sep 25, 2023Updated 2 years ago
- Attention is all you need implementation☆1,164Jun 8, 2024Updated last year
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆11Jun 28, 2022Updated 3 years ago
- ☆27Jun 6, 2024Updated last year
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning☆50Oct 23, 2025Updated 3 months ago
- Financial Analysis and Algorithmic Trading Strategies in Python☆11Feb 16, 2023Updated 2 years ago
- ☆10May 22, 2024Updated last year
- ☆46May 24, 2025Updated 8 months ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw☆592Dec 6, 2024Updated last year
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆10Dec 24, 2023Updated 2 years ago
- RL algorithm for stock trading with multiple reward functions☆11Apr 21, 2024Updated last year
- ☆10Jul 29, 2022Updated 3 years ago
- ☆13Feb 8, 2019Updated 7 years ago
- 排班管理系统☆11Jul 15, 2015Updated 10 years ago
- online shopping tool for flowers and gifts☆11Nov 13, 2017Updated 8 years ago
- Tally Prime MCP (Model Context Protocol) Server implementation to feed Tally ERP data to popular LLM like Claude, ChatGPT supporting MCP☆14Nov 11, 2025Updated 3 months ago
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated last month
- ☆12Mar 28, 2025Updated 10 months ago
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- ☆10Jul 8, 2021Updated 4 years ago
- Documentation and code for predictive maintenance data and assess scripts.☆11Jun 8, 2023Updated 2 years ago
- An IOT based mobile application to monitor the vitals such as ECG, Body Temperature, Blood Pressure using an ESP32 DevKit and React Nativ…☆11Nov 14, 2024Updated last year
- A simple GPT-3 interface to automate core legal writing tasks☆11Mar 8, 2023Updated 2 years ago
- Deploying a custom pytorch model to AWS Sagemaker using terraform and FastAPI☆10Nov 10, 2023Updated 2 years ago
- This is a dehazed method for remote sensing image, which based on CycleGAN.☆12May 10, 2022Updated 3 years ago
- YT2Brief: Transcribe and summarize YouTube videos using Langchain with power of LLMs.☆11Dec 21, 2023Updated 2 years ago
- react后台管理系统☆10Jan 3, 2023Updated 3 years ago
- Python基于YOLO&Deepsort的闯红灯检测系统(完整源码&自定义UI操作界面&视频教程)☆17Nov 18, 2023Updated 2 years ago
- ☆39Apr 5, 2024Updated last year
- 基于Scrapy框架的知乎用户爬虫☆10Feb 26, 2021Updated 4 years ago
- Online auction platform☆13Sep 24, 2022Updated 3 years ago