thawtar / ButaChanRLLinks
Reinforcement Learning using PyTorch
☆11Updated last year
Alternatives and similar repositories for ButaChanRL
Users that are interested in ButaChanRL are comparing it to the libraries listed below
Sorting:
- MobileViT Implementation in TensorFlow and Pytorch☆13Updated 2 years ago
- Distributed training (multi-node) of a Transformer model☆76Updated last year
- Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI m…☆224Updated 2 years ago
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation☆35Updated last year
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.☆69Updated last year
- Solving Problems with Applied Deep Learning (ITS-530)☆21Updated 2 months ago
- Tutorial for how to build BERT from scratch☆97Updated last year
- ☆15Updated last year
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- Pretraining and finetuning for visual instruction following with Mixture of Experts☆16Updated last year
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated last year
- From Scratch Implementation of some popular Deep Learning Papers with PyTorch and Tensorflow☆18Updated 2 years ago
- https://slds-lmu.github.io/seminar_multimodal_dl/☆170Updated 2 years ago
- Notebooks for fine tuning pali gemma☆112Updated 3 months ago
- ☆20Updated 4 years ago
- 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.☆50Updated last week
- ☆40Updated last year
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆112Updated 2 years ago
- Sythetic data generation and normalization functions powered by LLMs☆58Updated 10 months ago
- Notebook Examples used in machine learning writing and research☆81Updated last week
- Notes on the Mistral AI model☆20Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆27Updated 2 years ago
- ☆14Updated 4 years ago
- Unlock the potential of finetuning Large Language Models (LLMs). Learn from industry expert, and discover when to apply finetuning, data …☆63Updated last year
- Short experiment with Deep Q-Learning + KAN to play Flappy Bird.☆19Updated last year
- ☆76Updated 2 years ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆69Updated 4 months ago
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆197Updated last year
- ☆33Updated 8 months ago
- Notes and commented code for RLHF (PPO)☆102Updated last year