thawtar / ButaChanRL
Reinforcement Learning using PyTorch
☆11Updated last year
Alternatives and similar repositories for ButaChanRL
Users that are interested in ButaChanRL are comparing it to the libraries listed below
- MobileViT Implementation in TensorFlow and Pytorch☆13Updated 3 years ago
- Code example for pretraining an LLM with vanilla PyTorch training loop☆10Updated last year
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation☆35Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆69Updated last year
- Fine tune Gemma 3 on an object detection task☆92Updated 5 months ago
- 👨🏻‍💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated last year
- Sample tutorials for training Natural Language Processing Models with Transformers☆22Updated 2 years ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆58Updated 2 years ago
- https://slds-lmu.github.io/seminar_multimodal_dl/☆171Updated 2 years ago
- Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI m…☆224Updated 2 years ago
- Pretraining and finetuning for visual instruction following with Mixture of Experts☆16Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆23Updated last year
- Quy Nhon AI Hackathon 2022 - Challenge 2: Review Analytics - Top 1 Solution☆10Updated 3 years ago
- ☆75Updated last year
- This FastAPI-based RAG service processes OCR data, generates embeddings using OpenAI, and utilizes Pinecone as a vector database for sear…☆17Updated last year
- Synthetic data generation and normalization functions powered by LLMs☆58Updated last year
- LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries.☆69Updated last year
- Short experiment with Deep Q-Learning + KAN to play Flappy Bird.☆19Updated last year
- Notebooks for fine tuning pali gemma☆117Updated 8 months ago
- ☆38Updated last year
- We finetune Bloomz-7b1-mt using LoRA with the chatdoctor-200k dataset at https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt an…☆30Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆117Updated 2 years ago
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆32Updated 11 months ago
- ☆48Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26Updated last year
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- A collection of hands-on notebooks for LLM practitioners☆51Updated 11 months ago
- ☆15Updated 2 years ago
- Pre-training script for BART in JAX/Flax☆38Updated 3 years ago