thawtar / ButaChanRL

Reinforcement Learning using PyTorch

☆11

Alternatives and similar repositories for ButaChanRL

Users who are interested in ButaChanRL are comparing it to the libraries listed below.

rashmimarganiatgithub / LLMS_Library_2023
LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries.
☆69 Updated 2 years ago
HumanSignal / RLHF
Collection of links, tutorials, and best practices on how to collect data and build an end-to-end RLHF system to fine-tune Generative AI m…
☆224 Updated 2 years ago
rohan-paul / Deep-Learning-Paper-Implementation
From-scratch implementations of some popular deep learning papers with PyTorch and TensorFlow
☆18 Updated 2 years ago
saikhu / Docker-Guide-for-AI-Model-Development-and-Deployment
This repo gives a starting point for Docker.
☆35 Updated last year
hkproj / pytorch-lora
LoRA: Low-Rank Adaptation of Large Language Models implemented using PyTorch
☆119 Updated 2 years ago
ariG23498 / fine-tune-paligemma
Notebooks for fine-tuning PaliGemma
☆117 Updated 8 months ago
pphuc25 / distil-cd
Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation
☆35 Updated last year
niconielsen32 / AlgortihmsAndDataStructures
☆14 Updated 5 years ago
RanaMostafaAbdElMohsen / Traffic_Sign_Recognition
This repository is for a research project at Cairo University, computer engineering department.
☆14 Updated 3 years ago
huynguyen250896 / Introduction-to-Applied-Linear-Algebra
Introduction to Applied Linear Algebra: Vectors, Matrices, and Least Squares - Stephen Boyd & Lieven Vandenberghe
☆12 Updated 5 years ago
slds-lmu / seminar_multimodal_dl
https://slds-lmu.github.io/seminar_multimodal_dl/
☆171 Updated 2 years ago
voxel51 / papers-with-data
A curated list of papers that released datasets along with their work
☆126 Updated last year
Tobiadefami / fuxion
Synthetic data generation and normalization functions powered by LLMs
☆58 Updated last year
ibm-self-serve-assets / SuperKnowa
Build Enterprise RAG (Retrieval-Augmented Generation) pipelines to tackle various Generative AI use cases with LLMs by simply plugging co…
☆117 Updated last year
ThinamXx / Meta-llama
Complete implementation of Llama2 with/without KV cache & inference 🚀
☆49 Updated last year
coaxsoft / pytorch_bert
Tutorial for how to build BERT from scratch
☆101 Updated last year
HySonLab / ViDeBERTa
ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023
☆58 Updated 2 years ago
FareedKhan-dev / Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.
☆195 Updated last year
ariG23498 / gemma3-object-detection
Fine-tune Gemma 3 on an object detection task
☆95 Updated 5 months ago
Sakil786 / llm_using_petals
llm_using_petals
☆17 Updated 2 years ago
rajlm10 / D2L-Torch
Learning PyTorch through the D2L book. A series of notebooks for the same
☆27 Updated 3 years ago
philschmid / deep-learning-habana-huggingface
☆32 Updated 3 years ago
FareedKhan-dev / gpt4o-from-scratch
Implementation of a GPT-4o-like multimodal model from scratch using Python
☆75 Updated 9 months ago
cindysridykhan / instruct_storyteller_tinyllama2
Training and fine-tuning an LLM in Python and PyTorch.
☆43 Updated 2 years ago
IbrahimSobh / Practical-DRL
This is a practical resource that makes it easier to learn about and apply Practical Deep Reinforcement Learning (DRL) https://ibrahimsob…
☆99 Updated 5 years ago
AntonioGr7 / pratical-llms
A collection of hands-on notebooks for LLM practitioners
☆51 Updated 11 months ago
ngtranminhtuan / LLMOPS
NLP/LLM MLOps pipeline for development, training, and evaluation, with scalable deployment and monitoring systems.
☆22 Updated last year
ayulockin / neurips-llm-efficiency-challenge
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
☆129 Updated 2 years ago
cristianleoo / models-from-scratch-python
Repo where I recreate some popular machine learning models from scratch in Python
☆123 Updated 10 months ago
AhmedSSoliman / Llama2-CodeGen-Fine-Tuning-LLama-2
☆15 Updated 2 years ago