📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024
☆40Oct 15, 2024Updated last year
Alternatives and similar repositories for llm_training_full_stack
Users that are interested in llm_training_full_stack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- All in one PDF Parser Toolkit☆17Sep 15, 2023Updated 2 years ago
- FeelingBlue: A Corpus for Understanding the Emotional Connotation of Color in Context, accepted at TACL 2022, presented at ACL 2023☆13Dec 28, 2023Updated 2 years ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆21Mar 18, 2025Updated last year
- ☆14Mar 5, 2024Updated 2 years ago
- Format your bibtex (.bib) file to help standardize citations for conference and journal submissions☆14Nov 23, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Deep RL agents for NASimEmu. See also https://github.com/jaromiru/NASimEmu.☆15Jul 16, 2024Updated last year
- ☆13Mar 29, 2026Updated last month
- ☆12May 14, 2024Updated last year
- The code of "Deep Regression Representation Learning with Topology" in ICML 2024☆14Jul 4, 2024Updated last year
- A project to automatically generate program repair recommendation in the field of smart contracts for given code snippets with their cont…☆16Aug 30, 2025Updated 8 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- ☆19Nov 7, 2024Updated last year
- superquadrics based grasping☆13Dec 4, 2018Updated 7 years ago
- [NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and R…☆38Sep 27, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Oct 12, 2023Updated 2 years ago
- This repository contains an implementation of the Batch-BKB algorithm as described in the ICML 2020 paper "Near-linear time Gaussian proc…☆13Jul 14, 2020Updated 5 years ago
- Mind map for the course on Andrew Ng Machine Learning and popular platforms and libs for AI.☆12Dec 1, 2023Updated 2 years ago
- VeighNa框架的万得Wind数据服务接口☆18Jun 11, 2025Updated 10 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆24Apr 30, 2025Updated last year
- ☆18Dec 17, 2023Updated 2 years ago
- Scholarly Big Data Subject Category Classifier☆10Jul 15, 2019Updated 6 years ago
- 📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, AAAI, IJCAI, ICML, AAMAS, ICLR, ICRA, etc. | (AI…☆11Aug 20, 2023Updated 2 years ago
- 这是中国人民大学高瓴人工智能学院本科课程《强化学习》的期末项目安排,项目内容是训练一个适用于国标麻将的强化学习智能体。☆26Aug 25, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆16Aug 14, 2023Updated 2 years ago
- under review☆14Mar 1, 2021Updated 5 years ago
- ☆15May 4, 2024Updated 2 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- ☆24Aug 8, 2022Updated 3 years ago
- ☆21Apr 8, 2026Updated 3 weeks ago
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"☆17Mar 1, 2023Updated 3 years ago
- How to use Bootstrap with Flask - Free Sample | AppSeed☆16Mar 12, 2024Updated 2 years ago
- Official implementation of “Watch Your Step: A Fine-Grained Evaluation Framework for Multi-hop Knowledge Editing in Large Language Models…☆45Nov 25, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Oct 15, 2023Updated 2 years ago
- ☆14Aug 15, 2024Updated last year
- AttentionDTA: prediction of drug–target binding affinity using attention model.https://ieeexplore.ieee.org/abstract/document/8983125☆13Aug 29, 2020Updated 5 years ago
- Multi-Agent Reinforcement Learning (MARL) method to learn scalable control polices for multi-agent target tracking (IROS22).☆11Jul 22, 2022Updated 3 years ago
- I use various Data Science and machine learning techniques to analyze customer data using STP framework. I preprocessed the data, perform…☆12Apr 26, 2020Updated 6 years ago
- Unsupervised learning coupled with applied factor analysis to the five-factor model (FFM), a taxonomy for personality traits used to desc…☆16Jun 19, 2021Updated 4 years ago
- Multimodal Model for Memotion Dataset☆12May 17, 2021Updated 4 years ago