A repository sharing the literatures about large language models
☆106Dec 22, 2025Updated 2 months ago
Alternatives and similar repositories for llms-learning
Users that are interested in llms-learning are comparing it to the libraries listed below
Sorting:
- ☆133Dec 9, 2025Updated 2 months ago
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks☆272Jul 30, 2024Updated last year
- ☆13Mar 8, 2024Updated last year
- This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".☆24May 28, 2025Updated 9 months ago
- 保存有关DDPM直播的资料☆20Apr 7, 2024Updated last year
- CMATH: Can your language model pass Chinese elementary school math test?☆50Jul 3, 2023Updated 2 years ago
- Repo for "Smart Word Suggestions" (SWS) task and benchmark☆20Dec 4, 2023Updated 2 years ago
- ☆10Apr 21, 2025Updated 10 months ago
- A brief and partial summary of RLHF algorithms.☆147Mar 4, 2025Updated last year
- This repository contains an implementation of an Extended LSTM or XLSTM model using TensorFlow.☆11Sep 18, 2024Updated last year
- This is a source code in pure JS to convert widely unsupported WebP format to JPG format (PNG also possible)☆12Apr 30, 2018Updated 7 years ago
- Obsidian Vault for my Cybersecurity learning☆11Oct 20, 2024Updated last year
- A repository sharing the literatures about the full-stack SOTA techniques of the autonomous driving system☆32Jun 29, 2024Updated last year
- ☆107Feb 25, 2025Updated last year
- Official implementation of REArtGS (NeurIPS 2025)☆19Oct 24, 2025Updated 4 months ago
- LLM Skirmish☆44Feb 3, 2026Updated last month
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- Tracking Of Agent (actions and belief) and Spatio-TEmporal Reasoning☆14Feb 7, 2020Updated 6 years ago
- About Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"☆13Oct 7, 2023Updated 2 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Feb 23, 2026Updated last week
- ☆11Jan 11, 2022Updated 4 years ago
- [SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph☆16Jun 6, 2025Updated 9 months ago
- Matlab/Octave toolbox for deep learning. Includes Deep Belief Nets, Stacked Autoencoders, Convolutional Neural Nets, Convolutional Autoen…☆21Jun 23, 2014Updated 11 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- ☆10Oct 9, 2025Updated 4 months ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- ☆14Mar 21, 2024Updated last year
- A fast and accurate index for distribution-aware dataset search.☆10Feb 3, 2026Updated last month
- A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training☆659Updated this week
- Matlab/Octave toolbox for deep learning. Includes Deep Belief Nets, Stacked Autoencoders, Convolutional Neural Nets, Convolutional Autoen…☆10Jul 10, 2013Updated 12 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Nov 22, 2022Updated 3 years ago
- Some learning materials, notes and scripts about the programming and security of microcontroller.☆15Mar 15, 2022Updated 3 years ago
- ☆11May 23, 2024Updated last year
- Submission Under Review☆17May 15, 2025Updated 9 months ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- Neural Networks for penetration testing. Part of active research.☆13Jun 21, 2022Updated 3 years ago
- ☆13May 3, 2024Updated last year
- Python implementation of Avro Phonetic☆10Feb 25, 2025Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year