☆48Aug 29, 2024Updated last year
Alternatives and similar repositories for optimized_hf_llama_class_for_training
Users that are interested in optimized_hf_llama_class_for_training are comparing it to the libraries listed below
Sorting:
- Easily run PyTorch on multiple GPUs & machines☆59Jan 8, 2026Updated 2 months ago
- AI model designed to test the effectiveness in handling external ethical attacks.☆11Feb 9, 2026Updated last month
- ☆10Dec 21, 2024Updated last year
- This is project for korean auto spacing☆12Aug 3, 2020Updated 5 years ago
- A chat implementation for FastHTML☆11Sep 14, 2025Updated 5 months ago
- Paper Review about Speech Recognition · NLP☆10Mar 25, 2021Updated 4 years ago
- ☆26Sep 3, 2025Updated 6 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- ☆11Oct 3, 2021Updated 4 years ago
- ☆13Jan 22, 2025Updated last year
- Utilities for Training Very Large Models☆58Sep 25, 2024Updated last year
- Deploy KoGPT with Triton Inference Server☆14Nov 18, 2022Updated 3 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.☆10Jan 21, 2022Updated 4 years ago
- ☆80Jun 5, 2024Updated last year
- ☆14Oct 18, 2023Updated 2 years ago
- ☆14May 3, 2022Updated 3 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- ☆13Apr 22, 2024Updated last year
- We can crawl NaverBlog, Twitter, Youtube!!☆14Sep 13, 2019Updated 6 years ago
- 음성인식과 신호처리☆14Sep 12, 2021Updated 4 years ago
- ☆10May 22, 2023Updated 2 years ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 5 months ago
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22May 28, 2025Updated 9 months ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆125Dec 29, 2025Updated 2 months ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆20Mar 18, 2025Updated 11 months ago
- [ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms☆37Jun 4, 2025Updated 9 months ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆22Oct 14, 2025Updated 4 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- ☆21Mar 23, 2022Updated 3 years ago
- Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.☆46Jun 11, 2025Updated 8 months ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆21Nov 28, 2022Updated 3 years ago
- ☆56Feb 11, 2026Updated 3 weeks ago
- ↔️ T5 Machine Translation from English to Korean☆18Aug 11, 2022Updated 3 years ago
- Few Shot Learning using EleutherAI's GPT-Neo an Open-source version of GPT-3☆18Jul 8, 2021Updated 4 years ago
- ☆21Feb 21, 2022Updated 4 years ago
- Fast high-dimensional exact KNN search.☆18Mar 1, 2017Updated 9 years ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- ☆20Jul 12, 2023Updated 2 years ago