☆47Aug 29, 2024Updated last year
Alternatives and similar repositories for optimized_hf_llama_class_for_training
Users that are interested in optimized_hf_llama_class_for_training are comparing it to the libraries listed below.
- Easily run PyTorch on multiple GPUs & machines☆60May 2, 2026Updated last month
- GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!☆20Oct 29, 2022Updated 3 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- ☆26Sep 3, 2025Updated 9 months ago
- Utilities for Training Very Large Models☆59Sep 25, 2024Updated last year
- ☆11Oct 3, 2021Updated 4 years ago
- ☆79Jun 5, 2024Updated 2 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- AI model designed to test the effectiveness in handling external ethical attacks.☆11Feb 9, 2026Updated 4 months ago
- ☆13Jan 22, 2025Updated last year
- ☆14Dec 21, 2025Updated 5 months ago
- Deploy KoGPT with Triton Inference Server☆14Nov 18, 2022Updated 3 years ago
- [ICLR 2026] Quantile Advantage Estimation for Entropy-Safe Reasoning☆29Oct 14, 2025Updated 8 months ago
- ☆14Aug 16, 2023Updated 2 years ago
- ☆14May 3, 2022Updated 4 years ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆128May 16, 2026Updated last month
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- ☆13Jun 18, 2024Updated 2 years ago
- Reinforcement Learning Agent that plays Heroic - Magic Duel☆15Jun 23, 2020Updated 5 years ago
- A pipeline for using API calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- ☆71Jul 11, 2024Updated last year
- Energetic Graph Neural Networks (EGNN) implementation based on Dirichlet Energy Constrained Learning.☆27Nov 1, 2021Updated 4 years ago
- Speech recognition and signal processing☆14Sep 12, 2021Updated 4 years ago
- End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.☆10Jan 21, 2022Updated 4 years ago
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs"☆155Oct 15, 2024Updated last year
- Collection of autoregressive model implementation☆85Jun 10, 2026Updated last week
- [ACL 2025] iAgent: LLM Agent as a Shield between User and Recommender Systems☆32May 23, 2025Updated last year
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …☆20Mar 13, 2026Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆60Oct 18, 2025Updated 8 months ago
- [WWW2022] Geometric Graph Representation Learning via Maximizing Rate Reduction☆26May 27, 2022Updated 4 years ago
- We can crawl NaverBlog, Twitter, Youtube!!☆14Sep 13, 2019Updated 6 years ago
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.☆75Aug 2, 2024Updated last year
- Few-shot learning using EleutherAI's GPT-Neo, an open-source version of GPT-3☆19Jul 8, 2021Updated 4 years ago
- ☆94Oct 5, 2023Updated 2 years ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Aug 3, 2021Updated 4 years ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Lengths (ICLR 2024)☆209May 20, 2024Updated 2 years ago
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆55Jun 4, 2026Updated 2 weeks ago
- ☆10May 22, 2023Updated 3 years ago