at-aaims / forge
☆12Updated 3 weeks ago
Alternatives and similar repositories for forge
Users that are interested in forge are comparing it to the libraries listed below
Sorting:
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Updated 3 weeks ago
- Tasks and tutorials using Graphore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace☆16Updated last year
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆18Updated this week
- ☆47Updated 6 months ago
- Latent Large Language Models☆18Updated 8 months ago
- Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks"☆19Updated last week
- ☆48Updated last year
- AMD HPC Research Fund Cloud☆13Updated last week
- ☆31Updated 2 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated 2 months ago
- train with kittens!☆57Updated 6 months ago
- look how they massacred my boy☆63Updated 6 months ago
- ☆13Updated 10 months ago
- A parallel framework for training deep neural networks☆58Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆52Updated 3 months ago
- ☆13Updated 2 years ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 11 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 11 months ago
- ☆61Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated last month
- An introduction to LLM Sampling☆78Updated 4 months ago
- ☆43Updated last year
- ☆29Updated 4 months ago
- Pre-training code for CrystalCoder 7B LLM☆54Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated 2 years ago
- ☆11Updated last week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆33Updated this week
- Make triton easier☆47Updated 11 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆86Updated this week