A custom Huggingface trainer which supports logging auxiliary losses returned by your model
☆15Jul 27, 2025Updated 7 months ago
Alternatives and similar repositories for custom_hf_trainer
Users that are interested in custom_hf_trainer are comparing it to the libraries listed below
Sorting:
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- pytorch☆10Apr 13, 2022Updated 3 years ago
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆24Feb 21, 2026Updated last week
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Feb 23, 2026Updated last week
- This package will help you perform a multiple minumum Monte Carlo conformer search as described in Chang et al., 1989. It is built to be …☆32Jan 22, 2026Updated last month
- TensorFlow MOOC Code Implementation 入门实操课程代码完整实现 TensorFlow官方社区联合中国MOOC出品☆11Oct 22, 2023Updated 2 years ago
- This repository provides some useful snippets that you may need in some situations.☆12Jan 16, 2024Updated 2 years ago
- ☆19Aug 23, 2025Updated 6 months ago
- [ICML'25] The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products☆17Jul 16, 2025Updated 7 months ago
- ☆18Dec 9, 2025Updated 2 months ago
- Andrea Fusiello's computer vision Matlab functions☆16Jul 25, 2025Updated 7 months ago
- The Polaris datasets and benchmarks recipes☆12May 26, 2025Updated 9 months ago
- ☆40Jan 16, 2026Updated last month
- 🔥 open-ss2: a third-party open-source implementation of Figure AI's Helix "System 1, System 2" VLA model for high-rate, dexterous humano…☆11Mar 18, 2025Updated 11 months ago
- 搜集关于黑苹果的内容☆10Aug 1, 2021Updated 4 years ago
- libtpms / swtpm software emulation of a Trusted Platform Module (TPM 1.2 and TPM 2.0) compile script☆13Sep 16, 2020Updated 5 years ago
- Recursive Self-Aggregation evals on ARC-AGI☆28Jan 26, 2026Updated last month
- Create my own language in Compilers Principle Lab, I call it Quary. In this repository, I provide all the source code.☆12Jan 25, 2021Updated 5 years ago
- Course project for CS410. Drug Molecular Toxicity Prediction with GCN + Cloud ML Infra.☆10Apr 6, 2021Updated 4 years ago
- ☆11Oct 15, 2023Updated 2 years ago
- My Machine Learning repository☆10Apr 10, 2017Updated 8 years ago
- Metadata Editor user and practice guide☆17Feb 23, 2026Updated last week
- ☆13Feb 2, 2023Updated 3 years ago
- BERT系列模型、搜搜、剪枝、蒸馏☆13Sep 10, 2020Updated 5 years ago
- A lightweight, self-hosted infrastructure layer for deploying and managing LLM agents as resilient microservices. Features automatic r…☆17Aug 4, 2025Updated 7 months ago
- ☆10Oct 12, 2021Updated 4 years ago
- Verify that any MCP server is running the intended and untampered code via hardware attestation.☆17Mar 28, 2025Updated 11 months ago
- personal info☆10Mar 23, 2024Updated last year
- k8s CSI driver for FastCFS☆13Mar 17, 2024Updated last year
- ☆13Jun 15, 2021Updated 4 years ago
- 基于RNN的中国古诗词生成模型(SJTU CS382 Course Project)☆10Dec 27, 2018Updated 7 years ago
- Code for the paper "Automated Generation of Hospital Discharge Summaries Using Clinical Guidelines and Large Language Models"☆11May 3, 2024Updated last year
- Chat with your documents using Generative AI & Retrieval-Augmented Generation (RAG)☆14Jul 10, 2025Updated 7 months ago
- Poor man's simple harvester for arXiv resources☆13Jul 14, 2023Updated 2 years ago
- A Visualization Tool for GPU Occupancy on S Cluster.☆13Nov 16, 2022Updated 3 years ago
- ⛰️ PrexSyn: Efficient and Programmable Exploration of Synthesizable Chemical Space☆41Updated this week
- 2018.6-2018.7软件学院暑期工程实训 项目:一起看电影(影伴)☆13Sep 1, 2022Updated 3 years ago
- Created a deep learning algorithm to predict the price of an Airbnb listing only from a picture from the listing☆11Jan 22, 2021Updated 5 years ago
- This is a demo for CNN models training on quickdraw dataset. Implemented with pytorch.☆13May 28, 2019Updated 6 years ago