All about large language models
☆52May 29, 2024Updated last year
Alternatives and similar repositories for my-llm
Users that are interested in my-llm are comparing it to the libraries listed below
Sorting:
- ☆13Dec 6, 2018Updated 7 years ago
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 5 months ago
- A simple project training 3 separate NLP tasks simultaneously using Multitask-Learning☆23Jun 12, 2023Updated 2 years ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆25Nov 14, 2023Updated 2 years ago
- Our code will be public soon .☆27Mar 20, 2023Updated 2 years ago
- ☆31Mar 12, 2023Updated 2 years ago
- ☆18Jun 10, 2025Updated 8 months ago
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- EmoInt provides a high level wrapper to combine various word embeddings and creating ensembles from multiple trained models☆29Jul 24, 2020Updated 5 years ago
- A Data-Driven Approach to Predict the Success of Bank Telemarketing☆10Apr 27, 2021Updated 4 years ago
- Tool for sentiment analysis annotation☆13Mar 26, 2025Updated 11 months ago
- Source code, datasets and models of the paper "Efficient White-box Fairness Testing through Gradient Search" by Lingfeng Zhang, Yueling Z…☆11Jul 24, 2021Updated 4 years ago
- rag base on langchain☆11Mar 1, 2024Updated 2 years ago
- ☆16Feb 28, 2026Updated last week
- Debug DeepSpeed-Chat step by step in IDE (在IDE里一步一步调试DeepSpeed-Chat)☆10Apr 17, 2023Updated 2 years ago
- A collection of demos and utilities prepared ahead of the Vector Institute Privacy Enhancing Techniques (PETs) Bootcamp.☆15Sep 22, 2022Updated 3 years ago
- Guide to interviewing for industry machine learning roles (data/applied/research scientist, ML engineer, etc).☆11Dec 28, 2022Updated 3 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- Generate Quiz Question from PDF/Text files☆11Feb 2, 2024Updated 2 years ago
- ☆10Jul 16, 2023Updated 2 years ago
- ☆11Jan 13, 2023Updated 3 years ago
- ☆47Mar 25, 2025Updated 11 months ago
- 美丽东自然语言处理百宝箱~命名实体识别,文本分类,语言模型,文本摘要。☆10Nov 28, 2022Updated 3 years ago
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆11Sep 26, 2022Updated 3 years ago
- halcon算子阈值分割的实现☆12Apr 13, 2018Updated 7 years ago
- SGD/ADAM/Amsgrad/AdamW/RAdam/Lookahead☆10Nov 18, 2019Updated 6 years ago
- smplify code for point cloud based HMR☆10Jan 11, 2022Updated 4 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- SentiStorm - Real-time Twitter Sentiment Classification based on Apache Storm☆10May 22, 2018Updated 7 years ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 4 months ago
- Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"☆10Nov 17, 2020Updated 5 years ago
- ☆11Feb 23, 2026Updated last week
- Code of the paper https://arxiv.org/abs/2009.11939. A defocus blur estimation method.☆10Jan 13, 2022Updated 4 years ago
- Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019)☆10Nov 4, 2019Updated 6 years ago
- ☆10Oct 3, 2023Updated 2 years ago
- several algorithms for converting dependency structures into constituency structures.☆10Feb 7, 2022Updated 4 years ago
- LTX-Video-Trainer-GUI 是为LTX视频lora模型训练提供的GUI工具,支持通过简单的界面训练 LoRA 模型用于视频生成。本训练器提供了直观的 GUI 界面,使用户能够轻松设置和启动训练流程,无需编写复杂代码。☆13Jul 18, 2025Updated 7 months ago
- Transformer from scratch with einsum method☆11Jul 8, 2021Updated 4 years ago