dwzq-com-cn / DongwuLLMView external linksLinks
This is the codebase for pre-training, compressing, extending, and distilling LLMs with Megatron-LM.
☆12Mar 11, 2024Updated last year
Alternatives and similar repositories for DongwuLLM
Users that are interested in DongwuLLM are comparing it to the libraries listed below
Sorting:
- OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-1…☆25May 10, 2024Updated last year
- The framework to prune LLMs to any size and any config.☆95Mar 1, 2024Updated last year
- CMD: a framework for Context-aware Model self-Detoxification (EMNLP2024 Long Paper)☆17Feb 10, 2025Updated last year
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 4 months ago
- Evaluating the faithfulness of long-context language models☆30Oct 21, 2024Updated last year
- The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca☆98Apr 5, 2023Updated 2 years ago
- ☆28May 24, 2025Updated 8 months ago
- [ACL 2024 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"☆60Mar 20, 2024Updated last year
- ☆36Oct 14, 2022Updated 3 years ago
- COMS 4111 Project 1☆12Jul 21, 2022Updated 3 years ago
- Diffusion Model Improvement Method☆34Sep 4, 2023Updated 2 years ago
- A static website for a Chatbot with Azure OpenAI, Azure Text to Speech Services and Live2D☆13Sep 4, 2024Updated last year
- 今日头条搜索引擎以及新闻详情页爬虫(Selenium)☆15Mar 13, 2025Updated 11 months ago
- A Streamlit-based chatbot application using Gemini models for NLP. Features include light/dark mode toggle, model selection (Gemini 1.5 F…☆10May 23, 2024Updated last year
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated last year
- 简单的 AIGC 微服务,可通过 HTTP、gRPC 连接,支持流式回答。☆10Mar 23, 2023Updated 2 years ago
- treelite runtime binding in Rust☆12Jun 12, 2025Updated 8 months ago
- Long Context Research☆26Jan 26, 2026Updated 3 weeks ago
- ☆11Dec 19, 2023Updated 2 years ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆15Apr 15, 2024Updated last year
- Mathematics + Statistics Courses at the University of Alberta☆15Jan 8, 2023Updated 3 years ago
- Repository of shared bibtex files (references)☆11Jan 28, 2026Updated 2 weeks ago
- DeepSearch - Advanced Web Dir Scanner☆14Nov 13, 2018Updated 7 years ago
- pubg_sdk☆11Jul 26, 2020Updated 5 years ago
- ☆15Jan 26, 2026Updated 3 weeks ago
- Google Gemini Voice/Vision Assistant with gemini-1.5-pro / gemini-1.5-flash modal ! #Gemini 1.5 Flash #Gemini 1.5 Pro☆11May 18, 2024Updated last year
- Applies ROME and MEMIT on Mamba-S4 models☆14Apr 5, 2024Updated last year
- A Python package to convert Strava activities to GPX format using data from the Strava API.☆13Jan 4, 2025Updated last year
- Go bindings for LLama.cpp☆14Apr 11, 2023Updated 2 years ago
- Language Translator using the new GPT-4o model☆17May 15, 2024Updated last year
- Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- A simple fastp-MultiQC nextflow pipeline☆12Feb 2, 2023Updated 3 years ago
- Topic models for microblogging content☆10Sep 23, 2015Updated 10 years ago
- MDClub 的 JavaScript 版 SDK☆11May 29, 2022Updated 3 years ago
- Hyperparameter tuning for FCN using Ray Tune☆14Sep 11, 2020Updated 5 years ago
- Automatically exported from code.google.com/p/hf-2011☆15Feb 12, 2016Updated 10 years ago
- RAG-QA is a free, containerised question-answer framework that allows you to ask questions to your documents in an intuitive way☆20Jan 25, 2024Updated 2 years ago
- Code for Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model, IJCAI 2020☆12Nov 26, 2020Updated 5 years ago
- Head used in Poppy Torso and Poppy Humanoid☆15Jul 3, 2021Updated 4 years ago