This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
☆79Nov 14, 2023Updated 2 years ago
Alternatives and similar repositories for BERT-LoRA-TensorRT
Users that are interested in BERT-LoRA-TensorRT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Nov 23, 2023Updated 2 years ago
- ☆14Jan 24, 2022Updated 4 years ago
- ☆10May 6, 2024Updated 2 years ago
- An AI assistant for PCs powered by Meta's LLaMA3 using Hugging Face, runs on voice recognition, text-to-speech. Send messages, voice/vide…☆19Jun 6, 2024Updated last year
- python codes for iDNA-ABF: multi-scale deep biological language learning model for the accurate and interpretable prediction of DNA methy…☆15May 6, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17May 1, 2022Updated 4 years ago
- Various LLM Benchmarks☆25Feb 20, 2026Updated 2 months ago
- ☆14Mar 28, 2025Updated last year
- ☆12Apr 18, 2026Updated 3 weeks ago
- The backend behind the LLM-Perf Leaderboard☆11May 5, 2024Updated 2 years ago
- ☆17Mar 28, 2025Updated last year
- 在NLP领域中一些任务的Demo☆13Sep 11, 2023Updated 2 years ago
- Gene Neural Network (GNN)☆11Oct 5, 2019Updated 6 years ago
- ☆11Mar 12, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A quick and dirty script to call LLaMA.cpp in Python. Supports streaming and interactive mode.☆13Apr 17, 2023Updated 3 years ago
- Privacy preserving machine learning for small molecule data☆13Mar 2, 2026Updated 2 months ago
- competition☆16Aug 1, 2020Updated 5 years ago
- Python implementation of closed frequent subgraph mining algorithm cgSpan. Only undirected graphs are currently supported.☆13Dec 20, 2021Updated 4 years ago
- Homebrew formulas for installing LLM and related tools☆14Sep 6, 2023Updated 2 years ago
- ☆79Updated this week
- Train word2vec on Pubmed data, compare 2 GO terms, compare 2 genes.☆12Apr 7, 2017Updated 9 years ago
- PDB ProtVista Viewer☆11Jul 8, 2025Updated 10 months ago
- This repo consists of the code as discussed in the Medium blog.☆17Sep 10, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A tutorial example for nbdev☆15Feb 26, 2022Updated 4 years ago
- A Node.js tool to examine the correctness of Open Data Metadata and build custom dataset profiles☆12Sep 26, 2023Updated 2 years ago
- ChatBLOOM☆16May 5, 2023Updated 3 years ago
- A simple Panel-based dashboard visualizing geotagged tweets with hvplot and Datashader.☆17Mar 25, 2024Updated 2 years ago
- [ICML 2024] Generalizing Knowledge Graph Embedding with Universal Orthogonal Parameterization☆16May 12, 2024Updated last year
- A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health a…☆18Feb 24, 2025Updated last year
- TrojanLM: Trojaning Language Models for Fun and Profit☆16Jun 17, 2021Updated 4 years ago
- ☆63Nov 8, 2024Updated last year
- GenAI Playground☆23Nov 6, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Runpod VLLM Worker that Works !☆10Nov 14, 2023Updated 2 years ago
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso…☆13Dec 14, 2024Updated last year
- SaTML'23 paper "Backdoor Attacks on Time Series: A Generative Approach" by Yujing Jiang, Xingjun Ma, Sarah Monazam Erfani, and James Bail…☆21Feb 5, 2023Updated 3 years ago
- Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。☆12Jun 29, 2024Updated last year
- "How to Trust Your Diffusion Models: A Convex Optimization Approach to Conformal Risk Control"☆17Jan 6, 2026Updated 4 months ago
- Lego for GRPO☆30May 27, 2025Updated 11 months ago
- ERC721 變體測試(ERC721, ERC721A, ERC721Solmate, ERC721Psi ...)☆10Jul 18, 2022Updated 3 years ago