This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low-Rank Adaptation (LoRA). The models are optimized for high-performance inference using NVIDIA's TensorRT.
☆78 · Nov 14, 2023 · Updated 2 years ago
Alternatives and similar repositories for BERT-LoRA-TensorRT
Users that are interested in BERT-LoRA-TensorRT are comparing it to the libraries listed below.
- ☆14 · Jun 6, 2024 · Updated last year
- ☆17 · Nov 23, 2023 · Updated 2 years ago
- ☆14 · Jan 24, 2022 · Updated 4 years ago
- ☆27 · Feb 23, 2026 · Updated 3 months ago
- ☆13 · Mar 23, 2025 · Updated last year
- 9th solution ☆11 · Oct 11, 2022 · Updated 3 years ago
- An AI assistant for PCs powered by Meta's LLaMA3 using Hugging Face, runs on voice recognition, text-to-speech. Send messages, voice/vide… ☆19 · Jun 6, 2024 · Updated last year
- A Max for Live device based on nn~ for real-time latent interaction and bending in Ableton. ☆20 · Jul 8, 2025 · Updated 10 months ago
- NLP on Korean news articles. Automatic topic extraction through dynamic clustering. ☆12 · Sep 15, 2017 · Updated 8 years ago
- Original LISP version of Meta-AQUA ☆14 · Sep 3, 2018 · Updated 7 years ago
- ☆14 · Mar 28, 2025 · Updated last year
- The backend behind the LLM-Perf Leaderboard ☆11 · May 5, 2024 · Updated 2 years ago
- ☆17 · Mar 28, 2025 · Updated last year
- Demos of several tasks in the NLP domain ☆13 · Sep 11, 2023 · Updated 2 years ago
- competition ☆16 · Aug 1, 2020 · Updated 5 years ago
- Python library for parsing and converting SAMI files. ☆13 · Mar 28, 2015 · Updated 11 years ago
- General-purpose methods and templates for competitions ☆17 · Sep 8, 2020 · Updated 5 years ago
- Homebrew formulas for installing LLM and related tools ☆14 · Sep 6, 2023 · Updated 2 years ago
- MCP server for Korean law data access via open.law.go.kr API (experimental. Works in Claude Code) ☆30 · Jul 7, 2025 · Updated 10 months ago
- An instance segmentation challenge on Basketball images, with a particular focus on occlusion resolution. An opportunity to publish at MM… ☆16 · Aug 8, 2023 · Updated 2 years ago
- A simple Panel-based dashboard visualizing geotagged tweets with hvplot and Datashader. ☆17 · Mar 25, 2024 · Updated 2 years ago
- [ICML 2024] Generalizing Knowledge Graph Embedding with Universal Orthogonal Parameterization ☆16 · May 12, 2024 · Updated 2 years ago
- ☆12 · Oct 18, 2020 · Updated 5 years ago
- TrojanLM: Trojaning Language Models for Fun and Profit ☆16 · Jun 17, 2021 · Updated 4 years ago
- Repository to reproduce "Cascade-based Echo Chamber Detection" accepted at CIKM2022 ☆11 · Mar 13, 2024 · Updated 2 years ago
- Python client for Marqo ☆31 · May 12, 2026 · Updated 2 weeks ago
- A Quick Draw implementation that recognizes the doodles and shapes you feed into the system. ☆11 · Jul 31, 2020 · Updated 5 years ago
- ☆12 · Jun 20, 2024 · Updated last year
- ☆63 · Nov 8, 2024 · Updated last year
- GenAI Playground ☆23 · Nov 6, 2024 · Updated last year
- A literature review for constructing and using knowledge graphs in a biomedical setting. ☆11 · May 22, 2020 · Updated 6 years ago
- ☆12 · Aug 6, 2024 · Updated last year
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso… ☆13 · Dec 14, 2024 · Updated last year
- Official repository for "Proxy-based Item Representation for Attribute and Context-aware Recommendation", WSDM 2024. ☆12 · Jan 22, 2024 · Updated 2 years ago
- "How to Trust Your Diffusion Models: A Convex Optimization Approach to Conformal Risk Control" ☆17 · Jan 6, 2026 · Updated 4 months ago
- Lego for GRPO ☆30 · May 27, 2025 · Updated last year
- Partial Codes and datasets for NeurIPS'19 "Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers" ☆20 · Nov 1, 2019 · Updated 6 years ago
- ☆12 · Jul 13, 2023 · Updated 2 years ago
- ☆29 · Sep 4, 2024 · Updated last year