Fine-tuning GPT-J-6B on colab or equivalent PC GPU with your custom datasets: 8-bit weights with low-rank adaptors (LoRA)
☆73Jun 18, 2022Updated 4 years ago
Alternatives and similar repositories for Finetune_GPT-J_6B_8-bit
Users that are interested in Finetune_GPT-J_6B_8-bit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression☆69Oct 5, 2022Updated 3 years ago
- A Simple Discord Bot with a Rasa Connection☆10Jan 3, 2025Updated last year
- Repo for fine-tuning Casual LLMs☆465Mar 27, 2024Updated 2 years ago
- ☆15May 4, 2023Updated 3 years ago
- Fast Neural Machine Translation in C++ - development repository☆23May 12, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Django Google custom search engine app.☆29Oct 20, 2016Updated 9 years ago
- ☆64Oct 1, 2021Updated 4 years ago
- QGIS.бг докс е документация за QGIS на български и друг географски софтуер с отворен код☆13Jun 12, 2026Updated last week
- 🚀 Simple blog built with Next.js + TailwindCSS☆12Jul 10, 2021Updated 4 years ago
- ☆16Apr 28, 2023Updated 3 years ago
- A zero-shot relation extractor, easily downloadable from the HuggingFace repo.☆12Aug 13, 2021Updated 4 years ago
- 💬 简单在线聊天室。☆11Apr 15, 2019Updated 7 years ago
- Longformer Encoder Decoder model for the legal domain, trained for long document abstractive summarization task.☆10Feb 26, 2021Updated 5 years ago
- ☆11Oct 6, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Streamlit-based chatbot application using Gemini models for NLP. Features include light/dark mode toggle, model selection (Gemini 1.5 F…☆10May 23, 2024Updated 2 years ago
- ☆13Oct 22, 2023Updated 2 years ago
- Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition☆24Dec 10, 2018Updated 7 years ago
- Search and indexing your own Google Drive Files using GPT3, LangChain, and Python☆44Feb 7, 2023Updated 3 years ago
- simple usage of yolov8 on android device☆22Apr 26, 2024Updated 2 years ago
- Smart contract that holds an ERC20 token and provides getTokens method to claim free tokens.☆10Dec 3, 2022Updated 3 years ago
- Elo ratings for time-series forecasting packages☆25Jan 5, 2022Updated 4 years ago
- Automated FOREX trading using recurrent reinforcement learning☆35Dec 8, 2022Updated 3 years ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆11Apr 26, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Simple tool for recursively scraping/validating emails and phone numbers from web pages.☆11Oct 1, 2021Updated 4 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- Certified Reasoning with Language Models☆31Dec 6, 2023Updated 2 years ago
- Validates Aadhaar Number, Aadhaar VID☆26Dec 30, 2022Updated 3 years ago
- to analyze martial arts motion with CMU OpenPose. License depends on OpenPose.☆11Mar 28, 2019Updated 7 years ago
- ☆14Mar 19, 2021Updated 5 years ago
- ChatRPC is a framework that allows large language models to interact with external services.☆10Dec 25, 2023Updated 2 years ago
- ☆14Dec 8, 2022Updated 3 years ago
- UI for an app where you 🔎 find your match.👫☆14Nov 10, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Artifacts for the "SurgeProtector: Mitigating Temporal Algorithmic Complexity Attacks using Adversarial Scheduling" paper that appears in…☆13Jun 24, 2022Updated 3 years ago
- Minimal implementation of multiple PEFT methods for LLaMA fine-tuning☆13May 7, 2023Updated 3 years ago
- DocQues answers queries on longer and multiple documents build on GPT-Index and GPT-3☆13Jan 1, 2023Updated 3 years ago
- A Dropbox clone created using Django☆10Apr 21, 2023Updated 3 years ago
- An Algorithmic Day Trading Bot☆13Aug 18, 2020Updated 5 years ago
- WASM web-application that allows you to mint ERC-721 and ERC-1155 tokens in all major testnets☆13Oct 6, 2022Updated 3 years ago
- Learn how to combine Nginx + wigs + load balancing + flask + unit testing + Docker☆11Jun 2, 2021Updated 5 years ago