shi3z / BitNetLinks
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
☆20Updated last year
Alternatives and similar repositories for BitNet
Users that are interested in BitNet are comparing it to the libraries listed below
Sorting:
- Japanese LLaMa experiment☆54Updated last month
- ☆89Updated 2 years ago
- 【2024年版】BERTによるテキスト分類☆30Updated last year
- ☆31Updated last year
- alpacaデータセットを日本語化したものです☆86Updated 2 years ago
- Preferred Generation Benchmark☆90Updated 3 months ago
- Alpaca-LoRAをlivedoorニュースコーパスでFineTuningさせるサンプルコード☆21Updated 2 years ago
- JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024☆25Updated last year
- LLMとLoRAを用いたテキスト分類☆98Updated 2 years ago
- ☆51Updated 2 years ago
- The Remdis toolkit: Building advanced real-time multimodal dialogue systems with incremental processing and large language models☆102Updated 7 months ago
- ☆36Updated 3 months ago
- ☆29Updated last year
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆24Updated 2 years ago
- Mecab + NEologd + Docker + Python3☆36Updated 3 years ago
- A web app that automatically generates transcripts and summaries of meetings or lectures.☆65Updated 2 years ago
- ☆141Updated 2 years ago
- ☆65Updated 4 years ago
- GPTがYouTuberをやります☆63Updated 2 years ago
- ☆44Updated 4 months ago
- ☆40Updated last year
- DeepLearningのAttentionモデルをPytorchの低レベルAPIを使って1から制作しようという試みのリポジトリです。☆72Updated last month
- DistilBERT model pre-trained on 131 GB of Japanese web text. The teacher model is BERT-base that built in-house at LINE.☆46Updated 2 years ago
- ☆24Updated 2 years ago
- ☆19Updated 4 months ago
- ☆27Updated 2 years ago
- Tools to implement active inferring and pseudo-consciousness in LLM☆112Updated 2 years ago
- General-purpose Swich transformer based Japanese language model☆118Updated 2 years ago
- ☆101Updated last year
- ☆149Updated 4 months ago