Hajime-Y / BitNet-b158
☆18Updated 10 months ago
Alternatives and similar repositories for BitNet-b158:
Users who are interested in BitNet-b158 are comparing it to the repositories listed below
- ☆83Updated last year
- Japanese LLaMa experiment☆53Updated 3 months ago
- ☆48Updated 3 months ago
- ☆52Updated 9 months ago
- Preferred Generation Benchmark☆78Updated 2 weeks ago
- ☆39Updated last month
- ☆15Updated last year
- ☆59Updated 9 months ago
- 0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" i…☆96Updated last year
- ☆27Updated last year
- [2024 edition] Text classification with BERT☆29Updated 8 months ago
- ☆14Updated 6 months ago
- RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities☆53Updated last year
- ☆22Updated last year
- JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024☆23Updated last year
- Swallow Project: evaluation scripts for large language models☆16Updated last week
- A Japanese translation of the Alpaca dataset☆90Updated last year
- A repository that attempts to build deep learning attention models from scratch using PyTorch's low-level APIs.☆56Updated last year
- ☆50Updated last year
- ☆16Updated 2 months ago
- ☆39Updated 7 months ago
- ☆29Updated 9 months ago
- ☆15Updated last year
- ☆25Updated 4 months ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆121Updated 4 months ago
- A framework for few-shot evaluation of autoregressive language models.☆149Updated 6 months ago
- The Remdis toolkit: Building advanced real-time multimodal dialogue systems with incremental processing and large language models☆88Updated last month
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆24Updated 2 years ago
- Exploring Japanese SimCSE☆69Updated last year
- ☆26Updated 3 years ago