Swallowプロジェクト 事後学習済み大規模言語モデル 評価フレームワーク
☆28May 8, 2026Updated 2 weeks ago
Alternatives and similar repositories for swallow-evaluation-instruct
Users that are interested in swallow-evaluation-instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆24Sep 17, 2025Updated 8 months ago
- Ongoing research training Mixture of Expert models.☆22Sep 16, 2024Updated last year
- A Python implementation of a graph-based parser for Abstract Meaning Representation (AMR)☆11Feb 2, 2018Updated 8 years ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆126Apr 10, 2026Updated last month
- ☆14Jan 12, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A shell-friendly hyperparameter search tool inspired by Optuna☆18Dec 17, 2024Updated last year
- ☆27Nov 4, 2024Updated last year
- Recording Composition Tool Hisui☆25Updated this week
- Implementation of Wide Residual Networks in Keras☆10Sep 10, 2016Updated 9 years ago
- Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2…☆17Jul 10, 2020Updated 5 years ago
- LaTeX document class for the proceedings of ANLP☆21Oct 28, 2025Updated 6 months ago
- The tool facilitates debugging convergence issues and testing new algorithms and recipes for training LLMs using Nvidia libraries such as…☆19Sep 17, 2025Updated 8 months ago
- Show notes for https://anchor.fm/yoheikikuta.☆15Apr 24, 2022Updated 4 years ago
- Use custom tokenizers in spacy-transformers☆16Aug 9, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Checkpointable dataset utilities for foundation model training☆32Jan 29, 2024Updated 2 years ago
- デジタル化資料から作成したOCRテキストデータのngram頻度統計情報のデータセット☆15Jan 10, 2023Updated 3 years ago
- ☆27Mar 28, 2026Updated last month
- ☆40Oct 21, 2025Updated 7 months ago
- Training and evaluation scripts for JGLUE, a Japanese language understanding benchmark☆18May 12, 2026Updated last week
- You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wik…☆18May 2, 2021Updated 5 years ago
- Survey of audio language models☆65Apr 18, 2026Updated last month
- LLMとLoRAを用いたテキスト分類☆98Jul 22, 2023Updated 2 years ago
- Funer is Rule based Named Entity Recognition tool.☆22Apr 21, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆62Jun 13, 2024Updated last year
- 0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" i…☆98Mar 1, 2024Updated 2 years ago
- ☆16Mar 4, 2024Updated 2 years ago
- ☆78Apr 14, 2026Updated last month
- This project uses llama.cpp as an LLM server to perform inference and generate speech using Synthetic voice library☆22Mar 5, 2024Updated 2 years ago
- ☆49Dec 18, 2024Updated last year
- ☆57Jun 17, 2024Updated last year
- ☆19Dec 6, 2024Updated last year
- ☆22Sep 18, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Sep 29, 2024Updated last year
- ☆25May 29, 2025Updated 11 months ago
- DefSent: Sentence Embeddings using Definition Sentences☆23Aug 5, 2021Updated 4 years ago
- Abstract Meaning Representation (AMR) reader☆35Feb 24, 2020Updated 6 years ago
- A beamer template mainly for Japanese.☆14Apr 21, 2024Updated 2 years ago
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages☆23Feb 13, 2023Updated 3 years ago
- publish twitter likes☆42Mar 4, 2026Updated 2 months ago