TMMMU-Benchmark / evaluation
Evaluation code for benchmarking VLMs on Traditional Chinese understanding
☆12Updated 2 months ago
Alternatives and similar repositories for evaluation
Users that are interested in evaluation are comparing it to the libraries listed below
- [Kaggle-2nd] Lightweight yet Effective Chinese LLM.☆50Updated last month
- Fine-tune Llama 2 with a Traditional Chinese dataset☆38Updated last year
- Generative Fusion Decoding (GFD) is a novel framework for integrating Large Language Models (LLMs) into multi-modal text recognition syst…☆82Updated last month
- ☆25Updated 4 years ago
- just collections about Llama2☆44Updated 10 months ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Updated 7 months ago
- 🏃 hosting nlp models in one line☆20Updated last year
- Make pytorch and tensorflow two become one.☆72Updated last week
- A distributed training framework for large language models powered by Lightning.☆22Updated 4 months ago
- Personal Colab collections that I find interesting.☆54Updated 7 months ago
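- MediaTek Research is dedicated to research on foundation models. We put this research into models suited to Traditional Chinese users and, where licensing permits, provide the models for academic research or industry use.☆240Updated 4 months ago
- Champion team of the "E.SUN AI Open Challenge 2019 Summer Competition - Taiwan Real Estate AI Prediction" hosted by Tbrain☆1Updated 5 years ago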
- This tool allows you to access TWCC through CLI.☆20Updated last year
- A Traditional-Chinese instruction-following model with datasets based on Alpaca.☆137Updated 2 years ago
- Twinkle Eval: an efficient and accurate AI evaluation tool☆65Updated 2 weeks ago
- Free intents (and more goodies) for Loki NLU Engine☆40Updated 2 months ago
- Taiwanese Speech Synthesis with Tacotron2☆21Updated 2 years ago
- deeplearning record☆50Updated last year
- 🦅🔗 Building FlyteGPT on Flyte with LangChain☆29Updated last year
- A simple python reproduction and modification of the 2022 Ig Nobel Prize for Economics "Which Is More Important: Talent or Luck?"☆28Updated 2 years ago
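- National Yang Ming Chiao Tung University will cut the cloud drive storage of alumni G Suite accounts to 5 GB in November 2022; this project uses an automated program to download the cloud drive and organize its folders.☆42Updated 2 years ago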
- a collection of tools to make the works better and easier☆15Updated 4 years ago
- Data structures implemented in Python☆18Updated 6 years ago
- A method that directly addresses the modality gap by aligning speech token with the corresponding text transcription during the tokenizat…☆81Updated last week
- ROUGE score calculator with Traditional Chinese word segmentation☆9Updated 4 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Updated 2 years ago
- Official GitHub repo for TMMLU+, a large-scale Traditional Chinese massive multitask language understanding benchmark☆45Updated 11 months ago
- ☆24Updated 2 years ago
- A LINE Bot demo showcasing how to use a local LLM (Gemma) via Groq to modify personal information and detect the need for LLM assistance.☆17Updated 11 months ago
- A data loader designed for big data☆14Updated last year