mil-tokyo / Megatron-VLM
☆22Updated 3 months ago
Alternatives and similar repositories for Megatron-VLM
Users who are interested in Megatron-VLM are comparing it to the libraries listed below.
- LLaVA-JP is a Japanese VLM trained with the LLaVA method☆60Updated 10 months ago
- ☆22Updated last year
- ☆33Updated last month
- A lightweight framework for evaluating visual-language models.☆25Updated 2 weeks ago
- ☆16Updated 8 months ago
- Creates a list of the latest LLMs☆17Updated last week
- Japanese LLaMa experiment☆53Updated 5 months ago
- Easily turn large English text datasets into Japanese text datasets using open LLMs.☆20Updated 3 months ago
- ☆11Updated last year
- CyberAgent AI Lab training: "Tutorial on speeding up and optimizing model code"☆32Updated 2 months ago
- Swallow project: evaluation scripts for large language models☆17Updated last month
- [2024 edition] Text classification with BERT☆29Updated 10 months ago
- Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"☆104Updated 3 months ago
- ☆60Updated 11 months ago
- Ongoing research project for continual pre-training of LLMs (dense mode)☆40Updated 2 months ago
- ☆26Updated 6 months ago
- FlexGen with docker☆29Updated 2 years ago
- ☆16Updated 4 months ago
- PyTorch tutorial for M1 students. This repository includes code for building an encoder-decoder model and a classification model.☆12Updated 2 years ago
- Search CVPR 2023 papers using ChatGPT☆14Updated last year
- This project uses llama.cpp as an LLM server to perform inference and generate speech using a synthetic voice library☆22Updated last year
- JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024☆23Updated last year
- ☆46Updated 8 months ago
- ☆14Updated 8 months ago
- A command-line tool that uses Gemini API to generate summaries of academic papers.☆43Updated 3 weeks ago
- ☆16Updated last year
- Ongoing research on training Mixture-of-Experts models.☆19Updated 7 months ago
- Japanese CLIP model☆13Updated 2 years ago
- A collection of AI Agents papers (Updated biweekly)☆81Updated last week
- Flexible evaluation tool for language models☆43Updated this week