2023 ABCI Llama-2 継続学習プロジェクト
☆14Jan 22, 2024Updated 2 years ago
Alternatives and similar repositories for Megatron-Llama2
Users that are interested in Megatron-Llama2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆44Feb 2, 2024Updated 2 years ago
- Finetuning LLaMA with DeepSpeed☆10Apr 14, 2023Updated 2 years ago
- Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models"☆11Oct 15, 2024Updated last year
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆24Sep 17, 2025Updated 6 months ago
- Elevating Chess Strategy with Fine-Tuned Large Language Model☆17Dec 8, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AML's goal is to make benchmarking of various AI architectures on Ampere CPUs a pleasurable experience :)☆23Feb 26, 2026Updated last month
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆17Jun 18, 2024Updated last year
- This is the code for the EMNLP2020 Finding paper "BERT for Monolingual and Cross-Lingual Reverse Dictionary"☆19Sep 27, 2020Updated 5 years ago
- NDL古典籍OCR学習用データセット(みんなで翻刻加工データ)☆20Mar 13, 2026Updated 2 weeks ago
- 音声を文字起こししてChatGPTと会話したい☆22Mar 8, 2023Updated 3 years ago
- Tensorflow: Generalizing Across Domains via Cross-Gradient Training☆15May 11, 2018Updated 7 years ago
- ☆57Jun 17, 2024Updated last year
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆24Oct 11, 2025Updated 5 months ago
- 日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark☆38Oct 7, 2025Updated 5 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Feb 18, 2025Updated last year
- ☆15Apr 2, 2025Updated 11 months ago
- An algorithm that intelligently executes a crypto order over time via Coinbase☆13Oct 26, 2021Updated 4 years ago
- This repo contains a code that uses colabxterm and langchain community packages to install Ollama on Google Colab free tier T4 and pulls …☆13May 9, 2024Updated last year
- ☆11Oct 2, 2024Updated last year
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆23Apr 24, 2024Updated last year
- I introduce the basic idea and implementation of 5 imputation approaches. In short, filling with a single value works well for a shorter…☆11Jan 11, 2023Updated 3 years ago
- Practical Explainable AI Using Python by Pradeepta Mishra☆13May 18, 2022Updated 3 years ago
- In this implementation, using the Flan T5 large language model, we performed the Text Classification task on the IMDB dataset and obtaine…☆23May 12, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆20Jun 24, 2024Updated last year
- SegRef3D: AI-Powered Segmentation and Interactive Refinement for Labor-Saving 3D Reconstruction☆16Feb 9, 2026Updated last month
- ☆33Dec 9, 2022Updated 3 years ago
- I use various Data Science and machine learning techniques to analyze customer data using STP framework. I preprocessed the data, perform…☆12Apr 26, 2020Updated 5 years ago
- ☆12Nov 14, 2024Updated last year
- ☆46Updated this week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Jul 20, 2023Updated 2 years ago
- Official PyTorch implementation of HealthyGAN - SASHIMI 2022☆10Feb 20, 2023Updated 3 years ago
- This is the implementation of the 4th place solution (yu4u's part) for CZII - CryoET Object Identification at Kaggle.☆16Mar 7, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Japanese LLaMa experiment☆54Dec 27, 2025Updated 3 months ago
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"☆31May 19, 2025Updated 10 months ago
- Causal Inference for Time Series Data (with CausalML Demo)☆14Jun 11, 2023Updated 2 years ago
- ☆12Dec 13, 2023Updated 2 years ago
- This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction"☆10Feb 16, 2024Updated 2 years ago
- scrape, clean and model IPO data with supervised ML☆10Aug 20, 2020Updated 5 years ago
- NDL-DocLデータセット(資料画像レイアウトデータセット)☆30Mar 2, 2023Updated 3 years ago