The newest version of Llama 3, with source code explained line by line in Chinese
☆22Apr 19, 2024Updated 2 years ago
Alternatives and similar repositories for llama3_explained
Users that are interested in llama3_explained are comparing it to the libraries listed below.
- ☆12Jan 9, 2024Updated 2 years ago
- Llama3 Chinese Evaluation with SuperCLUE: a comprehensive evaluation of Chinese versions of the open-source Llama3 model, based on the SuperCLUE benchmark☆16Apr 21, 2024Updated 2 years ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆23Jun 15, 2025Updated 11 months ago
- ☆20Dec 14, 2024Updated last year
- (CVPR 2023) TokenHPE: Learning Orientation Tokens for Efficient Head Pose Estimation via Transformers☆12Oct 29, 2023Updated 2 years ago
- Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)☆11Aug 24, 2024Updated last year
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- Tasks and tutorials using Graphcore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace☆16Mar 12, 2024Updated 2 years ago
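- This project uses the Firefly model training framework to fine-tune the LLAMA-2 model on the multiple-choice machine reading comprehension task (Multiple Choice MRC), achieving significant improvement.☆11Sep 16, 2023Updated 2 years ago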
- 🎓 Automatically updates agent papers daily using GitHub Actions (updates every 12 hours); Chinese translations of the abstracts are included☆37May 16, 2026Updated last week
- implementation code for 'PLATE: A Prompt-Enhanced Paradigm for Multi-Scenario Recommendations' in SIGIR 2023☆13Sep 27, 2024Updated last year
- The code for the CIKM 2023 (Oral Presentation) paper: A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NE…☆14Jul 19, 2024Updated last year
- ☆13May 11, 2022Updated 4 years ago
- To try textin document parsing, visit https://cc.co/16YSIy ☆22Jul 9, 2024Updated last year
- Temporal and Causal Reasoning (dataset)☆10Apr 19, 2022Updated 4 years ago
- ☆27Jul 18, 2023Updated 2 years ago
- ☆98Mar 20, 2024Updated 2 years ago
- ☆13Apr 18, 2024Updated 2 years ago
- An LLM training library for instruction-tuning.☆26Mar 4, 2024Updated 2 years ago
- Code and data for paper "Large language models can rate news outlet credibility"☆13Aug 10, 2024Updated last year
- ☆17Jan 1, 2019Updated 7 years ago
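- A WeChat chatbot built on large language models. It supports WeChat, WeCom, WeChat Official Accounts, and Feishu; lets you choose among GPT3.5/GPT4.0/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/LinkAI; handles text, voice, and images; can access the operating system and the internet; and supports building customized enterprise customer-service assistants on a private knowledge base.☆14Jan 15, 2024Updated 2 years ago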
- [VLDB'2025] LEAP: LLM-powered End-to-end Automatic Library for Processing Social Science Queries on Unstructured Data☆20Nov 3, 2025Updated 6 months ago
- ☆12Dec 13, 2022Updated 3 years ago
- Generate the WizardCoder Instruct from the CodeAlpaca☆21Jun 27, 2023Updated 2 years ago
- Uses transfer learning (VGGFace) to detect facial landmarks. Also uses MobileNet for compression (mean absolute error is 0.964)☆14Mar 8, 2018Updated 8 years ago
- ☆13Oct 4, 2022Updated 3 years ago
- A dataset and CLIP baseline for unrepresentative news thumbnail detection (ACL 2022 workshop)☆12May 26, 2022Updated 4 years ago
- Implementation of QKVAE☆11Feb 24, 2023Updated 3 years ago
- ☆13Jun 18, 2024Updated last year
- A Chainer implementation of doc2vec☆10Nov 16, 2017Updated 8 years ago
- ☆22Dec 13, 2023Updated 2 years ago
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆16Mar 20, 2023Updated 3 years ago
- This is the source code of IJCNN 2023 paper TieFake: Title-Text Similarity and Emotion-Aware Fake News Detection (TieFake).☆16Dec 21, 2023Updated 2 years ago
- Young Labeled Faces in the Wild (YLFW): A Dataset for Children Faces Recognition☆14Sep 18, 2024Updated last year
- The code for domain-robust language identification with adversarial loss☆15May 29, 2018Updated 7 years ago
- Repo for the EMNLP2021 paper: Lifelong Event Detection with Knowledge Transfer☆14Sep 2, 2021Updated 4 years ago
- neuralpy - neural network library written in python☆12Jun 25, 2023Updated 2 years ago
- LLM-driven browser automation library built on Playwright with 67 CLI/SDK tools, stable snapshot refs, and stealth mode. Playwright-based L…☆70Updated this week