LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset
★14 · Feb 2, 2025 · Updated last year
Alternatives and similar repositories for llm-summarization
Users who are interested in llm-summarization are comparing it to the libraries listed below.
- [🏆 1st place (Minister's Award) solution] 2022 National Institute of Korean Language AI Language Proficiency Evaluation (Aspect-Based Sentiment Analysis on shopping-mall review data) ★11 · Jun 6, 2023 · Updated 3 years ago
- Codebase for ICML'24 paper: Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs ★27 · Jun 25, 2024 · Updated last year
- Children's Programming and Artificial Intelligence Education ★11 · Dec 30, 2019 · Updated 6 years ago
- Source code of our paper "Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022 ★13 · Apr 13, 2022 · Updated 4 years ago
- A list of Rasa resources curated by Rasa and the community. ★11 · Apr 29, 2020 · Updated 6 years ago
- Stacking machine learning models: tuning, feature engineering, scaling, model combinations and parameters. ★11 · Oct 4, 2020 · Updated 5 years ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e… ★11 · Dec 27, 2024 · Updated last year
- ★12 · Feb 22, 2023 · Updated 3 years ago
- Toonification of real face images using PyTorch, StyleGAN2 and image-to-image translation ★13 · Jun 14, 2022 · Updated 4 years ago
- Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation" ★216 · May 28, 2026 · Updated 2 weeks ago
- jQuery vs. JS comparison table; learn JS through Jupyter notebooks. ★11 · Sep 27, 2019 · Updated 6 years ago
- ★16 · Oct 6, 2024 · Updated last year
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation" ★15 · Aug 26, 2025 · Updated 9 months ago
- Bot that addresses typical questions about the COVID-19 virus to help you handle high volumes of questions from your customers, partners … ★12 · Dec 5, 2022 · Updated 3 years ago
- A plugin for StackStorm ★14 · Feb 13, 2019 · Updated 7 years ago
- Claude-router is the best project for using open models in claude-code ★56 · Sep 4, 2025 · Updated 9 months ago
- Rasa X Jokebot Demo ★16 · Apr 8, 2024 · Updated 2 years ago
- Accompanies Finastra's Hack to the Future 4 Learning Session "Sustainability reports & NLP" ★10 · Mar 17, 2022 · Updated 4 years ago
- [NAACL 2024] TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition ★17 · Jan 5, 2026 · Updated 5 months ago
- ★11 · Sep 16, 2024 · Updated last year
- RISCV C and Triton AI-Benchmark ★25 · Jan 28, 2026 · Updated 4 months ago
- [ACM MM 2025] Phys4DGen: Physics-Compliant 4D Generation with Multi-Material Composition Perception ★13 · Apr 18, 2026 · Updated last month
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed ★21 · May 27, 2024 · Updated 2 years ago
- ★10 · Feb 12, 2024 · Updated 2 years ago
- Bazel defs and rules for building Python projects with nanobind extensions. ★12 · Mar 12, 2026 · Updated 3 months ago
- A GitHub repo that makes the hwpxlib package easy to use from Python. ★34 · Mar 29, 2025 · Updated last year
- This repository is the official implementation of Topology-Informed Graph Transformer (Choi et al., GRaM Workshop at ICML 2024). ★12 · Dec 28, 2024 · Updated last year
- This repo implements "Dense Passage Retrieval for Open-Domain Question Answering" using a Korean dataset ★74 · Oct 21, 2022 · Updated 3 years ago
- Code implementation that blocks foreign-language token generation in LLMs ★87 · Aug 7, 2025 · Updated 10 months ago
- By fine-tuning GPT2 on News Aggregator data ★15 · Jan 24, 2021 · Updated 5 years ago
- Semantic and Instance Segmentation on iOS Using a Flask API - DeepLabV3+ and Mask R-CNN ★20 · Oct 3, 2020 · Updated 5 years ago
- ★14 · Sep 16, 2021 · Updated 4 years ago
- Render, select coordinates, export to video and more. ★13 · Apr 28, 2024 · Updated 2 years ago
- Simply drag and drop your PDF files into Preve to get started. Ask Preve questions about your document. Get Summaries, key points, specif… ★11 · Apr 9, 2026 · Updated 2 months ago
- Easy and flexible way of encoding and decoding data into either strings or bytes. ★11 · Dec 17, 2025 · Updated 5 months ago
- ★10 · Sep 21, 2024 · Updated last year
- Text perturbation methods to evaluate the robustness of NLP models ★20 · Oct 6, 2021 · Updated 4 years ago
- Memory-efficient fine-tuning; supports fine-tuning 7B models with 24 GB of GPU memory ★21 · May 26, 2024 · Updated 2 years ago
- Quick start for Errbot on Windows with PowerShell Integration ★17 · Jul 9, 2021 · Updated 4 years ago