☆17Aug 7, 2024Updated last year
Alternatives and similar repositories for self-learning-llm-public
Users that are interested in self-learning-llm-public are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…☆13Mar 24, 2024Updated 2 years ago
- (WWW'25 + Netflix) The first CRS that retrieves collaborative filtering knowledge with two-step context-aware reflection.☆21Sep 10, 2025Updated 7 months ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 6 months ago
- Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication☆21Mar 21, 2024Updated 2 years ago
- ☆10Jan 23, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Import or partially refresh your Google Sheets from Excel files☆17Mar 18, 2026Updated last month
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆15Apr 23, 2025Updated 11 months ago
- Teaching materials for BayesCog workshop, UKE Hamburg (Part 1).☆15Dec 4, 2023Updated 2 years ago
- Transformer experiments☆16May 8, 2023Updated 2 years ago
- ☆10Sep 29, 2024Updated last year
- MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023)☆36Sep 26, 2023Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Aug 2, 2023Updated 2 years ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated last year
- Tutorials for working with ADCIRC data and the CERA visualization software☆10Mar 12, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 2 months ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆19Mar 7, 2025Updated last year
- ☆11Jun 21, 2025Updated 9 months ago
- Code for "AtTGen: Attribute Tree Generation for Real-World Attribute Joint Extraction", ACL 2023☆13May 19, 2023Updated 2 years ago
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"☆60May 9, 2025Updated 11 months ago
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning☆10Nov 3, 2024Updated last year
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year
- ☆58Jul 9, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆17Dec 23, 2025Updated 3 months ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆22Mar 2, 2025Updated last year
- Towards LLM-RecSys Alignment with Textual ID Learning☆57Aug 15, 2024Updated last year
- A prototype agent with the purpose of evaluating the performance of a Large Language Model within a python terminal.☆13Aug 28, 2023Updated 2 years ago
- This repository contains our latest research focused on enhancing the accuracy of large language models (LLMs) in mathematical applicatio…☆29Sep 17, 2025Updated 7 months ago
- MPI Code Generation through Domain-Specific Language Models☆15Nov 19, 2024Updated last year
- Using Huggingface to generate relation expressions☆15Jan 15, 2021Updated 5 years ago
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- Blueprint for training and deploying a machine learning model that effectively detects synthetic and modified audio content.☆14Apr 29, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- JPush's officially supported PhoneGap/Cordova plugin (Android & iOS). 极光推送官方支持的 PhoneGap/Cordova ionic2/3 Native插件(Android & iOS)。 http:/…☆10Jul 9, 2017Updated 8 years ago
- This is the code of a agentic rag method with dynamic workflow.☆12Jan 22, 2026Updated 2 months ago
- This project scrapes telugu newspaper articles.☆15Sep 9, 2018Updated 7 years ago
- Implementation of "Face detection in untrained deep neural networks" (Baek et al., Nature Communications, 2021)☆10Nov 2, 2021Updated 4 years ago
- [ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning☆28Jan 4, 2026Updated 3 months ago
- ☆12Mar 1, 2025Updated last year
- The zhong [|] Chinese grammars☆15Mar 13, 2026Updated last month