davidsvaughn / prompt-loss-weightView external linksLinks
code for Towards Data Science article on prompt-loss-weight
☆11Jun 4, 2025Updated 8 months ago
Alternatives and similar repositories for prompt-loss-weight
Users that are interested in prompt-loss-weight are comparing it to the libraries listed below
Sorting:
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆21Jan 19, 2026Updated 3 weeks ago
- Terrier's desktop search demo product☆13Aug 2, 2018Updated 7 years ago
- Project Gold ✨☆11Jan 29, 2026Updated 2 weeks ago
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 10 months ago
- SimEc code relying on the theano library - check out the simec repo instead for keras based code!☆10Feb 28, 2018Updated 7 years ago
- ☆12May 30, 2025Updated 8 months ago
- Lossless normalization of uppercase characters☆11Jul 3, 2023Updated 2 years ago
- ☆13Jun 18, 2024Updated last year
- Testing sets for semanticVAD☆20Feb 18, 2025Updated 11 months ago
- Unofficial repo for SubTab with additional code and data for Adult Income and BlogFeedback datasets. BlogFeedback data is attached as zip…☆10Jun 24, 2022Updated 3 years ago
- eSNN - Learning similarity measure from data☆12Nov 28, 2019Updated 6 years ago
- Tool for tweaking dbpedia spotlight's models☆16Dec 1, 2017Updated 8 years ago
- ☆12Jan 11, 2018Updated 8 years ago
- The resources for the paper "User Modeling with Click Preference and Reading Satisfaction for News Recommendation"☆11Jan 17, 2021Updated 5 years ago
- A tool for exploring HID devices on OS X☆10Feb 3, 2015Updated 11 years ago
- ☆11May 8, 2020Updated 5 years ago
- Mirror of https://gerrit.wikimedia.org/g/wikimedia/textcat See https://www.mediawiki.org/wiki/Developer_access for contributing☆11Jan 27, 2026Updated 2 weeks ago
- ☆16May 8, 2020Updated 5 years ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- Scikit-learn vectorizer implementing "A simple but tough-to-beat baseline for sentence embeddings." by Arora, Sanjeev, Yingyu Liang, and …☆12Apr 1, 2018Updated 7 years ago
- Implementation of Siamese CBOW using keras whose backend is tensorflow.☆12Feb 2, 2023Updated 3 years ago
- Các thí nghiệm liên quan tới LLMs cho tiếng Việt (insprised by Physics of LLMs Series)☆11Oct 21, 2024Updated last year
- ☆10Mar 31, 2022Updated 3 years ago
- [AAAI 24] GradTree: Gradient-Based Axis-Aligned Decision Trees☆15Aug 28, 2024Updated last year
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆20Updated this week
- Exploratory search engine based on hierarchical topic models from BigARTM☆13Mar 8, 2022Updated 3 years ago
- ☆11Jul 23, 2023Updated 2 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Dec 8, 2022Updated 3 years ago
- Infinite relational model (IRM) for datamicroscopes☆14Oct 26, 2015Updated 10 years ago
- Tools for creating DBpedia Spotlight Lucene Index☆10Oct 5, 2022Updated 3 years ago
- Python code for "Bayesian hybrid matrix factorisation for data integration", published at 20th International Conference on Artificial Int…☆13Jun 10, 2018Updated 7 years ago
- Neural Response Ranker for Alana, Heriot-Watt University's Alexa Prize Socialbot☆13Nov 21, 2022Updated 3 years ago
- A TensorFlow implementation of dependency-based word embeddings (dependency-based word2vec)☆12Jan 26, 2016Updated 10 years ago
- Temporal Collaborative Topic Regression for recommendation. Extends Collaborative Topic Modelling (Wang and Blei) to consider the tempora…☆11Aug 4, 2018Updated 7 years ago
- Code accompanying to "COVID-19 transmission in supermarkets using agent-based modelling"☆11May 3, 2021Updated 4 years ago
- Embeddings for all geonames populated locations with population greater than 0☆13May 15, 2017Updated 8 years ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆28Sep 20, 2025Updated 4 months ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆35Jan 18, 2026Updated 3 weeks ago
- ☆13Dec 31, 2023Updated 2 years ago