BioLORD: Learning Ontological Representations from Definitions (for Biomedical Concepts and their Textual Descriptions) Paper • 2210.11892 • Published Oct 21, 2022 • 2
Extreme Multi-Label Skill Extraction Training using Large Language Models Paper • 2307.10778 • Published Jul 20, 2023
EduQG: A Multi-format Multiple Choice Dataset for the Educational Domain Paper • 2210.06104 • Published Oct 12, 2022
Tik-to-Tok: Translating Language Models One Token at a Time: An Embedding Initialization Strategy for Efficient Language Adaptation Paper • 2310.03477 • Published Oct 5, 2023 • 1
BioLORD-2023: Semantic Textual Representations Fusing LLM and Clinical Knowledge Graph Insights Paper • 2311.16075 • Published Nov 27, 2023 • 6
In-Context Learning for Extreme Multi-Label Classification Paper • 2401.12178 • Published Jan 22, 2024 • 2
Career Path Prediction using Resume Representation Learning and Skill-based Matching Paper • 2310.15636 • Published Oct 24, 2023
Design of Negative Sampling Strategies for Distantly Supervised Skill Extraction Paper • 2209.05987 • Published Sep 13, 2022
DWIE: an entity-centric dataset for multi-task document-level information extraction Paper • 2009.12626 • Published Sep 26, 2020