Skip to main navigation Skip to search Skip to main content

Cross-Lingual Sentence-Level Skill Identification in English and Danish Job Advertisements

Research output: Chapter in Book/Conference proceedingPublished conference proceedingScientificpeer-review

13 Downloads (Pure)

Abstract

The increasing influence of artificial intelligence (AI), the availability of textual data, and large language models (LLMs) over the past decade is evident in the growth of scholarly work on identifying skills from job advertisements. In this work, we examine the detection of sentences that express skills as well as the
explainability of model decisions with respect to their dependence on skill related tokens. We compare traditional machine learning (ML) approaches with a pretrained multilingual model and domain-adapted models for the task of English skill identification, and we assess the role of skill tokens in the classification process. We also investigate the ability of these models to generalize from English (EN) to Danish (DA) in both few-shot and zero-shot settings. Our findings indicate that both models achieve high performance in sentence classification achieving an F1-score of 94% for EN and overall accuracy between 93%–94% for both EN and DA. The results show that traditional ML methods can remain relevant under certain circumstances reinforcing the importance of realistic baselines in the context of skill identification.
Original languageEnglish
Title of host publicationProceedings of the 8th International Conference on Natural Language and Speech Processing (ICNLSP-2025)
Place of PublicationOdense, Denmark
PublisherAssociation for Computational Linguistics
Pages410-415
ISBN (Electronic)979-8-89176-297-8
Publication statusPublished - 2025
MoE publication typeA4 Article in a conference publication
Event
International Conference on Natural Language and Speech Processing
- Odense, Denmark
Duration: 25 Aug 202527 Aug 2025
Conference number: 8th
https://www.icnlsp.org/

Conference

Conference
International Conference on Natural Language and Speech Processing
Abbreviated titleICNLSP
Country/TerritoryDenmark
CityOdense
Period25/08/2527/08/25
Internet address

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 4 - Quality Education
    SDG 4 Quality Education
  2. SDG 8 - Decent Work and Economic Growth
    SDG 8 Decent Work and Economic Growth

Fingerprint

Dive into the research topics of 'Cross-Lingual Sentence-Level Skill Identification in English and Danish Job Advertisements'. Together they form a unique fingerprint.

Cite this