Hybrid Pipelines for Intelligent Human Resources Text Classification: LLMs, RAG, and Generative AI
- 1 Department of Mathematics, Faculty of Science and Technology, University Hassan II, Laboratory of Mathematics, Computer Science and Applications (LMCSA), Mohammedia, Morocco
Abstract
The digital transformation of Human Resources Information Systems (HRIS) requires advanced approaches to process unstructured textual data originating from Curriculum Vitae (CVs), cover letters, and job postings. Traditional text classification methods exhibit limitations when faced with current needs for contextual understanding and fine-grained skill detection. This paper proposes a hybrid pipeline combining advanced text classification, contrastive learning (SimCSE, Contriever), Retrieval-Augmented Generation (RAG), and generative AI (LLMs) to enhance candidate pre-screening, CV–job matching, and profile generation. Experimental results obtained on a corpus of 50,000 CVs show that the hybrid pipeline with RAG achieves an accuracy of 94.2% with a macro-F1 score reaching 92.3%, outperforming standard Transformer-based approaches and improving performance by +2.5% compared to the hybrid pipeline without RAG. When integrated into an HRIS, the proposed system accelerates recruitment processes while improving accuracy and efficiency, all while maintaining inference times compatible with operational deployment.
DOI: https://doi.org/10.3844/jcssp.2026.1387.1395
Copyright: © 2026 Soumia Chafi, Mustapha Kabil and Abdessamad Kamouss. This is an open access article distributed under the terms of the
Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 67 Views
- 23 Downloads
- 0 Citations
Download
Keywords
- LLaMA
- Mixtral
- LLM
- BERT
- NLP
- Generative Learning
- Contrastive Learning
- Deep Learning
- Contriever
- SimCSE
- RAG
- SIRH