Logo PTI Logo rice

Proceedings of the 2024 Ninth International Conference on Research in Intelligent Computing in Engineering

Annals of Computer Science and Information Systems, Volume 42

Optimizing Resume Clustering in Recruitment: A Comprehensive Study on the Integration of Large Language Models (LLMs) with Advanced Clustering Algorithms

, , ,

DOI: http://dx.doi.org/10.15439/2024R29

Citation: Proceedings of the 2024 Ninth International Conference on Research in Intelligent Computing in Engineering, Vijender Kumar Solanki, Tran Duc Tan, Pradeep Kumar, Manuel Cardona (eds). ACSIS, Vol. 42, pages 1116 ()

Full text

Abstract. This study investigates the application of Large Language Models (LLMs) combined with clustering algorithms to automate and optimize the resume screening process in recruitment. The research evaluates the effectiveness of various LLMs such as BERT, RoBERTa, DistilBERT, and STSB RoBERTa in conjunction with clustering algorithms like K-means, DBSCAN, and hierarchical clustering. These combinations are assessed based on their ability to group similar resumes efficiently and accurately, considering factors such as content, context, and semantic relevance. Our research contributes to the field by rigorously analyzing the interplay between advanced NLP models and clustering techniques, identifying the optimal combinations for accurate and meaningful resume grouping. Additionally, we have developed a web application that integrates the most effective LLM-clustering combination, providing recruiters with an intuitive and interactive platform for analyzing clustered resumes. The results demonstrate that the integration of advanced NLP models with clustering techniques significantly improves the precision and relevance of resume clusters, leading to a more streamlined and efficient recruitment process. The final implementation shows promise in handling large datasets, enhancing the speed and accuracy of candidate evaluation and selection.

References

  1. Zhang X., Chen Y., Wang J., & Liu H. "Automatic Resume Processing for Recruiting System." International Journal of Advanced Research in Computer Science and Software Engineering, 2018. Li et al. “Feature Extraction and Clustering of Resume Data Using NLP Techniques, Journal of Information & Data Management”, (2020).
  2. Li Q., Zhao T., Huang L., & Feng R. "Feature Extraction and Clustering of Resume Data Using NLP Techniques." Journal of Information and Data Management, 2020. 
  3. Liu Y., Zhou P., Yang X., & Xu W. "Skill Gap Analysis Through Resume Clustering." IEEE Transactions on Knowledge and Data Engineering, 2019. 
  4. Devlin J., Chang M. W., Lee K., & Toutanova K. "Contextualized Embeddings for Improved Resume Clustering." Proceedings of the Annual Conference on Neural Information Processing Systems, 2019. 
  5. Brown A., Johnson C., Smith D., & Lee P. "Automating Recruitment with Machine Learning and Resume Clustering." ACM Transactions on Intelligent Systems and Technology, 2021. 
  6. Smith R., Taylor S., White M., & Carter J. "Enhancing Resume Clustering Using Ensemble Learning Techniques." Data Mining and Knowledge Discovery, 2020. 
  7. Chen W., Liu Z., Yang H., & Zhang Q. "Deep Learning Approaches for Resume Representation and Clustering." Journal of Computational Science, 2020. 
  8. Wang L., Zhang T., Chen Y., & Li F. "Temporal Analysis of Resume Data Using Time-Series Clustering." Information Processing & Management, 2021. 
  9. Garcia M., Torres A., Martinez S., & Lopez R. "Interactive Resume Clustering System for HR Decision Support." Expert Systems with Applications, 2020.