Evaluation of applicant quality for a recruitment company using machine learning

Connolly, Adrian

Evaluation of applicant quality for a recruitment company using machine learning

Mark

Connolly, Adrian (2019) MASK01 20191
Mathematical Statistics

Abstract: It is important to the recruitment company that only quality candidates, and the candidates most likely to receive an offer of employment get short-listed to go through the carriers selection process. It is this second phase of the recruitment process, the short-listing of potential candidates, that will be the focus of this thesis.

Data has being gathered by the recruitment company through an initial candidate screening process as well as responses from carriers regarding the selection outcome for each candidate.
The topic of this thesis is to apply and evaluate machine learning algorithms to this data in order to predict which candidates are of the highest caliber and thereby most likely to receive an offer of employment from the... (More); It is important to the recruitment company that only quality candidates, and the candidates most likely to receive an offer of employment get short-listed to go through the carriers selection process. It is this second phase of the recruitment process, the short-listing of potential candidates, that will be the focus of this thesis.

Data has being gathered by the recruitment company through an initial candidate screening process as well as responses from carriers regarding the selection outcome for each candidate.
The topic of this thesis is to apply and evaluate machine learning algorithms to this data in order to predict which candidates are of the highest caliber and thereby most likely to receive an offer of employment from the carrier.

The original screening data consisted of over 16 000 observations, However after removing observations with missing values and matching with the available response data the final data set consisted of only 1101 observations.

One conjecture is that candidates with less experience or opportunities are more motivated to answer all screening questions thereby providing a more complete profile. Candidates with more experience, contacts and opportunities are less motivated to answer screening questions and thereby leave incomplete profiles, and consequently not making it into the final data set. The failure of the machine learning algorithms evaluated in this thesis to successfully classify candidates is not so much a failure of the algorithms to find a pattern in the data rather a reflection of the fact that there is a fundamental absence of a pattern
in this particular data set. This does not mean that there are no patterns that could eventually be exploited to create accurate machine learning models, rather that improvements in the candidate screening and data gathering process are needed first. (Less)

Please use this url to cite or link to this publication: http://lup.lub.lu.se/student-papers/record/8986590

author

Connolly, Adrian

supervisor

Magnus Wiktorsson ^LU

organization

Mathematical Statistics

course

MASK01 20191

year

2019

type

M2 - Bachelor Degree

subject

Mathematics and Statistics

language

English

id

8986590

date added to LUP

2019-06-20 12:26:21

date last changed

2019-06-20 17:12:33

@misc{8986590,
  abstract     = {{It is important to the recruitment company that only quality candidates, and the candidates most likely to receive an offer of employment get short-listed to go through the carriers selection process. It is this second phase of the recruitment process, the short-listing of potential candidates, that will be the focus of this thesis.


Data has being gathered by the recruitment company through an initial candidate screening process as well as responses from carriers regarding the selection outcome for each candidate.
The topic of this thesis is to apply and evaluate machine learning algorithms to this data in order to predict which candidates are of the highest caliber and thereby most likely to receive an offer of employment from the carrier.


The original screening data consisted of over 16 000 observations, However after removing observations with missing values and matching with the available response data the final data set consisted of only 1101 observations.


One conjecture is that candidates with less experience or opportunities are more motivated to answer all screening questions thereby providing a more complete profile. Candidates with more experience, contacts and opportunities are less motivated to answer screening questions and thereby leave incomplete profiles, and consequently not making it into the final data set. The failure of the machine learning algorithms evaluated in this thesis to successfully classify candidates is not so much a failure of the algorithms to find a pattern in the data rather a reflection of the fact that there is a fundamental absence of a pattern
in this particular data set. This does not mean that there are no patterns that could eventually be exploited to create accurate machine learning models, rather that improvements in the candidate screening and data gathering process are needed first.}},
  author       = {{Connolly, Adrian}},
  language     = {{eng}},
  note         = {{Student Paper}},
  title        = {{Evaluation of applicant quality for a recruitment company using machine learning}},
  year         = {{2019}},
}

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Evaluation of applicant quality for a recruitment company using machine learning