JUHE API Marketplace
DATASET
Open Source Community

Hiring Decision Analysis Dataset

The dataset contains multiple variables related to recruitment decisions, such as age, gender, education level, work experience, number of previous employers, distance to the company, interview score, skill score, personality score, and recruitment strategy. The target variable is the recruitment decision, classified as hired or not hired.

Updated 7/24/2024
github

Description

Recruitment Decision Analysis Dataset

Data Description

The dataset includes the following columns:

Variable Description

  • Age:

    • Range: 20 to 50 years
    • Type: Integer
  • Gender:

    • Categories: Male (0) or Female (1)
    • Type: Binary
  • Education Level:

    • Categories: 1: Bachelor (type 1), 2: Bachelor (type 2), 3: Master, 4: Doctorate
    • Type: Categorical
  • Years of Work Experience:

    • Range: 0 to 15 years
    • Type: Integer
  • Number of Previously Worked Companies:

    • Range: 1 to 5 companies
    • Type: Integer
  • Distance to Company:

    • Range: 1 to 50 km
    • Type: Continuous Float
  • Interview Score:

    • Range: 0 to 100
    • Type: Integer
  • Skill Score:

    • Range: 0 to 100
    • Type: Integer
  • Personality Score:

    • Range: 0 to 100
    • Type: Integer
  • Recruitment Strategy:

    • Categories: 1: Aggressive, 2: Moderate, 3: Conservative
    • Type: Categorical
  • Recruitment Decision (Target Variable):

    • Categories: 0: Not hired, 1: Hired
    • Type: Binary Integer

Exploratory Data Analysis (EDA)

Correlation Analysis

Correlation analysis identifies relationships between features and the target variable (recruitment decision), revealing the most influential factors in the hiring process.

Age and Recruitment Decision

Analysis of age distribution and its impact on recruitment decisions uncovers age‑related trends and biases, using histograms and box plots for different age groups.

Years of Work Experience and Recruitment Decision

Exploration of how years of work experience affect the likelihood of being hired, visualized with scatter and line plots.

Predictive Modeling

Logistic Regression

A logistic regression model is employed to predict recruitment decisions. Logistic regression is suitable for this binary classification problem, where the target outcome is either hired or not hired.

Hyperparameter Tuning

GridSearchCV is used for hyperparameter optimization, tuning parameters such as regularization penalty (penalty: l1, l2) and regularization strength (C: [0.01, 0.1, 1, 10, 100]).

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Hiring Decision
Data Analysis

Source

Organization: github

Created: 7/16/2024

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.