Skip to content

utkarshranaa/SkillScan.AI

Repository files navigation

πŸ“„SKILLSCAN AI: Resume Categorization with Machine Learning

An AI-driven resume classification tool leveraging NLP, TF-IDF, and Machine Learning to automatically categorize resumes into job roles with high accuracy.


πŸ“Œ Overview

SKILLSCAN AI is a cutting-edge resume categorization system that processes resumes and classifies them into multiple job roles using TF-IDF vectorization and multiple ML models:

  • Java Developer
  • Python Developer
  • Data Scientist
  • DevOps Engineer
  • Machine Learning Engineer
  • And many more...

By utilizing TF-IDF + Logistic Regression, Random Forest, KNN, and Naive Bayes, SKILLSCAN AI achieves high classification accuracy while maintaining a fast and scalable pipeline.


πŸ”₯ Key Features

βœ… Fast & Accurate: ML models trained for resume categorization with up to 99% accuracy.
βœ… TF-IDF Feature Extraction: Advanced text processing for better classification.
βœ… Multi-Model Comparison: Uses Logistic Regression, SVC, KNN, Naive Bayes, and Random Forest.
βœ… Explainability with Confusion Matrix & Radar Charts: Visual insights into model performance.


πŸ“Š Dataset

πŸ“Œ Dataset: Custom dataset of 20+ job categories.
πŸ“Œ Data Preprocessing: Includes text cleaning, stopword removal, and TF-IDF transformation.

πŸ“Š Classification Report

Class Precision Recall F1-Score Support
Java Developer 0.98 0.99 0.99 50
Python Developer 0.97 0.98 0.97 47
Data Scientist 0.99 1.00 0.99 60
DevOps Engineer 0.98 0.97 0.98 45

Overall Metrics

Metric Score
Accuracy 0.99
Macro Avg 0.98
Weighted Avg 0.99

πŸ“Œ Model Training & Evaluation

  • Feature Extraction: TF-IDF Vectorization.
  • Models Used: Logistic Regression, Random Forest, KNN, Naive Bayes, and SVC.
  • Loss Function: Categorical Crossentropy (for multi-class classification).
  • Optimizer: GridSearchCV to fine-tune hyperparameters.
  • Evaluation Metrics: Accuracy, F1-Score, Precision, Recall, and Confusion Matrix.

πŸ›  Technologies Used

  • Python
  • Scikit-Learn
  • NLTK
  • TF-IDF Vectorizer
  • Seaborn / Matplotlib (for visualization)
  • Jupyter Notebook

πŸ“¬ Contact

πŸ“§ Email: utkarshranaa06@gmail.com
πŸ”— GitHub: utkarshranaa
πŸ”— LinkedIn: www.linkedin.com/in/utkarshranaa
πŸ”— X/Twitter: @utkarshranaa

πŸš€ If you found this project useful, please ⭐ star the repository!

About

SKILLSCAN AI is an AI-powered resume classification tool that leverages NLP, TF-IDF, and machine learning models to categorize resumes into job roles with high accuracy.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors