Text Analysis of Applicants for Personality Classification Using Multinomial Naïve Bayes and Decision Tree
The quality of a company's employees affects its performance, and when the number of applicants is large, finding suitable candidates is difficult. To help, companies administer psychological tests to learn about applicants' personalities, since personality is considered to be related to work performance. However, psychological testing requires considerable effort, cost, and human resources. A system that can classify personality from text can therefore help reduce the effort needed. Similar studies have used the big five personality traits as their theoretical basis but considered only one of the traits, for example using the k-NN method with 65% accuracy. Based on these studies, accuracy can be improved by finding the best parameters using all of the big five personality traits. This research is based on the big five personality traits and related traits, namely conscientiousness and agreeableness. The data used is text data that has been labelled, pre-processed, and feature-selected. The clean text data is used to build classification models with multinomial naïve Bayes and decision trees. Six models are built based on three work cultures: the decision tree models achieve accuracies of 33%, 66%, and 80%, and the multinomial naïve Bayes models achieve accuracies of 83%, 50%, and 60%, which the study reports as the better performance.
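The abstract does not describe the implementation, so the following is only a minimal sketch of how a multinomial naïve Bayes text classifier of this kind might work, assuming whitespace tokenization and additive (Laplace) smoothing. The function names, the smoothing parameter `alpha`, and the toy sentences and labels are hypothetical and are not taken from the study.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace; a stand-in for real pre-processing."""
    return text.lower().split()


def train_multinomial_nb(docs, labels, alpha=1.0):
    """Fit multinomial naive Bayes with additive (Laplace) smoothing."""
    class_counts = Counter(labels)
    word_counts = defaultdict(Counter)  # per-class word frequencies
    vocab = set()
    for text, label in zip(docs, labels):
        tokens = tokenize(text)
        word_counts[label].update(tokens)
        vocab.update(tokens)

    model = {"log_prior": {}, "log_likelihood": {}}
    for label, n_docs in class_counts.items():
        model["log_prior"][label] = math.log(n_docs / len(docs))
        # Smoothed denominator: total words in the class plus alpha per vocab word.
        denom = sum(word_counts[label].values()) + alpha * len(vocab)
        model["log_likelihood"][label] = {
            w: math.log((word_counts[label][w] + alpha) / denom) for w in vocab
        }
    return model


def predict(model, text):
    """Return the class with the highest posterior log-probability."""
    scores = {}
    for label, log_prior in model["log_prior"].items():
        ll = model["log_likelihood"][label]
        # Words never seen during training are skipped.
        scores[label] = log_prior + sum(ll[w] for w in tokenize(text) if w in ll)
    return max(scores, key=scores.get)


# Hypothetical toy data for illustration only; not from the study.
train_docs = [
    "i plan my tasks carefully and finish on time",
    "i keep careful notes and finish every task",
    "i enjoy helping my team and listening to others",
    "i like helping others and working with a kind team",
]
train_labels = [
    "conscientiousness",
    "conscientiousness",
    "agreeableness",
    "agreeableness",
]

model = train_multinomial_nb(train_docs, train_labels)
prediction = predict(model, "i finish my tasks on time")
```

In this sketch each class is scored by its log prior plus the summed log likelihoods of the words in the input, and the highest-scoring class is returned. A real pipeline would add the labelling, pre-processing, and feature selection steps the abstract mentions, and would evaluate accuracy on held-out data.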
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.