Personality Prediction using Machine Learning Classifiers | Assignments Artificial Intelligence

Journal of Applied Technology and Innovation (e -ISSN: 2600-7304) vol. 5, no. 1, (2021) 1

Personality Prediction using Machine Learning

Classifiers

Xin Yee Chin

School of Computing

Asia Pacific University of Technology

and Innovation (APU)

Kuala Lumpur, Malaysia

[email protected]

Han Yang Lau

School of Computing

Asia Pacific University of Technology

and Innovation (APU)

Kuala Lumpur, Malaysia

[email protected]

Zhi Xin Chong

School of Computing

Asia Pacific University of Technology

and Innovation (APU)

Kuala Lumpur, Malaysia

[email protected]

Man Pan Chow

School of Computing

Asia Pacific University of Technology

and Innovation (APU)

Kuala Lumpur, Malaysia

[email protected]

Zailan Arabee Abdul Salam

School of Computing

Asia Pacific University of Technology

and Innovation (APU)

Kuala Lumpur, Malaysia

[email protected]

Abstract— Personality is a fundamental basis of human

behaviour. At most basic, personality including patterns of

thought, feeling, behaviours that make an individual unique.

Personality will directly or indirectly influence the interaction

or preferences of a person. This research using different

learning algorithms and concepts of data mining to mine on the

data features and learn from the pattern. The aim of this

experiment is to explore different options of the algorithm on

modifying the personality prediction source code by using

logistic regression algorithm, and to find whether the accuracy

of the classification can be improved. There are five

characteristics of different people that are known as the Big

Five characteristic, which is openness, neuroticism,

conscientiousness, agreeableness and extraversion that have

been stored in the dataset used for training. Then, an overview

and comparison will be provided on the different measures

taken to reduce the issues faced by researchers in this field.

Classification methods implemented are Support Vector

Machine, Ridge Algorithm, Naive Bayes, Logistic Regression

and Voting Classifier. Testing results showed that the Logistic

Regression still outperformed the other methods.

Key words- machine learning; personality prediction; Big Five

Personality; regression

I. INTRODUCTION

Personality is all about the different characteristic of an

individual’s pattern of feeling, behaving and thinking.

Personality embraces the mood, opinion and attitude of

someone, it is the best expression way to express clearly and

in an understandable form when interacting with someone.

Personality provides the ability to distinguish one person

from another that can be observed in the workplace

environment and so on. Although there are many more ways

to explain what personality exactly is, from the psychological

point of perspective, there are two main explanations. First

pertains to the consistency of differences between humans.

In this way, the study of personality can focus on

classifying and identifying human’s psychological patterns.

Second belongs to the emphasis of quality which is mostly

likely to make people alike and that will help to distinguish

psychological man from the other species. Personality

theorists are then directed to research about those regulations

among people that can usefully define the nature of man and

other factors that influence the course of live. Understanding

personality is important and useful. Personality provides

people the idea of how leading, influencing communication

can take place in certain conditions. For example, personality

traits such as agreeableness and extraversion are mostly

going to improve the chance of communication. Whereas

personality traits such as high self-esteem are most likely

going to remain silent at the workplace.

Therefore, personality shows that it can be useful in many

situations. In this experiment, machine learning is being

apply to judge and classify personality. Based on the

previous source code, logistic regression was used to classify

the big five personalities. Big five personality traits include

openness, neuroticism, conscientiousness, agreeableness and

extraversion. Classifying these personality traits is useful in

many ways, one of the reasons to classify personality is to

check the suitability of an employee. Employee’s personality

is often tested in real time to determine which position of the

job he or she is particularly fitting in well.

In this research, different algorithms are added to further

explore the dataset to test if higher accuracy can be found

and created. Classification methods will be added to the

original code is Support Vector Machine, Ridge Algorithm,

Naïve Bayes, Logistic Regression and Voting Classifier.

Logistic Regression is being the default algorithms to the

source code.

Critical analysis was performed on similar

projects/papers that used different methods. [2] used

Multiclass Support Vector Machine (SVM) to perform

personality classification based on handwriting. The

personalities are Optimistic, Extrovert, Introvert, Sloppy,

Energetic. Multiclass classification. Histogram of oriented

gradient performs feature extraction on handwriting data, and

noise removal performed using adaptive thresholding

Personality Prediction using Machine Learning Classifiers, Assignments of Artificial Intelligence

Related documents

Partial preview of the text

Download Personality Prediction using Machine Learning Classifiers and more Assignments Artificial Intelligence in PDF only on Docsity!

Personality Prediction using Machine Learning

Classifiers

TABLE I. SPECIFICATION OF ACER SWIFT 3

TABLE II. DATASET DESCRIPTION

X 2 )

Za = ∅ ( X ¿¿ a )=(1, a 1 , a 2 , a 1

, a 2

,a 1 ∗ a 2 ) ¿

Zb = ∅ ( X ¿¿ b )=(1, b 1 , b 2 , b 1

,b 2

, b 1 ∗ b 2 )¿

Za

Zb = k ( X a , Xb )=( 1 + Xa

X b )

Za

Zb = 1 + a 1 b 1 + a 2 b 2 + a 1

a 1

+ a 2

a 2

+ a 1 a 1 b 1 b 1

y = w 0 + w 1 x 1 + w 2 x 2 + w 3 x 3 ….

wi xi

¿ w 0 + w

X

¿ b + w

X (11)

estimator, β =( X ' X )−^1 X ' Y , forming:

βridge =( X ' X + λ I p )

X ' Y

xij β j )

β j

Xij β j ¿)

β j

< c. (15)

P ( t ∨ d )= P ( C ) × P ( t 1 ∨ ×c ) × P ( t ¿¿ 2 ∨ ×c ) × P ( t ¿¿ 3 ∨ × c )... × P ( t ¿¿ n ∨ ×c )¿ ¿ ¿

P ( C )=

N c

N

P ( tn ∨ C )=

count ( tn , c )+ 1

count ( c )+¿ V ∨¿ ¿

tfid f t = f t ,d × log

N

df t

( tn ∨ C )=

W ct + 1

W

ct +^ B

y = mode { C 1 ( x ) , C 2 ( x ) , ... , Cm ( x ) }

y = arg max

i

w j pij (22)