Data Skills Project
Arshya Pooladi-Darvish
2022-07-08
Description: This is a study of psychological traits and academic honesty/dishonesty in university students
from Poland.
Variables:
participant: identifier for the person observed in the study. categorical and nominal variable
sex: the sex the participant identifies with. categorical and nominal variable
age: the participant’s age. quantitative and ratio variable
field: the participant’s academic field. categorical and nominal variable
efficacy: a score from 0-40 measuring the degree to which an individual believes they can achieve their goals.
quantitative and interval variable
dishonesty: the level of academic dishonesty displayed by each participant, scored from 0-65 on the
Academic Dishonesty Scale. quantitative and ratio variable
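The variable types listed above can be mirrored in R by storing nominal variables as factors and quantitative variables as numerics. The following is a minimal sketch on a hypothetical data frame named `toy`; its rows are invented for illustration and are not taken from AHSdata.

```r
# Hypothetical rows standing in for AHSdata; all values are made up.
toy <- data.frame(
  participant = c(1, 2, 3, 4),
  sex         = c("female", "male", "female", "male"),
  age         = c(21, 22, 24, 23),
  field       = c("H", "LA", "MS", "SS"),
  efficacy    = c(28, 31, 25, 30),
  dishonesty  = c(12, 15, 9, 20)
)

# Nominal variables become factors; quantitative variables stay numeric.
toy$participant <- factor(toy$participant)
toy$sex         <- factor(toy$sex)
toy$field       <- factor(toy$field)

# Inspect the resulting column types.
str(toy)
```

Storing `sex` and `field` as factors lets functions such as `t.test()` and `ggplot()` treat them as groups rather than as text or numbers.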
Three Research Questions:
1. How do a participant’s age and field affect their efficacy?
2. What is the difference in mean dishonesty rating between men and women?
3. Is there a significant difference in academic dishonesty across participants’ fields?
Question 1: How do a participant’s age and field affect their efficacy?
library(ggplot2)
table(AHSdata$age)
##
## 19 20 21 22 23 24 25 26 27 28 56
## 6 22 66 84 71 66 41 21 9 3 1
# As shown above, the one participant aged 56 is a clear outlier, so I exclude
# them to make the graph easier to interpret.
ggplot(subset(AHSdata, age < 30), aes(x = age, y = efficacy, color = field)) +
  geom_point(size = 4, shape = 20) +
  # One linear trend line per field; linewidth replaces size for lines in ggplot2 >= 3.4.0
  geom_smooth(method = lm, se = FALSE, linewidth = 1) +
  ggtitle("Comparing efficacy against age in participants from different fields")
## `geom_smooth()` using formula 'y ~ x'
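[Figure: scatter plot of efficacy (y-axis, about 10 to 30) against age (x-axis, 20 to 28), colored by field (legend: H, LA, MS, SS, ST), with one linear trend line per field.]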
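The per-field trend lines in a plot like this correspond to a linear model in which age interacts with field, so each field gets its own intercept and slope. The following is a minimal sketch on simulated data; the data frame `sim`, the three field codes used, and the slopes in `true_slope` are assumptions made for illustration and are not estimates from AHSdata.

```r
set.seed(1)

# Simulated stand-in for AHSdata (made-up values, not the study data).
sim <- data.frame(
  age   = sample(19:28, 120, replace = TRUE),
  field = factor(rep(c("H", "LA", "MS"), each = 40))
)

# Hypothetical slopes chosen only so that the fields differ.
true_slope <- c(H = -0.5, LA = 0.4, MS = -0.2)
sim$efficacy <- 30 +
  unname(true_slope[as.character(sim$field)]) * (sim$age - 19) +
  rnorm(nrow(sim), sd = 3)

# age * field expands to age + field + age:field,
# which gives each field its own intercept and slope.
fit <- lm(efficacy ~ age * field, data = sim)

# Per-field slopes: the baseline slope (field H) plus each interaction term.
b <- coef(fit)
slopes <- c(
  H  = b[["age"]],
  LA = b[["age"]] + b[["age:fieldLA"]],
  MS = b[["age"]] + b[["age:fieldMS"]]
)
print(round(slopes, 2))
```

A negative slope for a field means older participants in that field tend to have lower efficacy scores; `summary(fit)` additionally reports whether each slope differs significantly from the baseline field's slope.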
From the graph above I can conclude that, on average, older participants in Humanities and Medical Science
reported lower efficacy than younger ones, while older participants in undeclared fields, Law, Science and
Technology, and Social Sciences reported higher efficacy. Because each participant was measured once, these
trends describe differences between participants of different ages rather than changes within individuals.
Question 2: What is the difference in mean dishonesty rating between men and women?
# Pass the data frame directly instead of using attach(), which can mask other objects
t.test(dishonesty ~ sex, data = AHSdata)
##
## Welch Two Sample t-test
##
## data: dishonesty by sex
## t = -1.151, df = 133.17, p-value = 0.2518
## alternative hypothesis: true difference in means between group female and group male is not equal to 0
## 95 percent confidence interval:
## -4.403985 1.163985
## sample estimates:
## mean in group female mean in group male
## 12.70 14.32
Since the p-value (0.2518) is greater than the alpha level of 0.05, we fail to reject the null hypothesis. The
95% confidence interval for the difference in means (-4.40 to 1.16) includes 0, so there is no statistically
significant evidence of a difference in mean academic dishonesty between men and women.
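As a quick check on the reported output, the confidence interval should be centred on the observed difference in group means, taken as female minus male. The sketch below uses only the numbers printed in the t-test output; the variable names are chosen here for illustration.

```r
# Values copied from the t-test output above.
mean_female <- 12.70
mean_male   <- 14.32
ci          <- c(-4.403985, 1.163985)

# Observed difference, in the order t.test() reports it (female minus male).
diff_means <- mean_female - mean_male

# A t-based confidence interval is symmetric around the point estimate.
ci_centre <- mean(ci)

# An interval that contains 0 is consistent with no difference at the 5% level.
includes_zero <- ci[1] < 0 && ci[2] > 0

print(c(diff_means = diff_means, ci_centre = ci_centre))
print(includes_zero)
```

Both values come out at -1.62, and the interval spans 0, which matches the p-value being above 0.05.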
Question 3: Is there a significant difference in academic dishonesty across participants’ fields?
Since our p-value is very low (p < 0.005), we reject the null hypothesis in favor of the alternative hypothesis
and conclude that there is a significant difference in mean academic dishonesty across participants’ fields.
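The test behind this p-value is not shown here. One common choice for comparing a quantitative score across several groups is a one-way ANOVA, and the following is a minimal sketch of that approach on simulated data. The data frame `sim` and the means in `group_mean` are assumptions made for illustration, so the output does not reproduce the result reported above.

```r
set.seed(42)

# Simulated stand-in for AHSdata (made-up values, not the study data).
sim <- data.frame(
  field = factor(rep(c("H", "LA", "MS", "SS", "ST"), each = 30))
)

# Hypothetical group means chosen only so that the fields differ.
group_mean <- c(H = 10, LA = 14, MS = 12, SS = 16, ST = 13)
sim$dishonesty <- unname(group_mean[as.character(sim$field)]) +
  rnorm(nrow(sim), sd = 5)

# One-way ANOVA: does mean dishonesty differ across fields?
fit <- aov(dishonesty ~ field, data = sim)
anova_table <- summary(fit)[[1]]
print(anova_table)

# p-value for the field term (first row of the ANOVA table).
p_value <- anova_table[["Pr(>F)"]][1]
print(p_value)
```

A small p-value here indicates only that at least one field mean differs from the others; a follow-up such as `TukeyHSD(fit)` would be needed to see which pairs of fields differ.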