FHCT1014 September2021

You might also like

Download as pdf
Download as pdf
You are on page 1of 11
UNIVERSITI TUNKU ABDUL RAHMAN, ACADEMIC YEAR 2021/2022 MAY 2021 TRIMESTER FINAL ASSESSMENT FHCT1014 INTRODUCTION TO DATA ANALYTIC! FRIDAY , 24 SEPTEMBER 2021 TIM :9.00 AM — 12.00 PM (3 HOURS) FOUNDATION IN SCIENCE REMINDER: You are reminded to read and adhere to the Final Assessment Instructions to Candidates that has been made available through the UTAR Portal before the commencement of Final Assessment (FA). The detailed instructions for this FA are as follows: General 1. This Final Assessment (FA) is an Individual, time restricted assessment which consists of THREE (3) questions. Each question carries 50 marks 2. You are required to answer TWO (2) out of THREE (3) questions: Question 1: Compulsory. Question 2 and Question 3: Answer ONE (1) question only. In cases where a choice of questions is provided, only answers to the initially chosen questions will be marked. Any additional questions answered will not be considered. Only ONE (1) online submission is allowed. You must submit the ANSWER SCRIPT by 12.00 pm, 24 SEPTEMBER 2021 3. During the period of this FA, if you require any clarification from your lecturer(s), the contact details can be obtained from the FA guide/FA guidelines available on WBLE. 4, You may refer to any books, lecture notes, published materials, online resources etc. when answering the questions. Candidates are reminded that Copy-and-Paste, Consultation, Discussion and Sharing of Answers with others are STRICTLY PROHIBITED in this FA. Answer Seript File 5. Please refer to the FA guide/FA guidelines for the submission format of the answer script file. Note: Please keep the file size NOT exceeding 30MB per file. This final assessment paper consists of 3 questions on 11 printed pages. 2 FHCT1014 INTRODUCTION TO DATA ANALYTICS 6. Please check your Index Number generated by the Division of Examinations, Awards, and Scholarships (DEAS). You MUST name your answer script file using the following file name for submission: Index_Number_FHCT1014_INTRODUCTION_TO_DATA_ANALYTICS Answer Script File Submission 7. Your answer script file has to be submitted following the platform(s) as stated in the FA guide/FA guidelines before the due time/date. 8. Please make sure you submit the correct, complete and final version of your answer script to the platform(s) as stated in the FA guide/FA guidelines. Contents of Answer Script 9. The first page of your submission is the Final Assessment Cover Page. You MUST use the template given and fill in the following information: + Your Programme (Foundation in Science) + Your Index Number + Your Name + Your Student ID 10. The second page of your submission is the Final Assessment Declaration Statement. You MUST use the template given, and digitally sign on the form to indicate the authenticity of your submitted work is without plagiarism, ML rach question should be answered starting on a new page. 12. For answer scripts that have text-based answers only, all texts MUST be typed using Times ‘New Roman characters with font size 12, with proper spacing and alignment, except for drawings and equations/calculations. 13. For answer scripts that require/contain drawings, equations and calculations with short text descriptions, you can hand-write your answers and then use the scanner apps in your smartphone to take a scanned copy, or you can type if necessary and include the scanned copy taken in the Word document, as part of your submitted answers. 14, Please include a page number on each and every page of your answer script. Ensure that each page of answer scripts is in sequence prior to online submission. WARNING OF PLAGIARISM 15. In the case of suspected plagiarism, the evidence will be submitted to the Examination Disciplinary Committee of the University. Disciplinary action shall be taken against any candidate who is found to have plagiarized in the answer submitted. Hence, candidates are reminded to abide by all University Rules and Regulations and any instructions/guidelines relating to examinations/assessments. This final assessment paper consists of 3 questions on 11 printed pages. 3 FHCT1014 INTRODUCTION TO DATA ANALYTICS Question 1: Compulsory. [Total : 50 marks] Qi. @) ) © Based on the types of knowledge that you have leamed in this course, identify the type of knowledge involved in the following cases. @ (i) Gi) wy) W wi) Jayden leamt how to install a home security alarm by referring to the user manual. (1 mark) Matthew suggested to Mei Ling that she should put more milk to make her latte tastes nicer. (1 mark) Amelia excels in her programming course as she has been practising day and night to improve her programming skills. (1 mark) Aunty Sally is an experienced babysitter as she has been taking care of babies for years. (1 mark) Lianne leamt how to use some common Excel functions such as SUM, AVERAGE, COUNT, MAX and MIN from a reference book. (1 mark) Dunean is good in using Excel to perform data analysis. By just glancing through the data, he knows the appropriate functions to be used to analyse the data. (1 mark) Describe the FOUR (4) characteristics of big data for each of the following cases. a @ Figure 1.1 shows a portion of the order detail Waze is a GPS navigation app that works on mobile devices such as smartphones and tablet PCs. It provides navigation and route information to a destination and allows users to share any incident on route such as car erash, traffic condition, hazard, ete. (8 marks) Shopee is an e-commerce shopping platform that allows users to buy and sell products online. Activities such as inventory management, product marketing, payment transaction, and online chat can be carried out on the platform. (8 marks) stored in an Excel worksheet. aA 8 c D E LF 1 No Amount Order Date Order ID Customer ID Delivery Date 201 430.5. 15/7/2021 150701 KL1221 17/7/2021 3] 2 430.5 15/7/2021 150701 KL1221 37/7/2021 4 © 3-RM1,900.20 16/7/2021 160701 485028 19/7/2023] 5 4 RM 288.00 16/7/2021 160702 P2440 17/7/2023 65 230.99 17/7/2021 170701 _PK6909 22/7/2021) Figure 1.1 This final assessment paper consists of 3 questions on 11 printed pages. FHCT1014 INTRODUCTION TO DATA QL. (©) (Continued) @ @ Cla quantitative data. ify the FIVE (5) variables in Figure 1.1 as categorical data or (5 marks) (ii) Based on the data in Figure 1.1, elaborate the cleanse, structure, and enrich steps of the pre-processing stage in Excel. (6 marks) (iii) Identify the variable(s) with data that can be measured or calculated in Figure 1.1 and describe TWO (2) appropriate statistical measures or calculations that can be applied to analyse the data. (4 marks) (iv) Justify whether the data in Figure 1.1 could be stored in database ‘management system. (2 marks) (v) Figure 1.2 shows the same set of data (as Figure 1.1) stored in delimited text file, Differentiate between delimited text file and spreadsheet file by giving ONE (1) advantage and ONE (1) disadvantage for each file type. Bossa tees ox No,Order Date,Order ID,Customer ID, Amount,Delivery Date ~ 1,15/7/2021, 150701, KL1221,438.5,17/7/2021, 2,15/7/2021, 150701,KL1221,430.5,17/7/2621 3,16/7/2021, 160701, 385028, RN1900.20, 18/7/2021 14, 16/7/2021 , 160702, P32440, RN288.00, 18/7/2021 5,17/7/2021, 170701, PK699, 230.99, 19/7/2021 7 teveat 1% Widow US Figure 12 (4 marks) Identify the following as structured data or unstructured data. @ Gi) Gi) (iy) 1} wd (vii) Comments on student's performance given by academic advisor. (1 mark) Blog posts (1 mark) Email addresses (1 mark) User reviews and comments on products (1 mark) Payment transaction details (1 mark) Discussions on findings from a research. (1 mark) Learning materials for a course (mark) [Total : 50 marks] This final assessment paper consists of 3 questions on TI printed pages. 5 FHCT1014 INTRODUCTION TO DATA ANALYTICS Question 2 and Question 3: Answer ONE (1) question only. [Total : 50 marks] @ @ () A survey on cyberbullying was conducted among teenagers in Malaysia. Five hundred females and five hundred males participated in the survey. Table 2.1 shows the percentages of the female and male teenagers being cyberbullied on four common social media sites. Table 2.1 Facebook | Instagram ‘Twitter Snapchat |" Female 50% 26% 12% 1% Male 60% 35% 15% 2% (State the TWO (2) variables in Table 2.1 and identify the data type of the variables. (4 marks) (ii) Find the number of females and the number of males who were being cyberbullied on Facebook. Hence, calculate the ratio. (4 marks) ii) Suggest an appropriate type of chart to present the information shown in Table 2.1. Justify your answer. (2 marks) (iv) Construct the chart suggested in Q2. (a) (iii) with appropriate formatting on the chart. (4marks) (v) Derive TWO (2) insights on the cyberbullying issue from Table 2.1 (4 marks) Table 2.2 shows the number of new COVID-19 cases reported in Malaysia and the Malaysia’s stock market index (KLSE index) in certain 15 days. Table 2.2 ‘Number of New COVID-19 Cases | KLSE Index 6276 1532.63 6437 1548.31 5218 1544.71 5812 1559.68 5841 1555.71 5244 1564.76 47483 1574.02 4611 1572.24 6440. 1589.05, 5738. 1570.86 5150 1578.32 5419) 1581.37 4949) 1582.46 6849) 1575.16 S671 1579.90 This final assessment paper consists of 3 questions on 11 printed pages. 6 FHCT1014 INTRODUCTION TO DATA ANALYTICS Q2. (b) (Continued) © @ Gi) (ii) Gy) @) Suggest an appropriate type of chart to present the data for both variables in Table 2.2. Justify your answer. (2 marks) Find the correlation coefficient of both variables in Table 2.2. Describe how the correlation coefficient can be calculated using the Excel function. (2 marks) Interpret the correlation coefficient obtained in Q2. (b) (ii). (2 marks) Identify the Excel function involved in finding the interquartile range. Find the interquartile range of both variables in Table 2.2. (5 marks) Identify the outlier, if any, in the number of new COVID-19 cases from Table 2.2. Justify your answer. (2 marks) The frequency distribution of payment methods of 1000 transactions in a hypermarket is shown in Figure 2.1. @ (i) Payment Methods of 1000 Transactions in a Hypermarket 70 600 500 Frequency Be 6 8 cash Cowal Figure 2.1 Justify whether line chart is suitable to be used to present the information in Figure 2.1. (2 marks) Suggest the best chart type that shows proportionate distribution of the data in Figure 2.1. Justify your answer. (2 marks) This final assessment paper consists of 3 questions on IT printed pages. 7 FHCT1014 INTRODUCTION TO DATA ANALYTICS ntinued) @ Figure 2.2 shows a portion of data collected from a coffee house survey. PivotTable in Figure 2.3 shows the summary of certain part of the survey data. Fomibi fey asaya ate 3b: {hase Fomaotez8 Enpoyes asa eu Moretan Shas Seema po Figure 2.2 uw 1 a ilioceanve ao , 4 Figure 2.3 (@ Describe in SIX (6) steps how the PivotTable in Figure 2.3 is constructed in Excel. (6 marks) (ii) Describe in THREE (3) steps how the PivotTable in Figure 2.3 can be modified into Figure 2.4 in Excel. Sarre 2 = Fa Fa * =a Figure 2.4 (3 marks) (iii) Based on the data collected from the survey, derive THREE (3) useful insights that can be produced for the owner of the coffee house to understand the preferences of the customers. (6 marks) [Total : 50 marks} This final assessment paper consists of 3 questions on 11 printed pages. 8g FHCT1014 INTRODUCTION TO DATA ANALYTICS Q3. @)_ Figure 3.1 shows a portion of a dataset. Describe in SIX (6) steps how the PivotTable in Figure 3.2 is constructed in Excel. Rank Name Platform Year Genre Publisher 1 Wii Sports wi 2006 Sports Nintendo 2 Super Mario Bros. Nes 1985 Platform Nintendo 3 Mario kart Wil wit 2008 Racing Nintendo 4 Wii Sports Resort wai 2009 Sports Nintendo 5 Pokemon Red/Pokemon Blue ce 1996 Role-Playing Nintendo 6 Tetris ce 1989 Pure Nintendo 7 New Super Mario Bros. 2006 Platform Nintendo 8 Wii Play 2006 Mise Nintendo 9 New Super Mario Bros, Wi 2009 Platform Nintendo 10 Duck Hunt 1984 Shooter _ Nintendo 11 Nintendogs 2005 Simulation Nintendo 12 Marto Kart 0s 2005 Racing _Nintendo 13 Pokemon Gold/Pokemon Silver 1999 Role-Playing Nintendo 14 wi Fit 2007 Sports Nintendo 15 Wil Fit Plus 2009 Sports Nintendo 16 Kinect Adventures! 2010 Mise Microsoft Game Studios 17 Grand Theft Auto V 2013 Action Take-Two Interactive 18 Grand Theft Auto: San Andreas ps2 2004 Action Take-Two interactive 19 Super Mario World SNES 1990 Platform Nintendo 20 Brain Age: Train Your Brain in Minutes a Day _0S 2008 Mise Nintendo Figure 3.1 [Platform (al) | |Count of Name 11980-1989 1990-1999 2000-2009 2010-2020 Grand Total lAction 66 162 15851440, 3253) ladventure 2 97 634 5a3 127 Fighting 4 193, 442 197 83 Mise 8 116 1023 563 an Platform 33 2s 567 151 ark Puzzle 19 n 365 116 s7j| Racing 8 183 201 234 1226 Role-Playing 9 m 732 558 sari) |shooter 30 137 m0 395 1282) lsimutation 3 85 352 210 851/ Sports 23 3041407 570 2304) strateay 168 en lGrand Total 2051769 8208514516327] Figure 3.2 (6 marks) s final assessment paper consists of 3 questions on 11 printed pages. 9 FHCT1014 INTRODUCTION TO DATA ANALYTICS, An cust employee of a telecommunication company studied the waiting time at the omer service counters at one of their branch offices. The waiting time in minutes of counters A and B are shown in Table 3.1 Table 3.1 Counter A |1D43 |1D21 |3.3 [5.0 |16 [25 [28 |16 |23 | 12 Counter B | IN43 | IN21 | 1.7 [26 |21 |36 [31 |16 |20 [32 ©) Notes: a cy 1D43 and ID2I will be decided by the last four digits of your student ID. Assuming your student ID is 2009520; 1D43 and 1D21 will be 9.5 and 2.0. IN43 and IN2I will be decided by the last four digits of your Examination Index Number. Assuming your Examination Index Number is X01314HFS: IN43 and IN21 will be 1.3 and 1.4. Calculate the mean and standard deviation of the waiting time for counters A and B. (4 marks) Calculate the median and interquartile range of the waiting time for counters A and B. (4 marks) Construct a box plot for both counters A and B. Identify the outlier(s), if'any. (4 marks) (iv) Based on suitable statistical measure, identify the counter with the more consistent performance. (4 marks) Suggest the best type of chart for the data given in Table 3.2 and Table 3.3. Justify your answers. 0) Table 3.2 Daily Vaccine Doses Administered, source from JKJAV. Date Luly | 2duly | 3July | 4duly [ STuly | 6 fuly | 7iuh otal Doses Given _| 263012 | 236196 | 217807 | 206015 | 313761 | 340043 | 375842 G marks) (i) Table 3.3 Daily New and Recovery Cases for COVID-19, source from KKM Daie Luly [2July | 3July [ 4suly [3 Suly | 6suly | 7Jul New Cases 6983 |_6982 | 6658 | 6085 | 6387 | 7654 | 1097 Recovery Cases | 5580 | 6278 | s677 | s271_| 4532_| 4797 | 4863 G marks) This final assessment paper consists of 3 questions on 11 printed pages. 10 FHCT1014 INTRODUCTION TO DATA ANALYTICS, Q3. (Continued) (@ The test marks of students in a class are listed in Table 3.4 Table 3.4 Name Marks Name Marks Jin 76 ‘Abe [68 a 32 Tennie 82 66. Ethan’ 45 B Sith [si 82 Timin 85 6 Pandelela 88 90. Ayumi 7 75 Edward 98 12 Mikasa 70 77 Shinji 31 (@ By setting the number of bins to 5 or 6, construct a distribution table that consists of mark range, frequency, relative frequency, and percent frequency. (4 marks) (ii) Describe the steps used to calculate the frequency distribution in Q3. (@) (i) by using the FREQUENCY function in Excel. (4 marks) Gii) Construct a histogram based on the frequency distribution in Q3. (d) (i). (4 marks) (©) __ Identify the correct type of correlation for each situation given below. (i) As the level of water lowers in a fish tank, the volume of the habitat for the fish decreases. (1 mark) A student who has many absences has a decrease in grades. (1 mark) The faster a jet pilot flies, the higher the G-forces are. (1 mark) (iv) You are provided with 2 datasets: the amount of coffee consumed by each student, and CGPAs of students. As the amount of coffee consumed by a student increases, the CGPA of the student stays about the same. (mark) This final assessment paper consists of 3 questions on 11 printed pages, W FHCT1014 INTRODUCTION TO DATA ANALYTICS Q3. (Continued) (® Extract THREE (3) insights from the infographic in Figure 3.3 Number of examination candidates > How they fared Beginning 2016, new curticulum and formats are used for UPSR 301582 103,847 70,593 Figure 33 (6 marks) [Total : 50 marks] This final assessment paper consists of 3 questions on 11 printed pages.

You might also like