Professional Documents
Culture Documents
Yinxuan Dai CV
Yinxuan Dai CV
30 Thistlewaite Crescent, Scarborough, ON, M1S 3Y9 | Cell: 647-783-8561 | Email: yinxuandai@gmail.com
SKILL SET
• Skill set: SQL server, Python, R, SAS, Microsoft Office Excel, Data Modelling, Data visualization, Data reporting
• Data Science Packages: Pandas, Numpy, TensorFlow, Scikit-Learn, PySpark, Matplotlib, Seaborn, SciPy, Pytorch
• Platforms: Databricks, Apache Spark, Amazon Web Service, Anaconda, Tableau
EDUCATION
University of Toronto September 2021 – January 2023
Master of Engineering, emphasis in Analytics, CGPA:3.8/4.0
University of Toronto Scarborough September 2016 - August 2021
Honours Bachelor of Science, Major in Statistics and Economics for Management Studies
WORK EXPERIENCE
Business Intelligence and Quality Assurance January 2020 – January 2021
Canada Revenue Agency Toronto, ON
• Utilized SQL Aginity workbench and other ETL tools to process complex datasets with over 13 billion datapoints, translate
complex data into actionable insights and recommendations in a mainframe environment.
• Responsible for quality assurance testing of Departmental database-driven applications utilized by IT project team and
improved department communication issues by implementing a new project management solution.
• Automated reports for clients using filters, parameters, and calculated sets that automatically qualified their needs by Excel
worksheets and VBA, saving 6 hours of manual work each week.
• Worked in Cross-functional Analysis team that holding the weekly meeting and presentation to assist Income Tax team and
Pension Plan team to complete their tasks efficiently and change priorities reasonably.
• Collaborated with business teams on data quality issues and translate business requirements to technical architecture and
structure design of data repositories
• Built presentations, with excellent oral communication skills, via PowerPoint with both technical and non-technical
audiences and proposed innovative techniques for executing competitive analysis.
PROFESSIONAL CERTIFICATES
SAS Certified Advanced & Base Programmer for SAS 9 September 2019 – Current
SAS Global Certification Program
RELEVANT PROJECT
Process Data Analytics January-April 2022
• Using data mining techniques to visualize and analyze COVID-19 data from the European Centre for Disease Prevention
and Control, with over 30,000 data points, on Databricks.
• Implement time series analysis using the Rolling-Window Analysis of Time-Series Model in PySpark to predict client
behaviours by years.
• Established predictive models includes SEM, logistic regression and Bayesian models to predict infection rates and mortality
rates in following 60 days and made statistic inference with well-presented documentation and visualized data.
• Completed comprehensive written reports documenting all relevant details from the result and presented to the professor and
other graduate students during the lecture.