–
Compulsory
2
This course starts with an introduction to the phenomenon of big data. The learning in this course follows the framework of the Data Science process, commencing with a foundational exploration of data collection methods. After this, a thorough examination of data preprocessing techniques is undertaken, encompassing data cleaning and other essential preparatory steps. Moreover, a comprehensive Exploratory Data Analysis (EDA) is conducted, utilizing both graphical and non-graphical approaches. A review of fundamental Statistical concepts is incorporated, laying the groundwork for effective data processing. The course then transitions to an exploration of machine learning methods, with a particular emphasis on supervised learning techniques. Practical applications of machine learning algorithms are demonstrated through the implementation of predictive modeling using regression, specifically Multiple Linear Regression (MLR). To assess the efficacy of the developed models, the model performance is evaluated. The students are required to use appropriate programming languages related to data preprocessing, visualization, and data processing and analysis, i.e. Excel, Tableau dan Python.
Introduction to Data Analytics and Visualization
Review of Statistics
Data Acquisition and Collection
Data Preprocessing
Exploratory Data Analysis (EDA)
Feature Engineering
Introduction to Machine Learning
Supervised Method: Regression
Model Performance Evaluation