Statistical Methods for Data Analytics - MTH622 Fall 2025
Course

Lessons
Here is the course outline:
1. Introduction to Empirical Research and Gretl SetupCourse overview, grading composition Types of Data (cross-sectional, time series, panel) Research questions and hypotheses Hands-on Gretl installation and navigation |
2. Descriptive Statistics and Graphical AnalysisMeasures of central tendency (mean, median, mode) Measures of dispersion (variance, standard deviation, range) Introduction to probability distributions (Normal, t, Chi square) Data visualization: histograms, boxplots, scatterplots Hands-on Gretl practice: computing descriptive statistics and generating plots |
3. Sampling and Confidence IntervalsTypes of sampling methods: simple random, stratified, cluster sampling Central Limit Theorem and its importance Construction and interpretation of confidence intervals for means and proportions Hands-on Gretl exercises: generating random samples and constructing confidence intervals. Assignment 1 released (20%) |
4. Hypothesis TestingFormulating null and alternative hypotheses One-tailed and two-tailed tests Type I and Type II errors Conducting t-tests and chi-square tests Gretl exercises: performing t-tests and chi-square tests |
5. Simple Linear Regression IIntroduction to the simple linear regression model Ordinary Least Squares (OLS) estimation Interpretation of slope and intercept coefficients Goodness of fit: R² and Adjusted R² Gretl exercises: running simple regression models Assignment 1 Due (20%) |
6. Simple Linear Regression IIHypothesis testing for regressioncoefficients Confidence intervals for coefficients Residual analysis and model diagnostics Gretl exercises: interpreting regression outputs and residuals |
7. Midterm exam (25%, in-class closed book)The midterm exam will cover all materials from Weeks 1–6 |
8. Multiple RegressionExtending regression to multiple independent variables Interpretation of multiple regression coefficients Detecting and addressing omitted variable bias Gretl exercises: running and interpreting multiple regressions |
9. Midterm break |
10. Functional Forms and SpecificationChoosing the correct functional form (linear, log-linear, etc.) Creating and interpreting interaction terms Consequences of model misspecification Gretl exercises: testing alternative functional forms Assignment 2 Released (20%) |
11. Violations of Classical AssumptionsMulticollinearity: detection (VIF) and remedies Heteroskedasticity: testing (Breusch-Pagan) and correction Introduction to serial correlation Gretl exercises: multicollinearity and heteroskedasticity tests |
12. Dummy Variables and Qualitative AnalysisIncorporating qualitative (categorical) variables Creating and interpreting dummy variables Interaction effects involving dummy variables Gretl exercises: building models with dummy variables Assignment 2 Due (20%) |
13. Introduction to Time Series DataUnderstanding time series characteristics: trend, seasonality, stationarity Introduction to autocorrelation Basics of ARIMA modeling (conceptual overview) Gretl exercises: plotting and analyzing time series data |
14. Content Completion, Revisions, and Q&AReserved week to finish any incomplete topics Comprehensive course revision session Addressing student questions and clarifying key concepts Interactive Q&A and revision exercises Mock test questions practice |
15. Final exam (35%, in-class closed book)The final exam will cover all materials from Weeks 1–14 |