Practice

Practices

Air Pollution Regression Evaluation

You are given a dataset containing information about air pollution levels in different cities. Your task is to evaluate the performance of a regression model that predicts air pollution levels based on various factors such as population, industrial activity, and traffic. The dataset is as follows:

Dataset: Air Pollution Levels

CityPopulation (thousands)Industrial Activity IndexTraffic IndexActual Pollution LevelPredicted Pollution Level
New York81730.870.7845.244.8
Los Angeles39810.650.9155.656.2
Chicago27160.560.7238.939.4
Houston23200.920.8461.360.9
Phoenix16840.740.6848.748.3
Philadelphia15840.580.7541.542.1
San Antonio15430.880.6253.253.6
San Diego13990.670.7847.146.7
Dallas13410.790.7949.849.5
San Jose10300.630.6542.342.7
Lagos140830.750.8258.458.0
Cairo100030.680.7349.149.6
Johannesburg95750.710.6747.847.4
Nairobi43970.530.7535.636.1
Casablanca33500.620.7443.943.5
Accra22980.490.6833.433.9

Questions:

  1. What is the dependent variable (target) in this regression problem?

    • Population (thousands)
    • Industrial Activity Index
    • Traffic Index
    • Actual Pollution Level
    • Predicted Pollution Level
  2. What are the independent variables (features) in this regression problem? (Select all that apply)

    • Population (thousands)
    • Industrial Activity Index
    • Traffic Index
    • Actual Pollution Level
    • Predicted Pollution Level
  3. Calculate the Mean Absolute Error (MAE) to evaluate the model's performance.

  4. Calculate the Mean Squared Error (MSE) to evaluate the model's performance.

  5. Calculate the Root Mean Squared Error (RMSE) to evaluate the model's performance.

  6. Interpret the MAE value in the context of this regression problem.

Submission

You are required to submit documentation for practice exercises over the course of the term. Each one will count for 1/10 of your practice grade, or 2% of your overall grade.

  • Practice exercises will be graded for completion not perfect correctness.
  • You MUST supply the answers and upload your analysis as a single file to Practice - Evaluation on Gradescope after the exercise to get the grade for this exercise.

Your log will count for credit as long as:

  • It is accessible to your instructor, and
  • It shows your own work.