Assignment - Student Performance Analysis

student-performance.png

Student Performance Analysis

This assignment is all about data cleaning and visualization using Microsoft Excel. The dataset for this assignment are student data from a hypothetical school, which consists of 7 Columns and contains information about gender, race, scores of students in different subjects, and more.

Click to open the project

TODOs

  • Clone the assignment repository using the link above
  • Look through the data - student_performance_data.csv.
  • Read the questions below to have an idea of what is required to do with the data.
  • Put all your charts/graphs in a single file as this will be submitted as part of the assignment on gradescope.
  • Once you have the answers to the questions below, goto assignment on Gradescope
    • Look for Assignment - Intro to DS
    • Attempt the questions
    • Submit once you're done

Questions

  1. How many UNIQUE data points or samples are in the dataset?
  2. What are the percentages based on gender 2.1 What is the percentage of Male student 2.2 What is the percentage of Female student
  3. What percentage of student "completed" the test preparation course
  4. What percentage of student had a "standard" Lunch
  5. What percentage of parent has MORE THAN "high school" level of education
  6. Which group in Race/ethnicity has the lowest percentage.
  7. Distribution of scores in subject What distribution score range has the highest frequency for Math score What distribution range has the highest frequency for Reading score What distribution range has the highest frequency for Writing score
  8. Who score higher in Math? Male or female?
  9. Which race/ethnicity scored the HIGHEST in Math?
  10. Mention ONE insight you derive from the data