SOCR ≫ DSPA ≫ Topics ≫

1 Common plots

Use the Divorce data (Case Study 01) to generate appropriate visualization of histograms, density plots, pie charts, heatmaps, barplots, and paired correlation plots.

2 Trees and Graphs

Use the SOCR Resource Hierarchical data (JSON) to generate some tree/graph displays of the structural information.

3 Data EDA examples

  • Use SOCR Oil Gas Data to generate plots: 1) read data table, you may need fill the inconsistent table with NA; 2) data preprocessing: select variables, type convert, etc. 3) generate two plots: the first plots includes two subplots, consumption plots and production plots; the second figure includes three subplots, for fossil, nuclear and renewable, respectively. To draw the subplots, you should practice facet_grid(); 4) all figures have year as x axis; 5) the first figure includes three curves(fossil, nuclear and renewable) for each subplot; the second figure includes two curves(consumption and production) for each subplot.
  • Use SOCR Ozone Data to generate a correlation plot with the variables MTH_1, MTH_2, …, MTH_12. (Hint: you need obtain the correlation matrix first, then apply the corrplot package. Try to some alternative methods as well, circle, pie, mixed etc.)
  • Use SOCR CA Ozone Data to generate a 3D surface plot (Using variables Longitude, Latitude and O3).
  • Generate a sequence of random numbers from student t distribution. Draw the sample histogram and compare it with normal distribution. Try different degrees of freedom. What do you find? Does varying the seed and regenerating the student t sample change that conclusion?
  • Use SOCR Parkinson’s Big Meta data (only rows with time=0) to generate a heat plot. Set RowSideColors, ColSideColors and rainbow. [Hint: you may need to select columns, properly convert the data, and normalize it.]
  • Use SOCR 2011 US Jobs Ranking draw scatter plot Overall_Score vs. Average_Income(USD) include title and label the axes. Then try qplot for Overall_Score vs. Average_Income(USD): 1) fill with the Stress_Level; 2) Size the points according to Hiring_Potential; and 3) label using Job_Title.
  • Use SOCR Turkiye Student Evaluation Data to generate trees and graphs, using cutree() and select any \(k\) you prefer. (Use variables Q1 - Q28).

SOCR Resource Visitor number Dinov Email