SOCR ≫ DSPA ≫ Topics ≫

1 Wrapper feature selection

2 Feature Selection in Parkinson’s Disease (PPMI Data)

Use the 06_PPMI_ClassificationValidationData_Short dataset setting ResearchGroup as class variable.

  • Delete irrelevant columns (e.g. X, FID_IID) and select only the PD and Control cases
  • Properly convert the variables types
  • Apply Boruta to train a model, try different parameters (e.g., try different pValue, maxRuns). What are the differences?
  • Summarize and visualize the results
  • Apply Random Feature Elimination (RFE) and tune the model size
  • Evaluate the Boruta model performance by comparing with REF
  • Report and compare the variables selected by both methods. How much overlap is there in the selected variables?

SOCR Resource Visitor number Dinov Email