Assigned Project 2 Solution

Starting from:

~~$35~~

$29

Home

Final Policy Evaluation

ECR

`python3 MDP_policy.py -input Training_data.csv` will run the MDP program and get the best ECR value along with its policy.

IS

`python3 MDP_policy.py -input Training_data_is.csv` will run the MDP program and get the best IS value along with its policy.

Necessary packages

```

numpy

pandas

pymdptoolbox

seaborn

matplotlib

pyswarms

sklearn

```

All Data Used

`binned_{2,3,4,5}_reorder.csv`

All Logs

All the CSV files in `bin_{2,3,4,5}_history`

Feature Discretization

` python3 discretize.py` will discretize the columns. If you got the error of `ImportError: cannot import name 'KBinsDiscretizer'`, please upgrade your `sklearn` to the latest version (at least >= 0.20).

Feature Selection

Genetic Algorithm

`python3 repeat.py` will call `genetic.py` to run some iterations of genetic algorithm to select up to 8 features. The log is saved in `bin_NUM_history` folder where NUM is the number of bins from 2 to 5. When it runs, a CSV file named `temp.csv` will be created as input to `MDP_policy.py` to evaluate the selected features.

Please change the parameters in `repeat.py` to do feature selection using other number of bins and number of features.

PSO Algorithm

`python3 pso_pyswarms_control.py` calls `pso_pyswarms.py` to further run the algorithm with the features that were output in the previous run.

The parameters are clearly defined in `python3 pso_pyswarms_control.py`.

(However, we abandoned our use of this algorithm due to poor preliminary results).

Correlation and Heat Map

```

cd correlation

python3 correlation.py

```

will generate heat map and do correlation calculation