parent 912eb1ad31
commit 749ac24c13
  1. 24
      README.md

@ -36,18 +36,18 @@ FLASK_APP=tsneGrid.py flask run
```
# Reproducability of the Results #
The following instructions describe how to reach the results present in Figure 1 of the article. The aforementioned figure is connected with the Subsection 5.2 ("Use Case: Improving Diabetes Classification"). This is more or less the main use case described in the paper.
**Tip:** The "Reset Filters" button illustrated in Figure 1(h), resets all the applied interactions in case you made a mistake and you want to redraw something.
- Step 1: We press the "Execute new t-SNE analysis" button that is observable in Figure 1(a).
- Step 2: From the new pop-up tab named as "t-SNE Grid Search Resulting Diverse Projections [Sorting Projections According to Metric:" we select "Continuity (C)" quality metric.
- Step 3: From the five projections seen on the top (out of the 25), we choose the fourth projection with "Quality Metrics Average (QMA)" approximately 0.43 and rather high "Continuity (C)" (around 0.69). Next, we press the "Confirm" the projection and wait for the system to load all the views. The t-SNE overview is the same as in Figure 7(a) and the main projection can be seen in Figure 7(a). Also, Figures 7(b) and 7(d) show the exact same representations as you should have in front of you.
- Step 4: To reproduce Figure 7(e), you have to use the "Group Selection" option from the "Interaction Modes" illustrated in Figure 1(h). Afterward, you should carefully select the areas depicted in red color in Figure 7(c). To select those areas, you hold the left mouse and draw a shape with the lasso interaction provided by the tool. The first area is "High remaining cost", the second is "Low-density ”tip”", and the third "Low-density cluster". Please note that the results in Figure 7(e) start from 20 to 40 k-values of nearest neighbors.
- Step 5: For Figure 7(g), you use the "Dimension Correlation" option from the "Interaction Modes" illustrated in Figure 1(h). Then you use left click to draw a point and once more another left click for another point until you have drawn precisely the line seen in Figure 7(g) and Figure 7(c) in blue. Afterward, with the right mouse click you confirm the drawn shape and observe in the "Dimension Correlation" view depicted in Figure 1(j) the values of Figure 7(f). Finally, if you click the "Insulin" dimension you get Figure 7(g). The exact values of "Insulin" are now shown in the legend of Figure 1(i) instead of the "Density" and the "Remaining Cost" as it was before in Figure 7(c, bottom-right corner).
- Step 6: The next step is to choose the "Quality Metrics Average (QMA)" option instead of "Continuity (C)" with the dropdown selection seen in Figure 1(e). This will sort the projections based on the average value of all the provided quality metrics.
- Step 7: To receive the image shown in Figure 1, you choose with the "Group Selection" described in Step 4 the third cluster named as "Low-density cluster" (cf. Figure 7(c)). Then you should click the "Optimize Selection" button (see Figure 1(e), which is going to resort all the projections. Afterward, you click and choose the first projection out of the six shown in Figure 1(e). Now Figure 1 is loaded for you.
- Step 7: Finally, you use the "Dimension Correlation" option from the "Interaction Modes" illustrated in Figure 1(h) and draw a line (i.e., click to place two individual points) as shown in Figure 1(f). Next, press right click to confirm the drawn shape as in Step 5. If the line is drawn correctly, then you just go to the "Visual Mapping" panel (cf. Figure 1(i)) and you select "KNN" instead of "Distance" for the "Correlation measurement". After this modification, you will find the "K-value (KNN)" field which should be set to 34 (see Figure 1(i)).
The following instructions describe how to reach the results present in Figure 1 of the article. The aforementioned figure is connected with the Subsection 5.2 (*Use Case: Improving Diabetes Classification*). This is more or less the main use case described in the paper.
**Tip:** The *Reset Filters* button illustrated in Figure 1(h), resets all the applied interactions in case you made a mistake and you want to redraw something.
- Step 1: We press the *Execute new t-SNE analysis* button that is observable in Figure 1(a).
- Step 2: From the new pop-up tab named as *t-SNE Grid Search Resulting Diverse Projections [Sorting Projections According to Metric:* we select *Continuity (C)* quality metric.
- Step 3: From the five projections seen on the top (out of the 25), we choose the fourth projection with *Quality Metrics Average (QMA)* approximately 0.43 and rather high *Continuity (C)* (around 0.69). Next, we press the *Confirm* the projection and wait for the system to load all the views. The t-SNE overview is the same as in Figure 7(a) and the main projection can be seen in Figure 7(a). Also, Figures 7(b) and 7(d) show the exact same representations as you should have in front of you.
- Step 4: To reproduce Figure 7(e), you have to use the *Group Selection* option from the *Interaction Modes* illustrated in Figure 1(h). Afterward, you should carefully select the areas depicted in red color in Figure 7(c). To select those areas, you hold the left mouse and draw a shape with the lasso interaction provided by the tool. The first area is "High remaining cost", the second is "Low-density ”tip”", and the third "Low-density cluster". Please note that the results in Figure 7(e) start from 20 to 40 k-values of nearest neighbors.
- Step 5: For Figure 7(g), you use the *Dimension Correlation* option from the *Interaction Modes* illustrated in Figure 1(h). Then you use left click to draw a point and once more another left click for another point until you have drawn precisely the line seen in Figure 7(g) and Figure 7(c) in blue. Afterward, with the right mouse click you confirm the drawn shape and observe in the *Dimension Correlation* view depicted in Figure 1(j) the values of Figure 7(f). Finally, if you click the "Insulin" dimension you get Figure 7(g). The exact values of "Insulin" are now shown in the legend of Figure 1(i) instead of the *Density* and the *Remaining Cost* as it was before in Figure 7(c, bottom-right corner).
- Step 6: The next step is to choose the *Quality Metrics Average (QMA)* option instead of *Continuity (C)* with the dropdown selection seen in Figure 1(e). This will sort the projections based on the average value of all the provided quality metrics.
- Step 7: To receive the image shown in Figure 1, you choose with the *Group Selection* described in Step 4 the third cluster named as "Low-density cluster" (cf. Figure 7(c)). Then you should click the *Optimize Selection* button (see Figure 1(e), which is going to resort all the projections. Afterward, you click and choose the first projection out of the six shown in Figure 1(e). Now Figure 1 is loaded for you.
- Step 7: Finally, you use the *Dimension Correlation* option from the *Interaction Modes* illustrated in Figure 1(h) and draw a line (i.e., click to place two individual points) as shown in Figure 1(f). Next, press right click to confirm the drawn shape as in Step 5. If the line is drawn correctly, then you just go to the *Visual Mapping* panel (cf. Figure 1(i)) and you select *KNN* instead of *Distance* for the *Correlation measurement*. After this modification, you will find the *K-value (KNN)* field which should be set to 34 (see Figure 1(i)).
**Outcome:** The above process describes how you will be able to reproduce precisely the results presented in Figures 1 and 7 of the paper. Thank you for your time!

Loading…
Cancel
Save