Improving Imbalanced Machine Learning with Neighborhood-Informed Synthetic Sample Placement
Figures supplement
Figure 1. Calculation of the sum of the distances between a minority sample and its k-nearest minority neighbors (k=3)
Figure 2. Demonstration of the loneliness function
Figure 3. Finding local variation
Figure 4. Example of synthetic sample generation on a hypothetical 3-dimensional dataset
Figure 5. (a) Large leads
to most synthetic points generated around loneliest minority points
Figure 5. (b) leads
to synthetic points equally distributed for all existing minority points
Figure 5. (c) Larger k results in larger spread for synthetic points around their respective originating minority points
Figure 5. (d) Changing k
Figure 5. (e) Changing Lambda
Figure 8. Imbalanced Learning Framework using SANSA