[model:component] Add sampling techniques to address the imbalanced training dataset

Training currently applies no sampling technique (e.g., oversampling, undersampling, SMOTE) to correct the class imbalance in the training set. We could add an appropriate sampling step to balance the dataset so that bugs from all products and components are equally represented.
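
As a starting point for discussion, here is a minimal sketch of random oversampling, the simplest of the techniques listed above. It uses only the standard library; the `random_oversample` helper, the toy bug summaries, and the `Product::Component` label strings are all hypothetical and not taken from the existing training code.

```python
import random
from collections import Counter, defaultdict


def random_oversample(samples, labels, seed=0):
    """Duplicate minority-class samples at random (with replacement) until
    every class has as many samples as the largest class.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)

    # Group samples by their class label.
    by_label = defaultdict(list)
    for sample, label in zip(samples, labels):
        by_label[label].append(sample)

    if not by_label:
        return [], []

    # Every class is grown to the size of the largest one.
    target = max(len(group) for group in by_label.values())

    out_samples, out_labels = [], []
    for label, group in by_label.items():
        # Keep all original samples, then add random duplicates as needed.
        extra = [rng.choice(group) for _ in range(target - len(group))]
        for sample in group + extra:
            out_samples.append(sample)
            out_labels.append(label)
    return out_samples, out_labels


# Toy data (assumed): three bugs in one component, one bug in each of two others.
bugs = ["crash on start", "tab freezes", "slow scroll", "pdf blank", "font wrong"]
components = [
    "Core::DOM",
    "Core::DOM",
    "Core::DOM",
    "Firefox::PDF Viewer",
    "Core::Layout",
]

balanced_bugs, balanced_components = random_oversample(bugs, components)
print(Counter(balanced_components))
```

Any resampling should be applied to the training split only, after the train/test split, so that duplicated or synthetic samples do not leak into evaluation. If adding a dependency is acceptable, the imbalanced-learn package provides `RandomOverSampler`, `RandomUnderSampler`, and `SMOTE`, which might be a better fit than a hand-rolled helper.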