Commit 162cfed7 authored by Joaquin Torres's avatar Joaquin Torres

Update README.md

parent 49ede9e2
...@@ -4,6 +4,15 @@ ...@@ -4,6 +4,15 @@
... ...
## Dealing with Class Imbalance ## Dealing with Class Imbalance
One of the primary challenges we encountered was a significant class imbalance, with a higher number of patients withdrawing from treatment compared to those staying.
To address this issue, we implemented four different training approaches or pipelines on both the pre-pandemic and post-pandemic training datasets:
1. **Using the Original Dataset**: The models were trained on the original datasets.
2. **Class Weight Adjustment**: The models were trained on the original datasets but were penalized more heavily for misclassifying the minority class.
3. **Oversampling**: Additional samples were generated for the minority class (patients staying) to balance the dataset.
4. **Undersampling**: Samples from the majority class (patients withdrawing) were reduced to achieve balance.
These approaches resulted in multiple training datasets. However, to ensure a fair comparison of the models' performance across different pipelines, we utilized a common test dataset for evaluation, irrespective of the training approach followed.
## Repository ## Repository
This repository is organized as follows: This repository is organized as follows:
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment