scVI: effect of unbalanced number of cells in batches

I was wondering whether imbalance in the number of cells in different cell types, batches or datasets will significantly impact scVI training.

For example, subsetting a large atlas to have maximum 10k cells for each cell type and roughly the same number of cells as other datasets, I seem to obtain better results. Is this expected or normally handled by scVI loss functions ?