The balance between compute resources and parallelization involves finding the right trade-off between using more compute resources to speed up the training process and the overhead introduced by parallelization. To determine the optimal level of parallelization, one should consider factors such as the problem size, the complexity of the model, the available compute resources, and the communication overhead between compute resources.